| World Community Grid Forums
|
|
Thread Status: Active Total posts in this thread: 13
|
|
| Author |
|
|
ThreadRipper
Veteran Cruncher Sweden Joined: Apr 26, 2007 Post Count: 1324 Status: Offline Project Badges:
|
Hi,
I am getting only invalids so far on both of the MCM WUs that have been verified on my new A10-7850K @ stock. The PC is otherwise stable, with no indications of instability. The invalid WUs:
MCM1_ 0001867_ 1140_ 0--
MCM1_ 0001867_ 7855_ 0--
Perhaps the techs have more insight into whether this could be a more general architectural incompatibility of some kind?
----------------------------------------
Join The International Team: https://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=CK9RP1BKX1
AMD TR2990WX @ PBO, 64GB Quad 3200MHz 14-17-17-17-1T, RX6900XT @ Stock
AMD 3800X @ PBO
AMD 2700X @ 4GHz |
||
|
|
Randzo
Senior Cruncher Slovakia Joined: Jan 10, 2008 Post Count: 339 Status: Offline Project Badges:
|
Hello,
please post the error log here. |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7849 Status: Offline Project Badges:
|
I am seeing a rash of invalids in several different batches: 1848, 1851, 1853, 1856, 1864, 1869. I do not see anything valid in those batches on any of my machines, but saw the wingmen did get valids on them. I have not had any invalids since batch 1869 and am currently seeing batches up to 1898. Must have been something transient.
----------------------------------------
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
ThreadRipper
Veteran Cruncher Sweden Joined: Apr 26, 2007 Post Count: 1324 Status: Offline Project Badges:
|
Thanks for the replies!
Yes, there seems to be some sort of glitch, perhaps in some batches. Here are my error logs anyway:

Result Name: MCM1_ 0001867_ 1140_ 0--
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1867_1140
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 14
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 23726
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 23726
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379641140
[17:02:06] Initializing
[17:02:13] Running
[17:02:13] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[17:55:21] Initializing
[17:55:26] Running
[17:55:27] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[16:58:21] Initializing
[16:58:26] Running
[16:58:26] EvaluateFitnessOfStartingGeneSignatures 23726
[17:21:31] Writing final output
[17:21:32] Closing Output Stream
[17:21:32] Cleaning up
Result.out = 4818791.000000
Run complete, CPU time: 29989.035625
17:21:32 (4180): called boinc_finish
</stderr_txt>
]]>

and the second WU:

Result Name: MCM1_ 0001867_ 2428_ 1--
<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_2428.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1867_2428
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 14
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 23726
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 23726
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379642428
[17:02:07] Initializing
[17:02:14] Running
[17:02:14] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_2428.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[17:55:21] Initializing
[17:55:26] Running
[17:55:27] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_2428.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[16:58:21] Initializing
[16:58:26] Running
[16:58:26] EvaluateFitnessOfStartingGeneSignatures 23726
[17:24:11] Writing final output
[17:24:12] Closing Output Stream
[17:24:12] Cleaning up
Result.out = 4818973.000000
Run complete, CPU time: 30148.203125
17:24:12 (4204): called boinc_finish
</stderr_txt>
]]>

It also seems that I have now gotten some valid ones too, but in different batches... |
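For anyone who wants to check their own result logs the same way, here is a minimal Python sketch that counts how often a task was started again. It assumes that every `Commandline =` line in the stderr text marks one launch of the science application; that reading is inferred from the logs above and is not confirmed by the techs. The function name `count_restarts` is made up for this illustration.

```python
import re

# Assumption: each "Commandline =" line in stderr marks one launch of the
# science application, so every occurrence after the first is a restart.
START_MARKER = re.compile(r"Commandline\s*=")


def count_restarts(stderr_text: str) -> int:
    """Return how many times the task was launched again after its first start."""
    starts = len(START_MARKER.findall(stderr_text))
    return max(starts - 1, 0)


# Abbreviated excerpt of the first log above (settings lines left out).
sample = """\
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[17:02:06] Initializing
[17:02:13] Running
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[17:55:21] Initializing
[17:55:26] Running
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[16:58:21] Initializing
[17:21:32] Cleaning up
"""

print(count_restarts(sample))  # prints 2
```

Under that assumption both work units above were launched three times, i.e. restarted twice before finishing.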
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7849 Status: Offline Project Badges:
|
Update:
I have gotten some more invalids in subsequent batches, but as of yet I can see no pattern. After a period of very good stability, it seems some issues are resurfacing.
----------------------------------------
Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
armstrdj
Former World Community Grid Tech Joined: Oct 21, 2004 Post Count: 695 Status: Offline Project Badges:
|
We are looking into the issues and will report back once we know more. Based on reports in the forums, it does look like an issue with restoring from a checkpoint for a certain type of work unit we are running now. A temporary workaround until we have a permanent solution in place would be to turn on the setting to leave the application in memory when suspended, to reduce the likelihood of a restart.
Thanks, armstrdj |
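The setting mentioned above can normally be switched on in the BOINC Manager's computing preferences or in the device profile on the website. For machines managed by script, the following Python sketch builds a local override fragment instead. It assumes the BOINC client reads a `global_prefs_override.xml` file from its data directory with a `<global_preferences>` root and a `<leave_apps_in_memory>` element; please check this against the BOINC documentation for your client version. The helper name `build_override` is hypothetical.

```python
import xml.etree.ElementTree as ET


def build_override(leave_in_memory: bool = True) -> str:
    """Build a minimal preferences override as an XML string.

    Assumption: the client accepts <leave_apps_in_memory> with 1 (on) or 0 (off)
    inside a <global_preferences> root element.
    """
    root = ET.Element("global_preferences")
    flag = ET.SubElement(root, "leave_apps_in_memory")
    flag.text = "1" if leave_in_memory else "0"
    return ET.tostring(root, encoding="unicode")


# Print the fragment rather than writing it, so nothing on disk is touched.
print(build_override())
```

Note that saving this as a brand-new override file would replace any other local overrides already in place, so merging the element into an existing file is the safer route. The client also has to re-read its local preferences (or be restarted) before the change takes effect.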
||
|
|
David_L6
Senior Cruncher USA Joined: Aug 24, 2006 Post Count: 296 Status: Offline Project Badges:
|
Quoting armstrdj:
A temporary workaround until we have a permanent solution in place would be to turn on the setting to leave the application in memory when suspended to reduce the likelihood of a restart.
Thanks, armstrdj

I always have mine set like that on all computers. Over the last week or so I've been getting almost all invalids on MCM on one computer. It has a Q6700 and Windows XP (32-bit). Yesterday it ran 6 or 8 FAAH work units and all of those were valid. I have two other computers running MCM and they aren't getting any invalids. They also have Q6700 CPUs, but they are running Vista (one 32-bit and the other 64-bit). |
||
|
|
ThreadRipper
Veteran Cruncher Sweden Joined: Apr 26, 2007 Post Count: 1324 Status: Offline Project Badges:
|
Quoting armstrdj:
We are looking into the issues and will report back once we know more. Based on reports in the forums it does look like an issue with restoring from a checkpoint for a certain type of workunits we are running now. A temporary workaround until we have a permanent solution in place would be to turn on the setting to leave the application in memory when suspended to reduce the likelihood of a restart.
Thanks, armstrdj

Thanks for the info! And for all the replies from everyone!
I also run with "leave application in memory while suspended", but I know that there was a computer power cycle between the start and finish of those invalid WUs. I have now returned mostly valid results with that 7850K as well, so that's good.
Yes, it seems it does have something to do with checkpoints, since I have not had a single invalid returned from my main 3930K cruncher, which operates 24/7, so there should be no (or very few) checkpoint restarts needed there.
Alright, crunching on... |
||
|
|
l_mckeon
Senior Cruncher Joined: Oct 20, 2007 Post Count: 439 Status: Offline Project Badges:
|
I haven't checked for invalids, but I notice MCM seems to lose all work since the last checkpoint if you suspend and then re-enable a task (according to BoincTasks).
I do leave suspended tasks in memory. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I just checked my rigs and I've got a fair few invalids as well. Here's my list:
MCM1_ 0001896_ 1623_ 1-- Intel-i7-2700K Invalid 2/2/14 22:21:44 2/5/14 00:00:21 5.26 / 5.31 136.9 / 59.3
MCM1_ 0001896_ 1086_ 0-- Intel-i7-2700K Invalid 2/2/14 22:21:44 2/4/14 23:11:37 5.28 / 5.34 136.6 / 75.6
MCM1_ 0001894_ 5145_ 0-- Black-AMD-X6 Invalid 2/2/14 20:33:13 2/4/14 02:32:45 7.12 / 7.17 125.1 / 67.4
MCM1_ 0001894_ 4990_ 0-- Black-AMD-X6 Invalid 2/2/14 20:33:13 2/4/14 03:01:32 7.42 / 7.48 133.4 / 73.5
MCM1_ 0001877_ 5417_ 1-- Intel-i7-2700K Invalid 2/2/14 01:28:23 2/4/14 01:34:01 6.34 / 6.40 170.6 / 107.4
MCM1_ 0001877_ 4972_ 0-- Intel-i7-2700K Invalid 2/2/14 01:28:23 2/4/14 01:34:01 6.34 / 6.40 170.7 / 91.8
MCM1_ 0001875_ 5353_ 1-- Black-AMD-X6 Invalid 2/1/14 23:06:15 2/3/14 03:36:31 8.83 / 8.89 208.2 / 111.8
MCM1_ 0001872_ 3272_ 0-- Intel-i7-2700K Invalid 2/1/14 20:32:06 2/3/14 20:39:13 6.51 / 6.76 198.6 / 102.6
MCM1_ 0001872_ 2918_ 0-- Intel-i7-2700K Invalid 2/1/14 20:32:06 2/3/14 20:39:13 6.54 / 6.77 198.7 / 99.5
MCM1_ 0001872_ 2777_ 1-- Intel-i7-2700K Invalid 2/1/14 20:32:06 2/3/14 02:12:21 6.51 / 6.57 216.2 / 103.9
The only time these might get restarted is when I restart in the mornings. |
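Since nobody has spotted a pattern yet, a quick tally by device and by batch might help when comparing lists like the one above. This is only a rough Python sketch: the row layout (result name, device name, status, then dates and numbers) is inferred from the pasted list, and the names `ROW` and `tally_invalids` are invented for the example.

```python
import re
from collections import Counter

# Assumed row shape: "MCM1_ <batch>_ <wu>_ <replica>-- <device> <status> ..."
ROW = re.compile(r"MCM1_ (\d+)_ (\d+)_ (\d+)--\s+(\S+)\s+(\w+)")


def tally_invalids(text: str):
    """Count invalid results per device name and per batch number."""
    by_device = Counter()
    by_batch = Counter()
    for batch, _wu, _replica, device, status in ROW.findall(text):
        if status == "Invalid":
            by_device[device] += 1
            by_batch[int(batch)] += 1
    return by_device, by_batch


# Three rows copied from the list above.
sample = """\
MCM1_ 0001896_ 1623_ 1-- Intel-i7-2700K Invalid 2/2/14 22:21:44 2/5/14 00:00:21 5.26 / 5.31 136.9 / 59.3
MCM1_ 0001896_ 1086_ 0-- Intel-i7-2700K Invalid 2/2/14 22:21:44 2/4/14 23:11:37 5.28 / 5.34 136.6 / 75.6
MCM1_ 0001894_ 5145_ 0-- Black-AMD-X6 Invalid 2/2/14 20:33:13 2/4/14 02:32:45 7.12 / 7.17 125.1 / 67.4
"""

devices, batches = tally_invalids(sample)
print(dict(devices))
print(dict(batches))
```

Pasting a full results list into `sample` gives the per-device and per-batch counts, which can then be compared against which machines were restarted.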
||
|
|
|