Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 120
|
![]() |
Author |
|
andreic
Advanced Cruncher Canada Joined: Nov 19, 2004 Post Count: 100 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Quick question, is there a way to find out how many WUs are left in the queue?
|
||
|
mikefinn
Cruncher USA Joined: Apr 27, 2007 Post Count: 43 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I was able to restart 3 work units. One of them had only 53 seconds left to completion when I interrupted it. After restarting and picking up from the checkpoint, that one now had 7 minutes of time left. Each work unit has been returned and is in Pending Verification status. I looked at the wingman and in each case, they did not have a restart work unit. I then compared the result.out of my work units to the corresponding wingmen and my restarted units did not match the result.out of the wingmen. I am waiting to see what the pending resend produces. I expect mine to be invalid since this is what was happening to some of my regular MCM units.
----------------------------------------Update: All have gone invalid. In two of them, the 3rd wingman ran without restart and matched the result.out of the 1st wingman. In the remaining one, a 3rd wingman had a restart and his result.out didn't match the two original replications. A 4th wingman ran without restart and the wingmen with restarts were marked invalid. Update 2: I looked at some of my valid units. I found one that was a restart but the result.out matched the value of the wingman who did not have a restart and the quorum was satisfied without another replication being sent. [Edit 2 times, last edit by mikefinn at Dec 11, 2013 11:43:51 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
andreic,
----------------------------------------No, not at WCG. [Edit 1 times, last edit by Former Member at Dec 10, 2013 7:36:21 AM] |
||
|
Mumak
Senior Cruncher Joined: Dec 7, 2012 Post Count: 477 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Still a few Pending Verification:
----------------------------------------Result Name: BETA_ MCM1_ 0000144_ 5839_ 0-- <core_client_version>7.2.33</core_client_version> <![CDATA[ <stderr_txt> Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_intelx86 -SettingsFile MCM1_0000144_5839.txt -DatabaseFile dataset-17_72_SDG_v1.txt Settings File DateOfDesign = 11/08/2013 Designer = PMCC_OCI WorkOrderID = 0000144_5839 DatasetID = 17_72_SDG_v1 NumberOfGenesInStartingSignature = 16 NumberOfGenesInSignatureMin = 10 NumberOfGenesInSignatureMax = 20 GroupVectorValues = {A}{B}{C}{D}{E}{F} ExplicitStartingGeneSignatures = A B D F StartingGeneSignatureAlgorithm = randomFixedLengthSearch SearchAlgorithmNumberToCreate = 7738 SearchAlgorithmSequentialStartPosition = 5 RunPermutationAlgorithm = 0 PermutationGroups = A PermutationGroupsForReplacement = G PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy PermutationsNumIterations = 7738 OptimizationAlgorithmFrequency = 0 0 1 FBeta = 1.5 SimAnnealIMax = 20000 SimAnnealAlpha = 0.9996 NReps = 10 TrainFrac = 0.7 NFolds = 10 VMethod = NFCV ModelType = SVM FitnessFn = 0 MinFitness = -1 SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0" SvmLearnLimit = 500000 RSeed = 345839 [04:08:09] Initializing wcg_learn_limit = 500000 [04:08:19] Running [04:08:20] EvaluateFitnessOfStartingGeneSignatures 7738 Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_intelx86 -SettingsFile MCM1_0000144_5839.txt -DatabaseFile dataset-17_72_SDG_v1.txt [08:19:19] Initializing wcg_learn_limit = 500000 [08:19:30] Running [08:19:30] EvaluateFitnessOfStartingGeneSignatures 7738 [08:42:15] Writing final output [08:42:15] Closing Output Stream [08:42:15] Cleaning up Result.out = 1627390.000000 Run complete, CPU time: 15169.384265 08:42:15 (1960): called boinc_finish ------------------------------------------------------------------------------ Result Name: BETA_ MCM1_ 0000144_ 0737_ 0-- <core_client_version>7.2.33</core_client_version> <![CDATA[ <stderr_txt> Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_x86_64 -SettingsFile MCM1_0000144_0737.txt -DatabaseFile dataset-17_72_SDG_v1.txt Settings File DateOfDesign = 11/08/2013 Designer = PMCC_OCI WorkOrderID = 0000144_0737 DatasetID = 17_72_SDG_v1 NumberOfGenesInStartingSignature = 16 NumberOfGenesInSignatureMin = 10 NumberOfGenesInSignatureMax = 20 GroupVectorValues = {A}{B}{C}{D}{E}{F} ExplicitStartingGeneSignatures = A B D F StartingGeneSignatureAlgorithm = randomFixedLengthSearch SearchAlgorithmNumberToCreate = 7738 SearchAlgorithmSequentialStartPosition = 5 RunPermutationAlgorithm = 0 PermutationGroups = A PermutationGroupsForReplacement = G PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy PermutationsNumIterations = 7738 OptimizationAlgorithmFrequency = 0 0 1 FBeta = 1.5 SimAnnealIMax = 20000 SimAnnealAlpha = 0.9996 NReps = 10 TrainFrac = 0.7 NFolds = 10 VMethod = NFCV ModelType = SVM FitnessFn = 0 MinFitness = -1 SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0" SvmLearnLimit = 500000 RSeed = 340737 [04:56:57] Initializing wcg_learn_limit = 500000 [04:57:05] Running [04:57:05] EvaluateFitnessOfStartingGeneSignatures 7738 Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_x86_64 -SettingsFile MCM1_0000144_0737.txt -DatabaseFile dataset-17_72_SDG_v1.txt [08:19:28] Initializing wcg_learn_limit = 500000 [08:19:39] Running [08:19:39] EvaluateFitnessOfStartingGeneSignatures 7738 [09:10:01] Writing final output [09:10:01] Closing Output Stream [09:10:01] Cleaning up Result.out = 1627162.000000 Run complete, CPU time: 14362.873048 09:10:01 (3464): called boinc_finish ------------------------------------------------------------------------------ Result Name: BETA_ MCM1_ 0000144_ 0344_ 0-- <core_client_version>7.2.33</core_client_version> <![CDATA[ <stderr_txt> Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.27_windows_intelx86 -SettingsFile MCM1_0000144_0344.txt -DatabaseFile dataset-17_72_SDG_v1.txt Settings File DateOfDesign = 11/08/2013 Designer = PMCC_OCI WorkOrderID = 0000144_0344 DatasetID = 17_72_SDG_v1 NumberOfGenesInStartingSignature = 16 NumberOfGenesInSignatureMin = 10 NumberOfGenesInSignatureMax = 20 GroupVectorValues = {A}{B}{C}{D}{E}{F} ExplicitStartingGeneSignatures = A B D F StartingGeneSignatureAlgorithm = randomFixedLengthSearch SearchAlgorithmNumberToCreate = 7738 SearchAlgorithmSequentialStartPosition = 5 RunPermutationAlgorithm = 0 PermutationGroups = A PermutationGroupsForReplacement = G PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy PermutationsNumIterations = 7738 OptimizationAlgorithmFrequency = 0 0 1 FBeta = 1.5 SimAnnealIMax = 20000 SimAnnealAlpha = 0.9996 NReps = 10 TrainFrac = 0.7 NFolds = 10 VMethod = NFCV ModelType = SVM FitnessFn = 0 MinFitness = -1 SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0" SvmLearnLimit = 500000 RSeed = 340344 [03:15:15] Initializing wcg_learn_limit = 500000 [03:15:22] Running [03:15:22] EvaluateFitnessOfStartingGeneSignatures 7738 [06:15:00] Writing final output [06:15:00] Closing Output Stream [06:15:00] Cleaning up Result.out = 1626882.000000 Run complete, CPU time: 10601.609375 06:15:00 (304): called boinc_finish ------------------------------------------------------------------------------ ![]() |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Well, armstrdj did write they were not expecting to catch all PVAL > PVer > Invalid tracking problems and a next beta was projected for later this week based on findings of this round.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I have two that validated even though they restarted and the wingmen didn't. The rest are all PVal apart from one in PVer where mine restarted and the wingman didn't, but the _2 copy is "Waiting to be sent " since 9:41.
Clearly none of the robots has decided it needs to wake a tech ![]() |
||
|
Pedro Manuel Silva
Cruncher Portugal Joined: Aug 12, 2008 Post Count: 2 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Got one that came in and is running high priority as it appears we are back to a one day deadline (no it is not a resend), they also appear to be significantly longer than the first few I received. I have also a few of those workunits, 24 hour deadline, and estimated completion time also around 24 hours (sometimes even longer). Units: BETA_MCM1_0000218_0003_1 BETA_MCM1_0000218_0561_0 BETA_MCM1_0000218_0556_1 BETA_MCM1_0000218_1174_0 BETA_MCM1_0000218_8139_1 BETA_MCM1_0000218_8915_1 BETA_MCM1_0000218_5443_0 ![]() |
||
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
All seems fine here too. Admittedly, I only tested the "suspend/resume" with LAIM off on one machine, but it worked just fine. I also suspended/resumed other Betas on 2 other machines (but with LAIM on) and that, of course, worked fine too. Still have some WUs to crunch on, then I hope the "project has no work" problem will also be fixed by the time I finish my cache.
----------------------------------------Cheers! Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() ![]() |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The 0000089 and 0000144 are the fairly normal ones with relatively even run times, the 0000218 are the biggies. Got 4 of those chewing.
----------------------------------------Since armstrdj has several times commented on the many varieties of jobs, maybe he could expand on that a little by what he means with that. Job names indicate nothing at all to me, just batch and task number, in a batch. edit: bad English [to me]. [Edit 1 times, last edit by Former Member at Dec 10, 2013 10:36:10 AM] |
||
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
The 0000089 and 0000144 where the fairly normal ones with relatively even run times, the 0000218 are the biggies. Got 4 of those chewing. I see I too got a couple of the 218s on my PC, but will have to wait until I get back home to see just how big they really are. Hopefully, I will be able to test them too. Cheers! Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() ![]() |
||
|
|
![]() |