Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: Discovering Dengue Drugs - Together Phase 2 BETA |
No member browsing this thread |
Thread Status: Locked Total posts in this thread: 192
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Once credit is given, I hope you do get credit.
I am still waiting on the explanation on the disappearing monster wu's from the result pages without any credit given so far.. Until now it's enjoy the silence.. |
||
|
Diana G.
Master Cruncher Joined: Apr 6, 2005 Post Count: 3003 Status: Offline Project Badges: |
EEEK my wu: BETA_erag_a059_ps0000_6 is stuck in a loop.
----------------------------------------A) It had 5 mins left at 98. something %. I was like woohooo! B) THEN it jumped to 2+ hrs left and then it stopped advancing. C) so I suspended the BOINC manager and un-suspended. Didn't help. D) So re-booted machine. The % dropped back to 96.00% and the to completion time jump to 4+ hrs. E) So I kept suspending the wu and un-suspending it for a few times. Didn't help. F) Keeps looping over and over without any success. What do you advise I do? EDIT: Okay, the CPU time has stopped looping, and it is increasing, but the progress is not moving at all. [Edit 1 times, last edit by Diana G. at Oct 20, 2009 6:46:13 PM] |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Daina G., first let me fetch the kevlar ;>)
----------------------------------------Looks like you've done all that a hung job could kick back into life. Presume the looping resumes at 98% not at 96% right on resume. What are the memory usages like? Does the quorum partner show completion? What does that log result say? Suggest for the moment to suspend the job and if new work is needed to briefly un-suspend.
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All! |
||
|
Diana G.
Master Cruncher Joined: Apr 6, 2005 Post Count: 3003 Status: Offline Project Badges: |
Here is the status:
----------------------------------------Workunit Status Project Name: Beta - Discovering Dengue Drugs - Together - Phase 2 Created: 10/6/09 Name: BETA_erag_a059_ps0000 Minimum Quorum: 2 Replication: 6 Result Name App Version Number Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit BETA_ erag_ a059_ ps0000_ 7-- 607 Too Late 10/18/09 08:41:10 10/20/09 03:36:43 28.13 600.9 / 0.0 BETA_ erag_ a059_ ps0000_ 6-- - No Reply 10/15/09 23:07:56 10/18/09 08:43:56 0.00 0.0 / 0.0 <<<<------Mine BETA_ erag_ a059_ ps0000_ 3-- 607 User Aborted 10/13/09 13:34:45 10/14/09 06:02:09 9.14 195.0 / 0.0 BETA_ erag_ a059_ ps0000_ 4-- 607 Too Late 10/13/09 13:34:26 10/20/09 11:14:47 28.73 603.9 / 0.0 BETA_ erag_ a059_ ps0000_ 5-- 607 Too Late 10/13/09 13:34:23 10/16/09 02:08:32 47.51 720.8 / 0.0 BETA_ erag_ a059_ ps0000_ 0-- - No Reply 10/6/09 20:13:15 10/12/09 20:13:15 0.00 0.0 / 0.0 BETA_ erag_ a059_ ps0000_ 1-- 607 Too Late 10/6/09 20:13:10 10/12/09 22:30:37 52.87 831.5 / 0.0 BETA_ erag_ a059_ ps0000_ 2-- 607 Too Late 10/6/09 20:13:02 10/17/09 11:58:15 57.90 996.0 / 0.0 |
||
|
Diana G.
Master Cruncher Joined: Apr 6, 2005 Post Count: 3003 Status: Offline Project Badges: |
Nope, it is looping still. So does that mean it will never get uploaded? *cries*
---------------------------------------- |
||
|
Diana G.
Master Cruncher Joined: Apr 6, 2005 Post Count: 3003 Status: Offline Project Badges: |
It's okay. I can live with disappointments ^_^ There will be other wu's, so always have hope :D
----------------------------------------Thanks!! |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Whilst the job is suspended visit the applicable job slot ...\slots\.. and see what the job log says. The file is stderr.txt.
----------------------------------------What is the current memory use indicating for this A type job? Yes, I fear that a user abort is looming, sadly so, with 5 jobs in Too Late state suggesting that the job is bad else validation would have already succeeded.
WCG Global & Research > Make Proposal Help: Start Here!
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Oct 20, 2009 7:17:36 PM] |
||
|
Diana G.
Master Cruncher Joined: Apr 6, 2005 Post Count: 3003 Status: Offline Project Badges: |
The stderr.txt shows:
----------------------------------------INFO: No state to restore. Start from the beginning. wcgStepsDone = 100 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.000400 wcgStepsDone = 200 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.000800 wcgStepsDone = 300 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.001200 wcgStepsDone = 400 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.001600 wcgStepsDone = 500 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.002000 wcgStepsDone = 600 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.002400 wcgStepsDone = 700 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.002800 wcgStepsDone = 800 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.003200 wcgStepsDone = 900 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.003600 wcgStepsDone = 1000 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.004000 wcgStepsDone = 1100 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.004400 wcgStepsDone = 1200 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.004800 wcgStepsDone = 1300 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.005200 wcgStepsDone = 1400 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.005600 wcgStepsDone = 1500 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.006000 wcgStepsDone = 1600 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.006400 wcgStepsDone = 1700 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.006800 wcgStepsDone = 1800 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.007200 wcgStepsDone = 1900 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.007600 wcgStepsDone = 2000 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.008000 wcgStepsDone = 2100 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.008400 wcgStepsDone = 2200 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.008800 wcgStepsDone = 2300 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.009200 wcgStepsDone = 2400 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.009600 wcgStepsDone = 2500 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.010000 wcgStepsDone = 2600 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.010400 wcgStepsDone = 2700 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.010800 wcgStepsDone = 2800 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.011200 wcgStepsDone = 2900 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.011600 wcgStepsDone = 3000 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.012000 wcgStepsDone = 3100 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.012400 wcgStepsDone = 3200 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.012800 wcgStepsDone = 3300 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.013200 wcgStepsDone = 3400 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.013600 wcgStepsDone = 3500 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.014000 wcgStepsDone = 3600 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.014400 wcgStepsDone = 3700 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.014800 wcgStepsDone = 3800 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.015200 wcgStepsDone = 3900 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.015600 wcgStepsDone = 4000 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.016000 wcgStepsDone = 4100 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.016400 wcgStepsDone = 4200 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.016800 wcgStepsDone = 4300 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.017200 wcgStepsDone = 4400 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.017600 wcgStepsDone = 4500 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.018000 wcgStepsDone = 4600 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.018400 wcgStepsDone = 4700 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.018800 wcgStepsDone = 4800 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.019200 wcgStepsDone = 4900 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.019600 wcgStepsDone = 5000 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.020000 . . <SNIP> . . wcgStepsDone = 800 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.963200 wcgStepsDone = 900 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.963600 wcgStepsDone = 1000 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.964000 wcgStepsDone = 1100 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.964400 wcgStepsDone = 1200 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.964800 wcgStepsDone = 1300 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.965200 wcgStepsDone = 1400 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.965600 wcgStepsDone = 1500 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.966000 wcgStepsDone = 1600 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.966400 wcgStepsDone = 1700 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.966800 wcgStepsDone = 1800 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.967200 wcgStepsDone = 1900 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.967600 wcgStepsDone = 2000 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.968000 wcgStepsDone = 2100 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.968400 wcgStepsDone = 2200 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.968800 wcgStepsDone = 2300 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.969200 wcgStepsDone = 2400 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.969600 wcgStepsDone = 2500 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.970000 wcgStepsDone = 2600 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.970400 wcgStepsDone = 2700 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.970800 wcgStepsDone = 2800 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.971200 wcgStepsDone = 2900 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.971600 wcgStepsDone = 3000 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.972000 wcgStepsDone = 3100 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.972400 wcgStepsDone = 3200 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.972800 wcgStepsDone = 3300 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.973200 wcgStepsDone = 3400 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.973600 wcgStepsDone = 0 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.960000 wcgStepsDone = 0 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.960000 wcgStepsDone = 0 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.960000 wcgStepsDone = 0 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.960000 wcgStepsDone = 0 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.960000 wcgStepsDone = 0 wcgSteps1 = 5000 wcgCyclesDone = 48 wcgCycles = 50 pctComplete = 0.960000 Not sure about memory useage. |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
If you look in Task Manager you can see how much RAM / VM is used. The up and coming 6.10 clients allows to see that through a job properties view, one of the more convenient features of that release.
----------------------------------------The .960000 repetition maybe a reflection of the multiple restarts, but unless anyone else knows how to get it past the loop point, the plug is likely needing pulling. First look in a log of any of the Too Late results if there are simularities or true finishing to 1.000000
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All! |
||
|
Diana G.
Master Cruncher Joined: Apr 6, 2005 Post Count: 3003 Status: Offline Project Badges: |
They all had true finishes, Sekerob.
----------------------------------------My memory is outrageous with this wu now. the PF soars to 2.10 GB from 810 MB when I unsuspended it. I can abort it. |
||
|
|