World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: DDD2 Type B work units going out. |
Thread Status: Active Total posts in this thread: 369
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges: |
Good morning, it appears we have some work ahead of us. As many of you have pointed out, there appears to be a checkpointing bug on these work units. Please note that if you did encounter an invalid, you will still get normal credit for those during beta.
Thanks, -Uplinger |
||
|
I need a bath
Senior Cruncher USA Joined: Apr 12, 2007 Post Count: 347 Status: Offline Project Badges: |
uplinger wrote: "Good morning, it appears we have some work ahead of us. As many of you have pointed out, there appears to be a checkpointing bug on these work units. Please note that if you did encounter an invalid, you will still get normal credit for those during beta. Thanks, -Uplinger"
Yep. Same thing happened to me. The only invalid one was when I had to hibernate my laptop. I suspended activity before I did so, but apparently that wasn't good enough. The WU seemed to proceed normally but was inconclusive and then invalid when the third result came in. |
||
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
Thanks. That explains why all my betas that were swapping out with other projects' WUs were invalid. |
||
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
I have two WUs running on a Pentium D that have run 12+ hours and are above 100% completion:
BETA_erlc_a134_pe0000 @ 135.500%
BETA_erlc_a107_pe0000 @ 108.050%
Is this an expected condition? |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges: |
No, it is not normal for them to be above 100%. Please try to find the slots directory they're in and copy the result.out file and the stderr.txt file to your desktop. The stderr.txt file may be a link to the projects/www.worldcommunitygrid.org folder. These are in your data folder.
-Uplinger |
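For anyone who would rather script the copy than hunt through the slots by hand, here is a minimal Python sketch. It assumes the data folder contains a `slots` directory with one numbered subdirectory per running task; the function name `collect_slot_files` and the `slotN_` prefix on the copies are made up for illustration, and the data folder location differs per install, so pass in your own.

```python
import shutil
from pathlib import Path


def collect_slot_files(data_dir, dest_dir, names=("result.out", "stderr.txt")):
    """Copy the named files from every slots/<n>/ directory into dest_dir.

    Hypothetical helper: assumes <data_dir>/slots/<n>/ holds the task files.
    Returns the list of paths that were written.
    """
    slots = Path(data_dir) / "slots"
    dest_dir = Path(dest_dir)
    if not slots.is_dir():
        return []  # wrong data folder, or no slots directory present
    dest_dir.mkdir(parents=True, exist_ok=True)

    copied = []
    for slot in sorted(slots.iterdir()):
        if not slot.is_dir():
            continue
        for name in names:
            src = slot / name
            if src.exists():
                # Prefix with the slot name so files from different
                # slots do not overwrite each other.
                dst = dest_dir / f"slot{slot.name}_{name}"
                shutil.copyfile(src, dst)  # copies content, follows symlinks
                copied.append(dst)
    return copied
```

Example call, with a placeholder path standing in for your own data folder: `collect_slot_files("/path/to/data/folder", Path.home() / "Desktop")`. If the copied stderr.txt turns out to be only a pointer into projects/www.worldcommunitygrid.org, copy the file it names from that folder instead.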
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1313 Status: Offline Project Badges: |
Just to be sure, I checked 4 tasks on a single machine (Linux64):
1 valid (no restart during the task).
2 invalid (one with 2 restarts, the other with 1 restart during the job). "Leave in memory" was on.
1 pending validation (no restart during the task; waiting for wingmen). |
||
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
uplinger wrote: "No, it is not normal for them to be above 100%. Please try to find the slots directory they're in and copy the result.out file and the stderr.txt file to your desktop. The stderr.txt file may be a link to the projects/www.worldcommunitygrid.org folder. These are in your data folder. -Uplinger"
It is odd, but now they have gone back to 95% and 82% respectively. They appear to be running OK, albeit a bit long. I must leave for an appointment. I'll try to get that info when I get back. Strange. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
As Mathilde2006 pointed out in a recent note, this can happen if a WU restarts from a checkpoint. At first some 100+ percent value is shown until the next checkpoint is reached. Then the percentage is adjusted and jumps to the correct value. Could it be that your Pentiums restarted the WU or did a project swap to WCG? I guess it takes some time for them to reach the next checkpoint...
I did not mention the unusual percentage value because I'm used to seeing such values from HCMD2... ;-) Matthias |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges: |
OK, it appears Matthias is correct. When restoring from a checkpoint, you will see the percentage complete go higher than expected. This is just an issue with the calculation of percentage complete. As noted, it is corrected and updated at the next checkpoint, usually 5-15 minutes into the work unit.
-Uplinger |
||
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
OK, thanks for the information. Yes, both WUs were swapping with POEM WUs. Both have completed now. Odd, though: although both were swapping, one is inconclusive and the other is pending validation.
Looks like DDDT2 is not quite ready for "prime time" yet. |
||
|
|