| World Community Grid Forums
|
|
Thread Status: Active Total posts in this thread: 129
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Nope, *all* of the 13 copies for the WU have been changed to Other, not just the last one. None of the dates or other data were changed. It just went from 12 Inconclusives (counting mine) and 1 In Progress to being 13 Others instead, and it's finally stopped. Looks like after it finished crunching copy 13 it finally realized it wasn't getting anywhere, and since none of the other status codes applied, everything ended up set to Other. I would've expected them to all be flagged as Errors, but whatever.
|
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Okay, that's good. That means the knreed rule of "12 max, then quit" is starting to kick in, possibly applied manually for now, since 14 was, as said, the last reported record.
----------------------------------------
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Jul 8, 2006 7:18:24 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
BOINC User ID 225561, Host ID 41341.
Error 8: za064_ 00199, returned 07/08/2006 08:56:36, 3 other copies in progress, 1 other copy immediate error after 0.03 hours, mine aborted after 3.93 hours of normal checkpoint messages with "Incorrect function. (0x1) - exit code 1 (0x1)". No system abort code or address given. Interesting to note that this is a "za064" unit in the middle of a za098 and a za099, yet the log shows it was first sent out as a normal batch of 3 just a few hours ago (i.e. I'm not doing mop up). Could be some of the earlier failed units are being reissued from scratch again, maybe to reproduce problems or to try out 5.07 changes. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Very probable..... WCG also just released a new HPF2 exe for the UD agent with fixes. You may recollect they had to redo a few batches for HPF1 just before that project closed, due to compiling issues. Even yesterday, 100 days' worth of HPF1 was still coming through..... That could include late reporters, of course.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
My process has been running for 125hrs and still at 0%. I restarted my machine and it has been running for another 9hrs at 0%.
My device ID is 128385. Agent Version is 3.0(2844) It started about 4 days ago. Thanks, Ed ekevans@aol.com |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Ed, this is beyond the call of duty..... Go to Task Manager (right-click on the taskbar) and look for a process named something like ud_7324001.exe. Terminate that one only, and a new work unit will be loaded. The broken WU bits will be sent back during this exchange.
----------------------------------------
Note that you will get a new HPF2 unit directly if you run HPF2 only. If you are also running FAAH, chances are you will get one of those first. The version number of the HPF2 science application is on the 'i' screen and should be something like 5.05.03. The Agent itself (the front-end GUI) has a number of its own, which should be 3.0(2844) for everyone at the time of writing.
If you get a new HPF, can you please confirm here, along with the version from the 'i' screen of the UD Agent and the UD_7.....exe number? Just so that the next readers of the forum know whether they are using a new or an older version. Thx.
PS: Restarting mostly does not help. It runs the risk of beginning all over. Again, the 'i' screen should show progress in 0.1% steps and 3 line graphs should move..... If neither does, the WU is dead; refer to the top of this post.
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 2 times, last edit by Sekerob at Jul 8, 2006 3:01:26 PM] |
||
|
|
depriens
Senior Cruncher The Netherlands Joined: Jul 29, 2005 Post Count: 350 Status: Offline
|
Unfortunately I don't have access to my computers at the moment, but I'll post the unusual results anyway. Should it be unusable without the additional information, please delete this message.
----------------------------------------
za086_ 00132 Invalid 07/03/2006 06:01:31 07/06/2006 16:16:54 14.41 73 / 52
za068_ 00712 Error 07/05/2006 04:05:13 07/05/2006 09:45:14 5.43 43 / 0 |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello Mark099, Right-click at the bottom of your screen, select Task Manager, then select WCGrid_Rosetta in the processes, then Kill it. Lawrence
Have done that now. I have also stopped running HPF2 units for now. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I got a 'Frozen at 0.0%' HPF2 workunit on my UD computer - device 3579. It should have downloaded about 20:00 UTC on Friday, 7 July 2006. I checked it around 11 PM EDT this Sunday night and discovered it had run for 55 hours + minutes. The graphs had started, then frozen. I rebooted and brought it up on the screen. At 1 minute 55 secs when I switched to the Application View it had redrawn what looked like the exact same graph. After recording some details on the Main View, I switched back to the Application View at 4 minutes run time. The graphs had disappeared and the values (solvation, etc.) had been replaced by 4 dots. So I killed it and started on a FAAH unit.
Lawrence |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just aborted za090_00104_3 after it's been stuck on 96.552% for over 64 hours.
If the PC is rebooted, it restarts at 96.552%, but the CPU time goes back to around 6 hours. |
||
|
|
|