| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 3595
|
|
| Author |
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
098 is currently not a priority but that will probably change in the next 24 hours. Your 068 probably has 2 wingmen.
Mike |
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
ARP1_0033636_089_5
Currently at 79% but I don't expect anything from it. All previous copies errored with Unhandled Exception. Again this morning, no new work available - only the occasional resend. Mike |
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
ARP1_0033558_091_5 also errored for the same reason.
Mike |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Once again no work after 0000 UTC stats run and machine is idle at the moment. I hope someone looks at the WUs that are getting errors soon so we don't end up in the extreme laggard situation again like the last time
|
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1408 Status: Offline Project Badges:
|
I got 4 new tasks (no resends) half an hour ago. 098 not yet a straggler.
|
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
I got 6 new tasks at 13:14 GMT (UTC). All non-priority 099 x 5 and 101.
There does seem to be a link to the statistics runs. I have reported the issue. Mike |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
unfortunately, since this has become a somewhat regular occurrence, i have downloaded an extra 64 WUs as a buffer so when they start ending around midnight the machine will have others to work on.
|
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
I should have posted before this - I apologize for the delay.
There was an issue with the object storage system starting around 1:00 UTC Saturday morning. Due to this issue we were unable to create the next generation of workunits for the project except occasionally when the issue wasn't present. The issue was intermittent but each time it happened it caused the assimilators to stop running. We had a period mid-day Saturday and again on Sunday when we were able to run for a few hours before the issue re-occurred. That allowed us to catch up on the backlog and get a large set of work out. It has been running again since about 8 hours ago so I'm hopeful that things are stable again. The object storage is used to archive the outputs used for each generation so that we can re-run a line starting from a previous generation if needed. We only save the past 6 generations due to the amount of storage required (we used to only save 3 but in the case of the 6 workunits we had to restart, 3 generations proved to not be enough). |
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
Thank you, Kevin. Let's hope it doesn't recur.
How about the issue with ARP1_0033558_091_5 & ARP1_0033636_089_5 ? Are they now stuck? Mike |
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
Thank you, Kevin. Let's hope it doesn't recur. How about the issue with ARP1_0033558_091_5 & ARP1_0033636_089_5 ? Are they now stuck? Mike There is a small number of workunits that are having an issue (ARP1_0033558_091 and ARP1_0033636_089 are among them). I need to collect and send the data back to Delft but I had some critical transition work I needed to do first. I'm hoping to be able to send that information to them soon. |
||
|
|
|