World Community Grid Forums
Thread Status: Active Total posts in this thread: 3
knreed
Former World Community Grid Tech. Joined: Nov 8, 2004. Post Count: 4504.
We experienced a large number of result errors caused by an error in one of the input data files. We have cancelled all of the impacted workunits. The next time a computer holding one of the affected workunits contacts our servers, the impacted jobs on that machine will be aborted.

We did not find this particular error in targets 6-8 (target 5 experienced the error). To quickly determine whether targets 6-8 have these errors, we are releasing 100 workunits from each of these three targets and sending them out to see if they run properly. While we wait for those results to be processed, we are running the project slowly for the time being.
----------------------------------------
[Edit 1 times, last edit by knreed at Sep 5, 2011 12:28:54 PM]
knreed
Target 6 appears to be having troubles as well. To be cautious we are completely stopping the project until we get the results back for the 100 test workunits for targets 7 and 8.
knreed
Targets 7 and 8 look like they are running correctly. Target 6 does have errors, so we have cancelled those workunits.
We are starting a 100-workunit test with targets 9-15; those workunits have been sent out. We have now resumed distributing work for this project for targets 7 and 8. We will run the project at a reduced pace until we have some process improvements in place to avoid this type of issue in the future.