| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 6
|
|
| Author |
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
Yesterday, I had several WUs error which were resent to other platforms - Linux & Darwin. These then became valid. Does this mean that my Windows 7 machine received them in error?
----------------------------------------Mike [Edit 1 times, last edit by Mike.Gibson at Jun 14, 2019 3:10:04 PM] |
||
|
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2346 Status: Offline Project Badges:
|
Yesterday, I had several WUs error which were resent to other platforms - Linux & Darwin. These then became valid. Does this mean that my Windows 7 machine received them in error? We can't tell, only you can. We could, if you would be providing more information to us. Go to your Results Status page; once there, find the WUs that errored out. (Probably by filtering by Result Status = Error) For each WU that you find, there is an entry with its name, its coupled Device name, its Status (Error, obviously) and some more information like Sent Time, Return Time, CPU Time and Credit.Mike Choose one WU to investigate, then click its Status (the Error link) and a window should open with its Result Log. This would give you/us a clue as to what happened during its run. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The answer is, if a task fails multiple times on one platform, it may get tried on another. The noted distribution detail on Result Status tells the story.
|
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
Here is one. It was only tried once on Windows before being switched to Darwin.
----------------------------------------Result Log Result Name: FAH2_ 002628_ zinc12385133_ 000001_ 000066_ 088_ 0-- <core_client_version>7.14.2</core_client_version> <![CDATA[ <message> (unknown error) - exit code 211 (0xd3)</message> <stderr_txt> INFO: result number = 0 %IMPACT-I: Requested file to open for appending md.out Does not exist. Opening it as a new file. %IMPACT-I: Softcore binding energy with umax = 1000.00000 %IMPACT-I: Using AGBNP2: Analytical Generalized Born Model + Analytic Non-Polar Hydration Model %IMPACT-I: Hybrid potential for binding with lambda = 0.03560 agbnpf_assign_parameters(): info: attempting to load from SQL tables. [03:30:45] INFO: Checkpointed. Progress 500 of 10000 steps complete CPU time 1046.532708 [ERROR] Failed to open wcg_checkpoint.dat for writing. [ERROR] Checkpoint failed. Exiting. 03:47:58 (3992): called boinc_finish(211) </stderr_txt> [Edit 1 times, last edit by Mike.Gibson at Jun 14, 2019 6:45:42 PM] |
||
|
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2346 Status: Offline Project Badges:
|
I searched the forum for "wcg_checkpoint.dat" (because of the line "[ERROR] Failed to open wcg_checkpoint.dat for writing.") in there.
Maybe, just maybe, this (or something like that) could be the source of the error. It could also be something else. Probably something outside the BOINC directory that is interfering with the BOINC processes. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
There errors BOINC knows about which 211 (without the minus) is not part and so the log says, unknown. https://github.com/BOINC/boinc/blob/master/lib/error_numbers.h
|
||
|
|
|