| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 1
|
|
| Author |
|
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Did not see a topic amongst the 50 now in this forum on a fail we've seen a few on during beta https://secure.worldcommunitygrid.org/forums/wcg/viewthread_thread,39109 and one case in the Android forum, a OET1 at the time... not an exclusive, but still.
Result Name: HST1_ 002912_ 000042_ AC0003_ T300_ F00042_ S00005_ 1-- <core_client_version>7.6.2</core_client_version> <![CDATA[ <message> (unknown error) - exit code -1073740940 (0xc0000374) </message> <stderr_txt> INFO: No state to restore. Start from the beginning. [08:09:09] INFO: Running initial simulation Writing checkpoint at step 850. [10:42:15] INFO: Running initial simulation </stderr_txt> ]]> To only piece in the event log that tells of the fail is 10 seconds later 31-May-2016 10:42:25 [World Community Grid] [sched_op] Reason: Unrecoverable error for task HST1_002912_000042_AC0003_T300_F00042_S00005_1 31-May-2016 10:42:25 [World Community Grid] Computation for task HST1_002912_000042_AC0003_T300_F00042_S00005_1 finished 31-May-2016 10:42:25 [World Community Grid] Output file HST1_002912_000042_AC0003_T300_F00042_S00005_1_r1010594328_0 for task HST1_002912_000042_AC0003_T300_F00042_S00005_1 absent 31-May-2016 10:42:25 [World Community Grid] Output file HST1_002912_000042_AC0003_T300_F00042_S00005_1_r1010594328_4 for task HST1_002912_000042_AC0003_T300_F00042_S00005_1 absent 31-May-2016 10:42:25 [World Community Grid] Output file HST1_002912_000042_AC0003_T300_F00042_S00005_1_r1010594328_5 for task HST1_002912_000042_AC0003_T300_F00042_S00005_1 absent 31-May-2016 10:42:25 [World Community Grid] Output file HST1_002912_000042_AC0003_T300_F00042_S00005_1_r1010594328_6 for task HST1_002912_000042_AC0003_T300_F00042_S00005_1 absent Truth be told, there was a power fail and this occurred shortly after power up and client start to run [8 HST concurrent], this one the only going heavy. Still not found on Google, but one comment here was a memory access violation, though the serial input file absent suggest something more going on. |
||
|
|
|