World Community Grid Forums
Category: Completed Research Forum: Computing for Sustainable Water Forum Thread: CFSW tasks getting Computation Error right after starting |
Thread Status: Active Total posts in this thread: 57
BKraayev
Cruncher Joined: Mar 23, 2005 Post Count: 45 Status: Offline Project Badges: |
I have started to get errors as soon as the tasks start. I had been processing CFSW successfully up until today.
25/08/2012 11:02:21 AM World Community Grid Starting cfsw_14390_14390410_0
25/08/2012 11:02:21 AM World Community Grid Starting task cfsw_14390_14390410_0 using cfsw version 611
25/08/2012 11:02:22 AM World Community Grid Computation for task cfsw_14390_14390410_0 finished
25/08/2012 11:02:22 AM World Community Grid Output file cfsw_14390_14390410_0_0 for task cfsw_14390_14390410_0 absent
25/08/2012 11:02:22 AM World Community Grid Starting cfsw_14390_14390400_0
25/08/2012 11:02:22 AM World Community Grid Starting task cfsw_14390_14390400_0 using cfsw version 611
25/08/2012 11:02:23 AM World Community Grid Computation for task cfsw_14390_14390400_0 finished
25/08/2012 11:02:23 AM World Community Grid Output file cfsw_14390_14390400_0_0 for task cfsw_14390_14390400_0 absent
25/08/2012 11:02:23 AM World Community Grid Starting cfsw_14390_14390351_0
25/08/2012 11:02:23 AM World Community Grid Starting task cfsw_14390_14390351_0 using cfsw version 611
25/08/2012 11:02:24 AM World Community Grid Computation for task cfsw_14390_14390351_0 finished
25/08/2012 11:02:24 AM World Community Grid Output file cfsw_14390_14390351_0_0 for task cfsw_14390_14390351_0 absent
25/08/2012 11:02:24 AM World Community Grid Starting cfsw_14390_14390195_0
25/08/2012 11:02:24 AM World Community Grid Starting task cfsw_14390_14390195_0 using cfsw version 611
25/08/2012 11:02:26 AM World Community Grid Computation for task cfsw_14390_14390195_0 finished
25/08/2012 11:02:26 AM World Community Grid Output file cfsw_14390_14390195_0_0 for task cfsw_14390_14390195_0 absent
25/08/2012 11:02:26 AM World Community Grid Starting cfsw_14390_14390185_0
25/08/2012 11:02:26 AM World Community Grid Starting task cfsw_14390_14390185_0 using cfsw version 611
25/08/2012 11:02:27 AM World Community Grid Computation for task cfsw_14390_14390185_0 finished
25/08/2012 11:02:27 AM World Community Grid Output file cfsw_14390_14390185_0_0 for task cfsw_14390_14390185_0 absent
25/08/2012 11:02:27 AM World Community Grid Starting cfsw_14390_14390035_0
25/08/2012 11:02:27 AM World Community Grid Starting task cfsw_14390_14390035_0 using cfsw version 611
25/08/2012 11:02:28 AM World Community Grid Computation for task cfsw_14390_14390035_0 finished
25/08/2012 11:02:28 AM World Community Grid Output file cfsw_14390_14390035_0_0 for task cfsw_14390_14390035_0 absent |
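For anyone wanting to check their own message log for the same pattern, here is a rough Python sketch that pairs each "Starting task" entry with its "Computation ... finished" entry and flags tasks whose output file was reported absent. The function name `summarize` and the regular expressions are my own assumptions based only on the lines pasted above; other BOINC versions or locales may word or date these messages differently.

```python
import re
from datetime import datetime

# Timestamp as it appears in the pasted log: day/month/year, 12-hour clock.
STAMP = r"(\d{2}/\d{2}/\d{4} \d{1,2}:\d{2}:\d{2} [AP]M)"
START = re.compile(STAMP + r" World Community Grid Starting task (\S+) using")
DONE = re.compile(STAMP + r" World Community Grid Computation for task (\S+) finished")
ABSENT = re.compile(r"Output file \S+ for task (\S+) absent")


def parse_stamp(text):
    """Convert a log timestamp such as '25/08/2012 11:02:21 AM' to a datetime."""
    return datetime.strptime(text, "%d/%m/%Y %I:%M:%S %p")


def summarize(log_text):
    """Return {task: (seconds_between_start_and_finish, output_missing)}."""
    started = {task: parse_stamp(ts) for ts, task in START.findall(log_text)}
    missing = set(ABSENT.findall(log_text))
    summary = {}
    for ts, task in DONE.findall(log_text):
        if task in started:
            seconds = (parse_stamp(ts) - started[task]).total_seconds()
            summary[task] = (seconds, task in missing)
    return summary


sample = (
    "25/08/2012 11:02:21 AM World Community Grid Starting task "
    "cfsw_14390_14390410_0 using cfsw version 611\n"
    "25/08/2012 11:02:22 AM World Community Grid Computation for task "
    "cfsw_14390_14390410_0 finished\n"
    "25/08/2012 11:02:22 AM World Community Grid Output file "
    "cfsw_14390_14390410_0_0 for task cfsw_14390_14390410_0 absent\n"
)

for task, (seconds, output_missing) in summarize(sample).items():
    print(task, seconds, output_missing)
```

A task that finishes within a second or two and has no output file matches the failures in the log above. The patterns search the whole text rather than single lines, so they should also cope with a log pasted as one long run.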
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7579 Status: Offline Project Badges: |
First thing to do is reboot your machine, then see if that fixes it.
Edit: If this is a widespread problem, it may be a bad batch of WUs. (See following posts.)
Cheers
Sgt. Joe
----------------------------------------*Minnesota Crunchers* [Edit 2 times, last edit by Sgt.Joe at Aug 25, 2012 4:38:32 PM] |
||
|
metallicafan
Cruncher Joined: Oct 24, 2008 Post Count: 4 Status: Offline Project Badges: |
I am having this same issue on two different machines of mine. Been running stable for a long time and just this morning on both machines the CFSW tasks all result in a computation error immediately after starting.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
One of my remote machines is doing that too. So are a lot of the wingmen. I think we have a bunch of broken WUs. I switched the remote machine to HCC and it seems happy with that. Here's a sample result set and log from one of the bad ones:
Workunit Status
Project Name: Computing for Sustainable Water
Created: 08/23/2012 16:42:51
Name: cfsw_14389_14389831
Minimum Quorum: 2
Replication: 2

Result Name | App Version Number | Status | Sent Time | Time Due / Return Time | CPU Time (hours) | Claimed / Granted BOINC Credit
cfsw_ 14389_ 14389831_ 4-- | - | In Progress | 25/08/12 15:56:50 | 29/08/12 15:56:50 | 0.00 | 0.0 / 0.0
cfsw_ 14389_ 14389831_ 3-- | 611 | Error | 25/08/12 15:55:24 | 25/08/12 15:56:37 | 0.00 | 0.0 / 0.0
cfsw_ 14389_ 14389831_ 2-- | 611 | Error | 25/08/12 15:53:45 | 25/08/12 15:55:16 | 0.00 | 0.0 / 0.0
cfsw_ 14389_ 14389831_ 1-- | - | In Progress | 25/08/12 15:52:29 | 04/09/12 15:52:29 | 0.00 | 0.0 / 0.0
cfsw_ 14389_ 14389831_ 0-- | 611 | Error | 25/08/12 15:52:23 | 25/08/12 15:53:37 | 0.00 | 0.0 / 0.0

Result Log
Result Name: cfsw_ 14389_ 14389831_ 3--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
- exit code -1 (0xffffffff)
</message>
<stderr_txt>
[16:55:30] INFO:Beginning simulation: 2010:240:1144431681
</stderr_txt>
]]> |
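If you want to compare result logs across several failed results, a small Python sketch like the following can pull out the interesting fields. The helper name `parse_result_log` is hypothetical, and the tag names are simply the ones visible in the log above; I am assuming other result logs use the same tags.

```python
import re


def parse_result_log(text):
    """Extract client version, exit message and stderr from a pasted result log."""

    def grab(tag):
        # Non-greedy match so each tag only captures its own content.
        match = re.search(rf"<{tag}>(.*?)</{tag}>", text, re.DOTALL)
        return match.group(1).strip() if match else None

    return {
        "core_client_version": grab("core_client_version"),
        "message": grab("message"),
        "stderr": grab("stderr_txt"),
    }


sample = """<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
- exit code -1 (0xffffffff)
</message>
<stderr_txt>
[16:55:30] INFO:Beginning simulation: 2010:240:1144431681
</stderr_txt>
]]>"""

result = parse_result_log(sample)
print(result["core_client_version"])
print(result["message"])
print(result["stderr"])
```

In this log the stderr section stops right after the "Beginning simulation" line, which fits with the task dying within a second of starting.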
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I just had 8 jobs like that too, on a wee C2D running Vista that has never had any errors before. They were all repair jobs that everyone else had errors with too. Looks like dodgy WUs.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Same errors here on two units
|
||
|
wchoff
Cruncher Joined: Nov 17, 2004 Post Count: 35 Status: Offline Project Badges: |
One more with this problem. Last night's units ran fine. This morning is a bloodbath.
|
||
|
Eurwin
Cruncher Joined: Apr 28, 2007 Post Count: 17 Status: Offline Project Badges: |
Greetings,
Same issue here. Everything went perfectly until cfsw_14352_14352593 (25-8-12 13:31:58) came up. The workunit starts and almost immediately ends in an error: "upload file absent". The last one I tried, cfsw_14406_14406770 (25-8-12 17:11:06), also ended in an error. I have 6 pages of the same error, so I switched off CFSW and turned SN2S back on. I run Win7 Premium x64 with BOINC 7.0.28 x64 on an AMD X6 1090 at stock clock.
[Edit 1 times, last edit by Eurwin at Aug 25, 2012 5:35:55 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Count me in. Thought it was my machine's fault at first (crunching CFSW exclusively) and had the project reset, but it still kept producing errors.
Will let it crunch something else for now.
Edit: My machine is running Windows 7 x64, which produced short result logs like the one posted above. A wingman of one of the WUs I've got has a longer Result Log, which seems to be from a Mac (although it is also an error):

Result Log
Result Name: cfsw_ 14357_ 14357622_ 1--
<core_client_version>7.0.28</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
[01:30:30] INFO:Beginning simulation: 2010:240:668324524
SIGSEGV: segmentation violation
Crashed executable name: wcgrid_cfsw_baygame_6.12_x86_64-apple-darwin
built using BOINC library version 7.1.0
Machine type Intel 80486 (64-bit executable)
System version: Macintosh OS 10.7.4 build 11E53
Sun Aug 26 01:30:30 2012
atos cannot load symbols for the file wcgrid_cfsw_baygame_6.12_x86_64-apple-darwin for architecture x86_64.
0 wcgrid_cfsw_baygame_6.12_x86_64-apple-darwin 0x00000001000d572d
SIGPIPE: write on a pipe with no reader
1 wcgrid_cfsw_baygame_6.12_x86_64-apple-darwin 0x00000001000cad31
SIGPIPE: write on a pipe with no reader
2 libsystem_c.dylib 0x00007fff91831cfa
SIGPIPE: write on a pipe with no reader
3 ??? 0x000000095fbff610
SIGPIPE: write on a pipe with no reader
4 libstdc++.6.dylib 0x00007fff8a08fe57
SIGPIPE: write on a pipe with no reader
5 libsystem_c.dylib 0x00007fff917d07c8
SIGPIPE: write on a pipe with no reader
6 libsystem_c.dylib 0x00007fff917d0652
SIGPIPE: write on a pipe with no reader
7 wcgrid_cfsw_baygame_6.12_x86_64-apple-darwin 0x000000010000124b

Thread 0 crashed with X86 Thread State (64-bit):
rax: 0x0100001f rbx: 0x00000000 rcx: 0x7fff5fbfd608 rdx: 0x00000028
rdi: 0x7fff5fbfd668 rsi: 0x00000003 rbp: 0x7fff5fbfd650 rsp: 0x7fff5fbfd608
r8: 0x00000b07 r9: 0x00000000 r10: 0x000003b0 r11: 0xffffff80002da8d0
r12: 0x7fff5fbfd668 r13: 0x00000003 r14: 0x00000b07 r15: 0x000003b0
rip: 0x7fff93cca67a rfl: 0x00000206

Binary Images Description:
0x100000000 - 0x100125fff /Library/Application Support/BOINC Data/slots/11/../../projects/www.worldcommunitygrid.org/wcgrid_cfsw_baygame_6.12_x86_64-apple-darwin
0x7fff88014000 - 0x7fff88015fff /usr/lib/system/libunc.dylib
0x7fff88db2000 - 0x7fff88dbcfff /usr/lib/system/liblaunch.dylib
0x7fff89033000 - 0x7fff89075fff /usr/lib/system/libcommonCrypto.dylib
0x7fff89076000 - 0x7fff89077fff /usr/lib/system/libdnsinfo.dylib
0x7fff89096000 - 0x7fff89096fff /usr/lib/system/libkeymgr.dylib
0x7fff8a089000 - 0x7fff8a0fcfff /usr/lib/libstdc++.6.dylib
0x7fff8c1c4000 - 0x7fff8c1d2fff /usr/lib/system/libdispatch.dylib
0x7fff8c76b000 - 0x7fff8c76cfff /usr/lib/system/libsystem_sandbox.dylib
0x7fff8caaf000 - 0x7fff8cab6fff /usr/lib/system/libcopyfile.dylib
0x7fff8cbb8000 - 0x7fff8cbbdfff /usr/lib/system/libcompiler_rt.dylib
0x7fff8d0e4000 - 0x7fff8d0e5fff /usr/lib/system/libremovefile.dylib
0x7fff8ecb9000 - 0x7fff8ecc2fff /usr/lib/system/libsystem_notify.dylib
0x7fff8f268000 - 0x7fff8f285fff /usr/lib/system/libxpc.dylib
0x7fff8f6cc000 - 0x7fff8f6d1fff /usr/lib/system/libsystem_network.dylib
0x7fff8fd37000 - 0x7fff8fd64fff /usr/lib/libSystem.B.dylib
0x7fff8fd65000 - 0x7fff8fd6dfff /usr/lib/system/libsystem_dnssd.dylib
0x7fff9177c000 - 0x7fff91782fff /usr/lib/system/libmacho.dylib
0x7fff91783000 - 0x7fff9178efff /usr/lib/libc++abi.dylib
0x7fff9178f000 - 0x7fff9186cfff /usr/lib/system/libsystem_c.dylib
0x7fff918a8000 - 0x7fff918aafff /usr/lib/system/libquarantine.dylib
0x7fff93ca7000 - 0x7fff93cabfff /usr/lib/system/libdyld.dylib
0x7fff93cb5000 - 0x7fff93cd5fff /usr/lib/system/libsystem_kernel.dylib
0x7fff93cd6000 - 0x7fff93cdcfff /usr/lib/system/libunwind.dylib
0x7fff93d01000 - 0x7fff93d05fff /usr/lib/system/libmathCommon.A.dylib
0x7fff94ac5000 - 0x7fff94ac6fff /usr/lib/system/libsystem_blocks.dylib
0x7fff94ac7000 - 0x7fff94b02fff /usr/lib/system/libsystem_info.dylib
0x7fff94df8000 - 0x7fff94dfdfff /usr/lib/system/libcache.dylib
Exiting...
</stderr_txt>
]]>
[Edit 1 times, last edit by Former Member at Aug 25, 2012 5:45:23 PM] |
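A side note on the numbers in the two result logs: "exit code -1 (0xffffffff)" and "code 193 (0xc1, -63)" look like one value shown in more than one integer width. The Python sketch below only checks that arithmetic. The helper names are mine, and the choice of 32 and 8 bits is an assumption that happens to fit the printed values, not something the logs state.

```python
def as_unsigned(value, bits):
    """Reinterpret an integer as an unsigned value of the given width."""
    return value & ((1 << bits) - 1)


def as_signed(value, bits):
    """Reinterpret an integer as a two's-complement signed value of the given width."""
    value &= (1 << bits) - 1
    if value >= 1 << (bits - 1):
        return value - (1 << bits)
    return value


# Windows log: -1 viewed as a 32-bit unsigned value.
print(hex(as_unsigned(-1, 32)))

# Mac log: 193 in hex, and viewed as a signed 8-bit value.
print(hex(193), as_signed(193, 8))
```

So the bracketed values are consistent with being the same exit code in another representation. They do not say anything further about the cause of the crash.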
||
|
deltavee
Ace Cruncher Texas Hill Country Joined: Nov 17, 2004 Post Count: 4846 Status: Offline Project Badges: |
All CFSW WUs that I have received after 13:00 UTC are erroring.
---------------------------------------- |
||
|
|