Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: Computing for Sustainable Water Forum Thread: Computing for Sustainable Water Problems Thread |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 254
|
Author |
|
Dataman
Ace Cruncher Joined: Nov 16, 2004 Post Count: 4865 Status: Offline Project Badges: |
Having problems running CFSW? Please post your details here.
---------------------------------------- |
||
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 2977 Status: Offline Project Badges: |
Question for WCG techs...
----------------------------------------I know that, in the initial stages, everyone will need verifying for this new project (and hence, everyone will automatically be in a quorum 2), but once machines start being classed as reliable, will this project remain as Quorum 2, or revert to a ZR project? (p.s., I don't know if the issue was at my end or not, but I can now post via IE8 - thus I've deleted that other thread I'd started). |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Now this is really getting in at the beginning...
----------------------------------------Project Name: Computing for Sustainable Water Created: 04/17/2012 14:22:39 Name: cfsw_0000_00000000_0 Minimum Quorum: 2 Replication: 2 [Edit 1 times, last edit by Former Member at Apr 17, 2012 5:44:07 PM] |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges: |
Question for WCG techs... I know that, in the initial stages, everyone will need verifying for this new project (and hence, everyone will automatically be in a quorum 2), but once machines start being classed as reliable, will this project remain as Quorum 2, or revert to a ZR project? (p.s., I don't know if the issue was at my end or not, but I can now post via IE8 - thus I've deleted that other thread I'd started). Yes, once hosts get validated, work units will start going out with Zero Redundancy. Similar to how other ZR projects run. Thanks, -Uplinger |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks uplinger. Great news.
|
||
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 2977 Status: Offline Project Badges: |
Yes thanks Uplinger for verifying that - it'll certainly make the project run quicker
---------------------------------------- |
||
|
pcwr
Ace Cruncher England Joined: Sep 17, 2005 Post Count: 10903 Status: Offline Project Badges: |
Is there any special requirements for this project?
----------------------------------------Opted in, but not getting any WUs. Patrick |
||
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges: |
pcwr,
As we usually do with projects when they first start, we don't set them at equal priority and that causes other projects get priority in the scheduler. Thanks, -Uplinger |
||
|
PMH_UK
Veteran Cruncher UK Joined: Apr 26, 2007 Post Count: 760 Status: Recently Active Project Badges: |
Edit2: Some other tasks failed some time later, details at end of post, rebooted.
----------------------------------------Edit: The 1 running failed as well, details below at end of post. I have 1 running for nearly an hour but 7 failed early on a quad core, others OK so far on Linux and Windows. It ran Betas OK. World Community Grid 6.05 Beta Test BETA_cfsw_0002_00002155_1 06:35:12 (06:10:51) 13/04/2012 05:12:33 13/04/2012 05:16:33 Reported: OK S710-U World Community Grid 6.05 Beta Test BETA_cfsw_0002_00002879_1 06:27:46 (06:02:52) 13/04/2012 05:03:44 13/04/2012 05:05:48 Reported: OK * S710-U World Community Grid 6.05 Beta Test BETA_cfsw_0002_00002763_0 06:26:55 (06:02:20) 13/04/2012 05:03:44 13/04/2012 05:05:48 Reported: OK * S710-U World Community Grid 6.05 Beta Test BETA_cfsw_0002_00002418_0 06:18:07 (05:54:51) 13/04/2012 04:55:24 13/04/2012 04:57:28 Reported: OK * S710-U S710-U log: 1 12/04/2012 23:19:15 Starting BOINC client version 6.10.59 for i686-pc-linux-gnu 15 12/04/2012 23:19:15 Processor: 4 GenuineIntel Intel(R) Core(TM) i5 CPU M 540 @ 2.53GHz [Family 6 Model 37 Stepping 5] 16 12/04/2012 23:19:15 Processor: 3.00 MB cache 17 12/04/2012 23:19:15 Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx rdtscp lm constant_tsc arch_perfmon pebs bts xtopology nonstop_tsc aperfmperf pni pclmulqdq dtes64 monitor ds_cp 18 12/04/2012 23:19:15 OS: Linux: 2.6.38-14-generic-pae 19 12/04/2012 23:19:15 Memory: 3.73 GB physical, 3.80 GB virtual Result Name Device Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit cfsw_ 0000_ 00000110_ 0-- S710-U Error 17/04/12 15:35:57 17/04/12 18:54:45 0.04 0.7 / 0.0 cfsw_ 0000_ 00000286_ 0-- S710-U Error 17/04/12 15:35:57 17/04/12 18:54:45 0.00 66.4 / 0.0 cfsw_ 0000_ 00000547_ 3-- S710-U Error 17/04/12 15:35:56 17/04/12 18:35:19 0.04 0.7 / 0.0 cfsw_ 0000_ 00000511_ 0-- S710-U In Progress 17/04/12 15:35:56 27/04/12 15:35:56 0.00 0.0 / 0.0 cfsw_ 0000_ 00000674_ 3-- S710-U Error 17/04/12 15:35:55 17/04/12 18:31:09 0.04 0.6 / 0.0 cfsw_ 0000_ 00000946_ 1-- S710-U Error 17/04/12 15:35:55 17/04/12 18:25:31 0.06 0.9 / 0.0 cfsw_ 0000_ 00000321_ 3-- S710-U Error 17/04/12 15:35:54 17/04/12 18:45:38 0.00 0.0 / 0.0 cfsw_ 0000_ 00000298_ 3-- S710-U Error 17/04/12 15:35:54 17/04/12 18:47:57 0.04 0.6 / 0.0 Result Name App Version Number Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit cfsw_ 0000_ 00000547_ 4-- - In Progress 17/04/12 19:06:39 27/04/12 19:06:39 0.00 0.0 / 0.0 cfsw_ 0000_ 00000547_ 3-- 605 Error 17/04/12 15:35:56 17/04/12 18:35:19 0.04 0.7 / 0.0 cfsw_ 0000_ 00000547_ 2-- - In Progress 17/04/12 15:34:05 27/04/12 15:34:05 0.00 0.0 / 0.0 cfsw_ 0000_ 00000547_ 1-- 605 Error 17/04/12 15:27:42 17/04/12 15:32:53 0.00 66.4 / 0.0 cfsw_ 0000_ 00000547_ 0-- 605 Error 17/04/12 15:25:30 17/04/12 15:25:48 0.00 66.4 / 0.0 Result Name: cfsw_ 0000_ 00000547_ 3-- <core_client_version>6.10.59</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> [19:32:07] INFO:Beginning simulation: 1990 240 702302789 *** glibc detected *** free(): invalid next size (fast): 0x176b8340 *** SIGABRT: abort called Stack trace (16 frames): [0x8144760] [0x81c3dd4] [0xb77ba400] [0x81cc4d4] [0x81e1c9f] [0x81e6f31] [0x81e72eb] [0x81b0fa1] [0x806c260] [0x806bc52] [0x804d663] [0x807b571] [0x8052d29] [0x8067deb] [0x81c54b6] [0x8048131] Exiting... </stderr_txt> ]]> Result Name: cfsw_ 0000_ 00000547_ 1-- <core_client_version>6.10.17</core_client_version> <![CDATA[ <message> too many exit(0)s </message> ]]> cfsw_ 0000_ 00000674, cfsw_0000_00000321 and cfsw_0000_00000298 are similar to above, with others also showing too many exit(0)s or similar error. Another PC of mine (GX260) is wingman on other 3 errored units - it got 9 work units for it's single core and will not run these for a while. The 1 that ran for a while failed with: Result Name: cfsw_ 0000_ 00000511_ 0-- <core_client_version>6.10.59</core_client_version> <![CDATA[ <message> process exited with code 193 (0xc1, -63) </message> <stderr_txt> [19:37:39] INFO:Beginning simulation: 1990 240 42976125 [19:48:51] INFO: Finished tick number 4 [19:58:00] INFO: Finished tick number 9 [20:05:19] INFO: Finished tick number 14 [20:14:43] INFO: Finished tick number 19 [20:21:48] INFO: Finished tick number 24 [20:31:10] INFO: Finished tick number 29 [20:39:17] INFO: Finished tick number 34 [20:47:50] INFO: Finished tick number 39 SIGSEGV: segmentation violation Stack trace (19 frames): [0x8144760] [0x81c3dd4] [0xb78fe400] [0x80a24b2] [0x80b0018] [0x80e9c1e] [0x80eae41] [0x80eb015] [0x8096039] [0x804cd13] [0x8079486] [0x80501c0] [0x807b4e2] [0x806df03] [0x806e763] [0x8052ecc] [0x8067deb] [0x81c54b6] [0x8048131] Exiting... </stderr_txt> ]]> GFAM_ x3uja_ a_ PfPMT_ 0016825_ 0075_ 0-- S710-U Error 16/04/12 18:29:31 17/04/12 20:32:20 7.37 111.5 / 0.0 GFAM_ x3uja_ a_ PfPMT_ 0016825_ 0140_ 1-- S710-U Error 16/04/12 18:29:31 17/04/12 20:16:32 7.87 122.6 / 0.0 GFAM_ x3uja_ a_ PfPMT_ 0016825_ 0006_ 0-- S710-U Error 16/04/12 18:29:31 17/04/12 20:47:28 0.69 11.6 / 0.0 GFAM_ x3uja_ a_ PfPMT_ 0016825_ 0073_ 1-- S710-U Error 16/04/12 18:29:31 17/04/12 20:10:14 4.44 68.2 / 0.0 Result Name: GFAM_ x3uja_ a_ PfPMT_ 0016825_ 0073_ 1-- <core_client_version>6.10.59</core_client_version> <![CDATA[ <message> process exited with code 195 (0xc3, -61) </message> <stderr_txt> INFO: No state to restore. Start from the beginning. [15:51:21] Number of tasks = 66 [15:51:21] Starting task 0,CPU time is 0.000000. [15:51:21] ./ZINC05034555.pdbqt size = 32 5 ../../projects/www.worldcommunitygrid.org/gfam.x3uja_a_PfPMT.pdbqt size = 2603 0 [16:04:36] Finished task #0 cpu time used 738.154131 <snip> [20:28:44] Starting task 30,CPU time is 15169.232007. [20:28:44] ./ZINC05061014.pdbqt size = 28 4 ../../projects/www.worldcommunitygrid.org/gfam.x3uja_a_PfPMT.pdbqt size = 2603 0 [20:36:32] Finished task #30 cpu time used 449.204073 [20:36:32] Starting task 31,CPU time is 15618.436080. [20:36:32] ./ZINC05061018.pdbqt size = 25 5 ../../projects/www.worldcommunitygrid.org/gfam.x3uja_a_PfPMT.pdbqt size = 2603 0 *** glibc detected *** double free or corruption (out): 0x0acd1960 *** ERROR: VINA was killed by signal 6. Retrying task. [20:42:27] Starting task 31,CPU time is 15618.436080. [20:42:27] ./ZINC05061018.pdbqt size = 25 5 ../../projects/www.worldcommunitygrid.org/gfam.x3uja_a_PfPMT.pdbqt size = 2603 0 Unable to update graphics data. ERROR: VINA was killed by signal 11. 20:49:18 (19938): called boinc_finish </stderr_txt> ]]> The other 3 have same error but at different times. Paul.
Paul.
----------------------------------------[Edit 2 times, last edit by PMH_UK at Apr 17, 2012 9:28:54 PM] |
||
|
petehardy
Senior Cruncher USA Joined: May 4, 2007 Post Count: 318 Status: Offline Project Badges: |
Is there any special requirements for this project? Opted in, but not getting any WUs. Patrick Badge Hunters uncheck everything except the project they want! "Patience is a virtue", I can't wait to learn it! |
||
|
|