| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 118
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I'm not a tech nor a community Advisor, but I'm wondering what your failure rate is on CEP and on how many and what kind of machines you are crunching on.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
The techs can't look into it. You didn't name the task, and your original report gave no useful information. **************. Project Name: The Clean Energy Project Created: 1/29/09 Name: E000287_312A_001s2g012 Minimum Quorum: 2 Initial Replication: 2 **Edited for intolerance**tkh [Edit 2 times, last edit by TKH at Feb 5, 2009 1:31:49 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I'm not a tech nor a community Advisor, but I'm wondering what your failure rate is on CEP and on how many and what kind of machines you are crunching on. Hi, i have not had an errored out CEP yet, i did stop when the big problems were around but since the restart no problems again, i must just be lucky. Not the answer you asked for but there you go. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi, i have not had an errored out CEP yet, i did stop when the big problems were around but since the restart no problems again, i must just be lucky. Not the answer you asked for but there you go. Well, I was asking this more specific to David because I want to see if his failure rate deviates a lot from mine or not. Since the last restart of the project my failure rate is quite low (1 failure in the last days), but much better than it was on this project before. For CEP I use 4 machines with in total 10 CPU-cores in them. It could be that there is somethng in Davids configuration what causes a high failure rate. If so, it could be interesting to examine that config in order to find out what causes that high failure rate. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
They should. That work unit was going nowhere. It had been running 20.83 hours on a very fast computer when I aborted it because it was making no progress. David, it is possible to have a large WU on CEP. I've had a WU which took 45 hours to complete and was valid after all. It ran on a Quad-core which is also a fast guy, so it is possible. These large WU's are also valuable for the project, especially when they are large because they contain a lot of information. So please let them run. |
||
|
|
David_L6
Senior Cruncher USA Joined: Aug 24, 2006 Post Count: 296 Status: Offline Project Badges:
|
I'm not a tech nor a community Advisor, but I'm wondering what your failure rate is on CEP and on how many and what kind of machines you are crunching on. Failure rate isn't as bad as it was a few weeks ago but no other project produces errors on my computers. Possible exception is HPF2. I no longer run that project. 1) 3.4GHz P4 with HT (not overclocked), 1.5GB RAM, Windows XP Home ^^^ I seldom run this machine. 2) Q6700 (not overclocked), 3GB RAM, Vista Ultimate 32 bit ^^^ Rock solid. If I get errors on this one, something is wrong with the work unit. 3) QX6700 @ 3.6GHz (overclocked), 4GB RAM, XP Pro 32 bit 4) Q6700 @ 3.65GHz (overclocked), 8GB RAM, Vista Ultimate 64 bit ![]() |
||
|
|
David_L6
Senior Cruncher USA Joined: Aug 24, 2006 Post Count: 296 Status: Offline Project Badges:
|
The techs can't look into it. You didn't name the task, and your original report gave no useful information. ************ Project Name: The Clean Energy Project Created: 1/29/09 Name: E000287_312A_001s2g012 Minimum Quorum: 2 Initial Replication: 2 ********** ******** **Edited for inappropriate language and intolerance**tkh ![]() [Edit 1 times, last edit by TKH at Feb 5, 2009 1:33:05 PM] |
||
|
|
David_L6
Senior Cruncher USA Joined: Aug 24, 2006 Post Count: 296 Status: Offline Project Badges:
|
They should. That work unit was going nowhere. It had been running 20.83 hours on a very fast computer when I aborted it because it was making no progress. David, it is possible to have a large WU on CEP. I've had a WU which took 45 hours to complete and was valid after all. It ran on a Quad-core which is also a fast guy, so it is possible. These large WU's are also valuable for the project, especially when they are large because they contain a lot of information. So please let them run.I don't mind a large work unit, but I don't think that was the case with this one. I let that particular work unit run overnight after I suspected there was a problem. I also tried shutting it down and re-starting it and re-booting computer. It did the same thing some previous work units did (nothing) so I aborted that one. ![]() |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
******** ************ You are having legitimate problems with a project ***. You are running multi-core machines (known problem with CEP). You are overclocking (might be a problem). You are running CEP on Vista machines (known problem). By looking at your machines, I can tell you are going to have problems with CEP. ********. ********* **Edited for intolerance**tkh [Edit 2 times, last edit by TKH at Feb 6, 2009 1:34:53 PM] |
||
|
|
mclaver
Veteran Cruncher Joined: Dec 19, 2005 Post Count: 566 Status: Offline Project Badges:
|
It looks like there still is problems with CEP. We continue to loose processing time here. It looks like everyone that processed these work units ended in Errors, so they are probably bad WU, not my computers.
----------------------------------------IT LOOKS LIKE I AM GETTING ONE ERROR A DAY ON CEP Result Name Device Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000303_ 509A_ 001u1i00n_ 1-- fox-amd-9950 Error 2/2/09 12:10:18 2/5/09 11:47:00 4.14 72.3 / 0.0 E000285_ 528A_ 001s1a00i_ 5-- ASUS-i7-965 Error 2/4/09 12:23:25 2/4/09 20:51:11 8.25 203.6 / 0.0 E000284_ 830A_ 001s0t00y_ 4-- ASUS-i7-965 Error 2/2/09 22:17:38 2/3/09 04:00:56 5.62 137.1 / 0. ALL THREE ERRORS HAVE THE SAME RESULT LOG Result Log <core_client_version>6.4.5</core_client_version> <![CDATA[ <message> The system cannot write to the specified device. (0x1d) - exit code 29 (0x1d) </message> <stderr_txt> Calling initGraphics() INFO: No state to restore. Start from the beginning. </stderr_txt> ]]> EVERYONE WHO ATTEMPTED THIS WU, ENDED IN ERROR Workunit Status Project Name: The Clean Energy Project Created: 1/29/09 Name: E000285_528A_001s1a00i Minimum Quorum: 2 Initial Replication: 2 Result Name Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit E000285_ 528A_ 001s1a00i_ 6-- Error 2/4/09 20:53:01 2/5/09 06:08:11 6.31 151.6 / 0.0 E000285_ 528A_ 001s1a00i_ 5-- Error 2/4/09 12:23:25 2/4/09 20:51:11 8.25 203.6 / 0.0 E000285_ 528A_ 001s1a00i_ 4-- Too Late 2/1/09 16:48:06 2/2/09 21:03:46 10.62 166.4 / 0.0 E000285_ 528A_ 001s1a00i_ 3-- Error 2/1/09 09:49:09 2/4/09 12:20:47 10.54 162.6 / 0.0 E000285_ 528A_ 001s1a00i_ 2-- Error 2/1/09 04:55:09 2/1/09 16:47:08 7.62 148.7 / 0.0 E000285_ 528A_ 001s1a00i_ 0-- Error 1/31/09 11:27:47 2/1/09 09:47:20 11.68 136.7 / 0.0 E000285_ 528A_ 001s1a00i_ 1-- Error 1/31/09 11:27:28 2/1/09 04:54:20 9.62 142.8 / 0.0 ![]() ![]() ![]() |
||
|
|
|