| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 15
|
|
| Author |
|
|
[AF>Le_Pommier] Jerome_C2005
Cruncher Joined: Aug 17, 2006 Post Count: 29 Status: Offline Project Badges:
|
Hi
----------------------------------------I have 3 CEP2 failing on my Mac after one hour or more of calculation. But 2 other did finish OK, after 14 or 15 hours. I don't know how to look for result error log on WCG (I know how to do it on any other project) so I can show you. Thanks for your help. |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7846 Status: Offline Project Badges:
|
On the My Contribution page click on the Result Status bar on the right side. When on the Result Status page use the drop down under Project Name and select The Clean Energy Project - Phase 2 item. this will display your recent work units under this project. Click on a work unit in question. This wioll bring up another screen. In one column you should see the designation "in progress", "valid" or "error." (There are others but these are the ones you should be concerned with right now.) Click on the "error". This will give you another screen with the "result Log". This you can copy and paste into the forum, if you wish.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
[AF>Le_Pommier] Jerome_C2005
Cruncher Joined: Aug 17, 2006 Post Count: 29 Status: Offline Project Badges:
|
Ok thanks so first think I can see it's failing for all instances of the WU (I can see 5 on the status page for the WU) and the detail of error is :
----------------------------------------Nom du résultat: E235445_ 867_ S.288.C32H21N5O5.PZCILQHXRZGBOA-UHFFFAOYSA-N.2_ s1_ 14_ 4-- <core_client_version>7.6.12</core_client_version> <![CDATA[ <message> process exited with code 195 (0xc3, -61) </message> <stderr_txt> INFO: No state to restore. Start from the beginning. [17:55:24] Number of jobs = 8 [17:55:24] Starting job 0,CPU time has been restored to 0.000000. [17:55:24] Starting new Job [17:55:25] Qink name = fldman [17:55:31] Qink name = gesman [17:55:33] Qink name = scfman No heartbeat from core client for 30 sec - exiting No heartbeat: Exiting INFO: No state to restore. Start from the beginning. [02:20:33] Number of jobs = 8 [02:20:33] Starting job 0,CPU time has been restored to 0.000000. [02:20:36] Starting new Job [02:20:41] Qink name = fldman [02:20:47] Qink name = gesman [02:20:49] Qink name = scfman [03:55:56] Qink name = anlman [03:55:56] Qink name = drvman Application exited with RC = 0x100 [03:57:59] Finished Job #0 called boinc_finish </stderr_txt> ]]> So it looks like I'm not the only one. |
||
|
|
Seoulpowergrid
Veteran Cruncher Joined: Apr 12, 2013 Post Count: 823 Status: Offline Project Badges:
|
I also have seen a high failure rate of CEP WUs in my Mac and saw the same WU failing for other machines as well. The high failure rate seems to be normal for the platform but i still run the project through the Mac.
----------------------------------------![]() |
||
|
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
Classic overload signal
No heartbeat from core client for 30 sec - exiting No heartbeat: Exiting Discussed sooooo many times on these forums, that one wonders... does the program need fixing or ... ;?) |
||
|
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges:
|
What SekeRob is trying to say in his convoluted way is that you may want to rethink how many CEP2s you allow to run concurrently on your machine (definitely not all cores). I seem to remember an advice from one of those so many aforementioned threads saying it's best to allow only 50% of your cores or 50%+1 cores run the CEP2 project.
----------------------------------------Of course if the current set up was working for you previously and suddenly you are experiencing issues, then you may need to look into other potential causes such as new installations on your machine or if you are running something differently since you started noticing the failures. Either way, it's a good idea to run a few searches on the forum too. If you manage to figure out how that works, it will provide a lot of insight on the matter. I hope this helps. Cheers! Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() |
||
|
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
In my 'convoluted way' telling you there's even a Start Here FAQ, the Forum Navigation & Search How To's", no different with Boolean operators and all as any other search engine, meaning if you can't use the search function here to half way it's optimum, you're not using Google/Bing either to it's best potential
![]() |
||
|
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges:
|
if you can't use the search function here to half way it's optimum, you're not using Google/Bing either to it's best potential tongue Yes, guilty as charged, SekeRob, but before you lock me up and throw away the key, I want to ask you this: are you happy with the forum software (including the search functionality) as it is? I think not and I believe you've already spoken your mind on that. Yes, it is what it is and we (apparently) have to live with it, but that doesn't mean we have to like it or make it look like it's normal. So let's all admit that we are living in the 21st century where you don't have to even know what Boolean operators are (let alone use them) in order to make a simple query or search. And yes, I will stand behind what I said about the answer you've provided to Jerome's issue. So although I understand your frustration about seeing the same question being asked for the umptieth time, that doesn't justify the laconic reply. I know this may sound bitter and/or harsh, but I am trying to understand why the "riddle me this, riddle me that" answers when you can just ignore this type of questions and focus on the real issues where your knowledge can be put to good use. Let somebody else answer them or let the cruncher bang his/her head against the wall until they think about actually using the search functionality of this decrepit forum software and figure out how to make it work. Cheers! Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() |
||
|
|
[AF>Le_Pommier] Jerome_C2005
Cruncher Joined: Aug 17, 2006 Post Count: 29 Status: Offline Project Badges:
|
What SekeRob is trying to say in his convoluted way is that you may want to rethink how many CEP2s you allow to run concurrently on your machine (definitely not all cores). I seem to remember an advice from one of those so many aforementioned threads saying it's best to allow only 50% of your cores or 50%+1 cores run the CEP2 project. Of course if the current set up was working for you previously and suddenly you are experiencing issues, then you may need to look into other potential causes such as new installations on your machine or if you are running something differently since you started noticing the failures. Either way, it's a good idea to run a few searches on the forum too. If you manage to figure out how that works, it will provide a lot of insight on the matter. I hope this helps. Cheers! How can I possibly limit the number of concurrent CEP2 running ?? I just have no idea, the only is to prevent completely CEP2 from the WCG setup but that would be a shame. Any clue ? Thanks |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7846 Status: Offline Project Badges:
|
There is a setting in the profiles called "Project Specific Settings" by which you can limit the number of workunits per host. Start with a low number and gradually raise it to see how many your system can handle. This does not directly address your question about how many can run concurrently, but does provide a mechanism for limiting the number your host has at any one time. This does necessitate running another project to keep your host busy. Maxing out your host with all CEP2 on multiple cpu systems tends to bring those systems to their knees due to the tremendous I/O during the startup (Job 1) of these CEP2 tasks.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
|