| World Community Grid Forums
Thread Status: Active Total posts in this thread: 10
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello,
I just installed BOINC 5.8.16 (because of an old GLIBC version) on a CentOS 3.9 system. After I registered the client, it quickly started downloading several work units (the system has 8 CPUs). However, 4 of them failed within a couple of seconds of downloading, always with the same message:
<core_client_version>5.8.16</core_client_version>
Another 4 work units are still in progress, at least according to the website, but the system is more than 99% idle, so no real processing is taking place. Even with "./boinc -return_results_immediately" I do not see any activity. What could be going on here? |
||
|
|
JmBoullier
Former Community Advisor Normandy - France Joined: Jan 26, 2007 Post Count: 3716 Status: Offline Project Badges:
|
Welcome here danielfrank!
Could you please post the message log of your BOINC client, from the very first line down to the error messages (or down to the end if it is no longer doing anything)?
Read you later. Jean.
---------------------------------------- |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I noticed that everything works on another system that runs OpenSolaris with a CentOS 5.4 zone, so I have already rebuilt my BOINC zone and it is running fine now.
I don't know for sure whether the old glibc in CentOS 3.9 is to blame, but it could be the problem. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Interesting, OpenSolaris with a CentOS zone. This is the first time I have read of someone getting it to run on WCG. There has been only one previous mention here of http://www.opensolaris.com/get/index.jsp, exactly one year ago, and a few members run CentOS, but never the two in combination.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
OpenSolaris with a Linux zone doesn't work as well as I thought, at least for WCG. Only Help Cure Muscular Dystrophy - Phase 2 seems to process fine.
All work units of the other projects (of the "available projects") fail in some way: some fail within a couple of seconds, others fail after some time. I've already opted out of these projects to avoid causing unnecessary errors. If someone is interested, I can write a quick guide on how to get WCG running in a Linux zone in OpenSolaris, but considering the moderate results, I'm not sure it would do WCG any good to make it too easy right now. If there's any interest in debugging the failures during processing of the work units, I'm available. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
If HCMD2 runs, then it might be an available-memory issue, as this science is by far the smallest, with a footprint of 7-9 MB RAM and 47-380 MB VM use on a Windows system. The message log stored in the stdoutdae.txt file and the Result Status page > Status links of the errors may give more hints.
Unless there's interest from the members, I don't think there's much to gain from a write-up, but thanks for the offer. Happy crunching... on HCMD2... every little bit helps.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"If HCMD2 runs, then it might be an available-memory issue, as this science is by far the smallest, with a footprint of 7-9 MB RAM and 47-380 MB VM use on a Windows system. The message log stored in the stdoutdae.txt file and the Result Status page > Status links of the errors may give more hints."
There shouldn't be any memory issues: I have allocated the zone a total of 4 GB RAM with 7 WUs processing in parallel, so every WU should have around 512 MB available. Also, when I check the error output of some work units, it looks to me as if there are differences during calculation.
For example, the WU CMD2_0396-PGTBA.clustersOccur-1YNS_A.clustersOccur_13_80445_81058_80549_80676_0-- ran for 2.9 hours and gave the following result:
<core_client_version>6.10.17</core_client_version>
To me this looks like the WU finished perfectly fine, and there's also no error in the BOINC client log.
For HFCC_s2_01780840_s2_0000_0-- I would assume that some of the calculations just give different results than expected:
INFO:[02:31:03] Start AutoGrid...
This seems to happen on all of the HFCC WUs. |
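As a rough check of the memory estimate above, here is a minimal sketch. It assumes the 4 GB is split evenly across the 7 running WUs, which the zone does not actually guarantee, and the helper name `per_task_mb` is purely illustrative.

```python
def per_task_mb(total_mb: int, tasks: int) -> int:
    """Return the whole number of MB each task gets under an even split."""
    if tasks <= 0:
        raise ValueError("tasks must be positive")
    return total_mb // tasks


# Figures taken from the post: 4 GB for the zone, 7 WUs in parallel.
zone_mb = 4 * 1024
running_wus = 7

# Assumption: memory is shared evenly; real usage varies per project.
share_mb = per_task_mb(zone_mb, running_wus)
print(f"{share_mb} MB per WU")
```

Under that assumption each WU gets roughly 585 MB, which is above the 47-380 MB VM range quoted for HCMD2 on Windows. The footprint of the other projects is not stated in this thread, so this only makes a memory shortage look unlikely and does not rule it out.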
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Thanks for following through in the discovery. The -131 has been reported a few times in the past; here is one thread: http://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,19794
Going by the official message description, it seems the output file gets too big, maybe indeed because of the many logged messages that we don't generally see for a good result:
ERR_FILE_TOO_BIG -131
One of the output files is bigger than the maximum set by the project for upload. BOINC will not try to upload this file.
Solution: Go to the project's forums and report this behavior.
The first log is normal, btw, for the HCMD2 task.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"Thanks for following through in the discovery. The -131 has been reported a few times in the past [...] Solution: Go to the project's forums and report this behavior."
OK, is there anywhere else (besides here) I should report this? Still, the WU is listed with an error for my client, and I can see two other results done by other people that are valid. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Sorry, I misunderstood and thought these HCMD2 jobs were turning out valid. I would rather have expected a normally ending task to end up in an Invalid state, but from what you say, they never pass through a Pending Validation state and only go bust when the quorum comparison is done.
As for the techs/programmers investigating this... it's a judgment call: this is a very rare configuration / OS. The WCG staff will have to say whether they're in for a look... they have their hobby moments and Eurekas at times.
----------------------------------------
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
|