| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 10
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
When I (and others on my team) click on the links in the Result Status screen (https://secure.worldcommunitygrid.org/ms/viewBoincResults.do), under "Status" (i.e. "Pending Validation", "Valid"), we get an error like this:
----------------------------------------
Is this a known issue? [Edit 2 times, last edit by Former Member at Jun 15, 2007 11:38:34 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
That's a normal warning message, you can ignore it.
The error is in the next line: No heartbeat from core client. This means there is a problem with RPC communication between the core client and the science application. Since the client is no longer controlling it, the science application kills itself. If this happens rarely, it isn't a problem. The core client will restart the science, and pick up from the last checkpoint. You will see a message: Result (result name) exited with zero status but no 'finished' file If this happens repeatedly you may need to reset the project. If it happens frequently, then it is a major problem. Often it is caused by a software conflict, and we will need to do some troubleshooting. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
For peace of mind, the fact that the work units get to be listed as "Pending Validation" means the servers, when auditing the job, found no immediate client side error. It's a question of patiently waiting if the job goes with quorum complete into 'Invalid' or 'Valid'. Latter ideal.
----------------------------------------The states a work unit have are described here: http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=6105
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Maybe I'm not being clear. That error appears for every Valid and Pending result on the website's Result Status page. My results look fine, it's just this one link that looks funky. And there is nothing that says "No heartbeat from core client. "
----------------------------------------[Edit 1 times, last edit by Former Member at Jun 14, 2007 1:46:26 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
You posted it yourself:
Result Log You can and should ignore the VersionInfo warning. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Here's a more complete piece of the log from my desktop client
----------------------------------------<core_client_version>5.8.16</core_client_version> <![CDATA[ <stderr_txt> World Community Grid AutoDock (projects/www.worldcommunitygrid.org/wcg_faah_autodock_5.28_windows_intelx86) version Failed to get VersionInfo size: 1812 Failed to get VersionInfo size: 1812 INFO: projects/www.worldcommunitygrid.org/wcg_faah_autodock_5.28_windows_intelx86 Start AutoGrid... INFO:[23:50:07] Start AutoGrid... That is just the initial set up stage and after there should be pages of process progress entries ending with the </stderr_txt>. If there is not with that 'exited', it's hard to believe such a work unit managed to get in the Result Status page without an 'Error'. If there is reams of records and you just snipped, then your client recovered. Revert to Didactylos initial post as cause finding is needed on frequent occurrence.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
You posted it yourself: Oops! I see it now. :) |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Can we take it that you don't have that problem often enough to notice, and you do, in fact, have absolutely nothing to worry about?
If so: that's great. Happy crunching. And if you don't mind, please will you edit your original post and insert [COMPLETED] after the title? Thanks. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Can we take it that you don't have that problem often enough to notice, and you do, in fact, have absolutely nothing to worry about? If so: that's great. Happy crunching. And if you don't mind, please will you edit your original post and insert [COMPLETED] after the title? Thanks. I have that problem with every single result. (the versioninfo one) I agree there's nothing to worry about, but I'm not sure that error should show up on the results screen. I see it is just showing the stderr log now, but it says "Result Log" which causes some of my confusion. Anyway, since others don't have a problem with it, I guess it's not a big deal! I'll mark it as complete. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
For information only:
The stderr section isn't really intended for members to use. For successful results, it is pretty much ignored altogether. It is no more and no less than what the science application logs to stderr while it is running. Some science apps log more, some log less. We had one beta application that logged insane amounts of trivial information! So, what good is it? When something goes wrong, we (or the techs) can examine the trace of exactly what the science application did, where it crashed and how it crashed. It's exceedingly useful for giving a headstart in troubleshooting or debugging. It is marked "Result Log" because it is the metainformation that gets returned with the results files. As you observe, it's nearly all stderr log. And a final observation: if you are trying to troubleshoot a failed unit that hasn't uploaded, you can find the stderr.txt file in the slot directory. |
||
|
|
|