Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
![]() |
World Community Grid Forums
![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 21
|
![]() |
Author |
|
Dayle Diamond
Senior Cruncher Joined: Jan 31, 2013 Post Count: 452 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I'm not getting any errors, but I'm getting a whole lot of _2 and _3 resends.
Looking into the logs, looks like errors are abundant, just not with me? Here's some samples, but there's others where the log is just...blank. Result Name: MCM1_ 0011041_ 2002_ 0-- <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> CreateProcess() failed - Access is denied. (0x5) </message> ]]> Result Name: MCM1_ 0010988_ 3475_ 0-- <core_client_version>7.4.27</core_client_version> <![CDATA[ <message> couldn't start app: CreateProcess() failed - Kahva ei kelpaa. (0x6) </message> ]]> |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Those look like coming off machines with maybe AV or other security configured too tight or incorrectly. Nothing we can do about this. A machine generating too many outright errors is eventually limited to 1 task per day. Think in the end there will be a notice to those machines generating constant fault, if not, it would be nice if the server polled these clients with a notice to the system tray pop-up [BOINC Manager function] in hopes the member 'notices'
----------------------------------------![]() [Edit 2 times, last edit by Former Member at Feb 2, 2015 8:52:22 AM] |
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1673 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Just for information, I have more than 50 errored WUs on one host today.
----------------------------------------I loose about 10 CPU hours, the rest came to error by starting the computation. Cheers, Yves |
||
|
Mumak
Senior Cruncher Joined: Dec 7, 2012 Post Count: 477 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
I don't have any errors and I'm running mostly MCM now - returning 200-400 results per day. I suggest you check that machine.
----------------------------------------![]() |
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1673 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Hi Mumak,
----------------------------------------without any intervention on the concerned machine, MCM is computed again perfectly. I think that it was some troubles with a couple of batches. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"I think that it was some troubles with a couple of batches."
A batch is 10,000 work units and you are the only one getting serial fail, then serial valid. Give us some detail and maybe another member confirm. As my devices are on MCM diet and one or the other reports one / fetches one every 15-30 minutes, 100% valid, more probably what Mumak observed, a local problem... the ghost in the machine. |
||
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1673 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Hi SekeRob,
----------------------------------------it looks like that some of the wingmen experienced errors as well. I am currently business traveling without direct access to the concerned system but the system seems to be OK currently. Cheers, Yves |
||
|
dango
Senior Cruncher Joined: Jul 27, 2009 Post Count: 307 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
Hello,
mine situation: 23-Feb-2015 09:27:20 [World Community Grid] [sched_op] Reason: Unrecoverable error for task MCM1_0011902_3954_1 (process exited with code 22 (0x16, -234)) 23-Feb-2015 09:27:20 [World Community Grid] Computation for task MCM1_0011902_3954_1 finished 23-Feb-2015 09:27:20 [World Community Grid] Output file MCM1_0011902_3954_1_0 for task MCM1_0011902_3954_1 absent 23-Feb-2015 09:27:20 [World Community Grid] Starting task MCM1_0011902_9494_0 using mcm1 version 735 in slot 0 means from same batch 11902 some workunit (3954) failed to start (exit code 22) and another (9494) runs OK any idea? thanks |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Don't know what code 22 means... would have to search with Google. What platform was this on?
|
||
|
dango
Senior Cruncher Joined: Jul 27, 2009 Post Count: 307 Status: Offline Project Badges: ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() |
# cat /etc/debian_version
7.7 # dpkg -l|grep -i boinc ii boinc-client 7.0.27+dfsg-5 amd64 core client for the BOINC distributed computing infrastructure ii boinc-manager 7.0.27+dfsg-5 amd64 GUI to control and monitor the BOINC core client |
||
|
|
![]() |