| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 5
|
|
| Author |
|
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges:
|
I'm getting one cruncher that is turning a lot of inconclusives and invalids. The WUs start, get to 65-70% and restart, somethimes twice. Here is what the log says about the restart.
----------------------------------------invalid Commandline = projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.41_windows_x86_64 -screen none -in in.wcg.acc -var wcgsteps1 10000 -var wcgsteps2 10000 -var loop 0 -var restart 0 -var rinterval 100 -var ifile in.wcg.acc -var wcgseed 30696201 inconclusive Commandline = projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.41_windows_x86_64 -screen none -in in.wcg.acc -var wcgsteps1 10000 -var wcgsteps2 6401 -var loop 2 -var restart 1 -var rinterval 100 -var ifile in.wcg.acc -var wcgseed 30733777 Both errors look the same except for the number at the end. What's going on?
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
![]() ![]() |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
See no difference of relevance with a valid result:
----------------------------------------<stderr_txt> Commandline = ../../projects/www.worldcommunitygrid.org/wcg_c4cw_lmps_6.41_x86_64-pc-linux-gnu -screen none -in in.wcg.acc -var wcgsteps1 10000 -var wcgsteps2 10000 -var loop 0 -var restart 0 -var rinterval 100 -var ifile in.wcg.acc -var wcgseed 24952357 Are those that pass through the Inconclusive state ever get into a Valid after the wingman has returned? Probably you'd want to set some additional log flags in the client cc_config.xml http://boinc.berkeley.edu/wiki/Cc_config.xml so it will print out more around the times when the results restart.. [review stdoutdae.txt for anomalies during those restarts]. Those restarts can create enough of a small variation not to render them valid... tight tolerances in order to be able to run zero redundancy. --//-- edit: flags to set: <task_debug> <mem_usage_debug> <cpu_sched> <app_msg_receive> <app_msg_send> It's not the first time reported that restarted tasks for CFCW don't check and go invalid, so we like to find why/when they restart. Maybe the techs can inject if I missed any relevant flags to further a debug. [Edit 1 times, last edit by Former Member at Jun 28, 2011 7:25:00 PM] |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7849 Status: Offline Project Badges:
|
If it is only specific to one machine, I would look at a hardware issue. I have a couple of suggestions. First, shut down the machine fully and do a cold start. this will make sure all memory and registers are reset to zero. Second, I would look at heating issues - dust in the heat sinks or defective fans/inadequate air flow. I only mention these because I have had these items happen on long running systems. Good luck.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
nanoprobe
Master Cruncher Classified Joined: Aug 29, 2008 Post Count: 2998 Status: Offline Project Badges:
|
Are those that pass through the Inconclusive state ever get into a Valid after the wingman has returned? What's the best way to determine that?
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.
![]() ![]() |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7849 Status: Offline Project Badges:
|
Are those that pass through the Inconclusive state ever get into a Valid after the wingman has returned? What's the best way to determine that? Check under result status. Most of the inconclusives will turn valid eventually. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
|