| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 4
|
|
| Author |
|
|
Seoulpowergrid
Veteran Cruncher Joined: Apr 12, 2013 Post Count: 823 Status: Offline Project Badges:
|
Edit: The issue seems to have resolved itself. Thanks for input and suggestions.
----------------------------------------I have a headless linux box I've run for a year or two. Everything has run well, plenty of ram, lots of threads, even ran tons of CEP2 WUs through it. After a bunch of MIP WUs errored out the system stopped returning WCG WUs and stopped getting new WUs. "top" command shows me boinc is running, but no WUs are active and all ram is available. I restarted the machine several times but still nothing is running. I am inside the box via SSH client and doing command boinccmd --get_state it shows all the tasks are uninitialized: name: MIP1_00004808_0689_0 WU name: MIP1_00004808_0689 project URL: http://www.worldcommunitygrid.org/ report deadline: Fri Sep 29 17:33:20 2017 ready to report: yes got server ack: no final CPU time: 2.052000 state: compute error scheduler state: uninitialized exit_status: 1 signal: 0 suspended via GUI: no active_task_state: UNINITIALIZED app version num: 0 checkpoint CPU time: 0.000000 current CPU time: 0.000000 fraction done: 0.000000 swap size: 0 MB working set size: 0 MB estimated CPU time remaining: 0.000000 I am trying to do a reset or resume but seems I am unable to give the proper command. I hope I am making a noob mistake as I've tried boinccmd -- project http://www.worldcommunitygrid.org/ resume boinccmd -- project http://www.worldcommunitygrid.org/ op resume boinccmd -- project http://www.worldcommunitygrid.org/ op = resume and also tried those minus the "http://" but what comes up is a list of commands possible to use in boinc. Any ideas what I am doing wrong? ![]() [Edit 1 times, last edit by Seoulpowergrid at Sep 22, 2017 8:37:59 AM] |
||
|
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
http://boinc.berkeley.edu/wiki/Boinccmd_tool
----------------------------------------boinccmd - - help If you're stuck without the manual Edit, small as commands in Linux are case sensitive (never delved into the intelligence of that) [Edit 1 times, last edit by SekeRob* at Sep 22, 2017 9:31:45 AM] |
||
|
|
SekeRob
Master Cruncher Joined: Jan 7, 2013 Post Count: 2741 Status: Offline |
As with nanoprobe, you're best off detaching and readding WCG to clean any mess, plus you need update instead of resume as latter does not force a communication with project.
|
||
|
|
Seoulpowergrid
Veteran Cruncher Joined: Apr 12, 2013 Post Count: 823 Status: Offline Project Badges:
|
I was using that link but it looks like I am doing something wrong still. I was hoping to do a reset as that means I could keep the HST files I had in there, but if I can't figure it out in the next day or so I'll need to detach or remove the program and bring in a fresh install.
----------------------------------------*5 minutes later and more searching/testing* Well I gave it the update, resume, and set_run_mode always, and get_simple_gui_info commands again and suddenly I am downloading a ton of WUs. All the other 100+ WU I had, which boinc said were uninitialized, apparently ran and were sent back to WCG at the same moment. Most of the MIP errored out but the others apparently ran and most already are marked valid. I'm scratching my head but valid is valid and I am running all threads now so *shrugs* looks like the issue is resolved. Thanks for your input and I'm glad the rig is up and running. ![]() |
||
|
|
|