Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 3
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1305 times and has 2 replies Next Thread
wplachy
Senior Cruncher
Joined: Sep 4, 2007
Post Count: 423
Status: Offline
Reply to this Post  Reply with Quote 
"aborted by project - no longer usable" WUs Have Detached Status

I need some help understanding what is happening and if it indicates a problem with the PCs?

Twice in the last 5 days on two different PCs I have had a number of WUs terminate with a grid status of "Detached". The devices have not been detached from the project, have not been shutdown and the logs indicate processing has not been interrupted.

The PCs are running BOINC client 6.10.58 under Win7-64. Both run 24X7 and are set for 100% CPU and 100% network available. Both are reliable devices, and after the mass termination pick up 1 or 2 repair tasks.

From what I can find the message "aborted by project - no longer usable" indicates the server terminated the WU's. If so, why are they showing "Detached"?

The terminations have occurred in a group and in both cases the log message preceeding the terminations has been:
"Message from server: Resent lost result wu name "

Result Log (All are the same except WU Name...nothing shown except wu name)

Result Name: c4cw_ target05_ 147543057_ 0--

The WUs terminated in this last batch are:
c4cw_target05_147741608_0
c4cw_target05_147721659_0
c4cw_target05_147725350_0
c4cw_target05_147707254_0
c4cw_target05_147709821_0
c4cw_target05_147673629_0
c4cw_target05_147657183_0
c4cw_target05_147655200_0
c4cw_target05_147652302_0
c4cw_target05_147651151_0
c4cw_target05_147653216_0
c4cw_target05_147659968_0
c4cw_target05_147656171_0
c4cw_target05_147543057_0
c4cw_target05_147495463_0
c4cw_target05_147493259_0
c4cw_target05_147430025_0
c4cw_target05_147436136_0
c4cw_target05_147439228_0
c4cw_target05_147439986_0
c4cw_target05_147381400_0

Log from last occurance (log time is UTC -6 Hrs)
World Community Grid	03/01/12 05:41:01 PM	Sending scheduler request: To fetch work.	
World Community Grid 03/01/12 05:41:01 PM Requesting new tasks for CPU
World Community Grid 03/01/12 05:41:01 PM [sched_op_debug] CPU work request: 20.55 seconds; 0.00 CPUs
World Community Grid 03/01/12 05:41:01 PM [sched_op_debug] NVIDIA GPU work request: 0.00 seconds; 0.00 GPUs
World Community Grid 03/01/12 05:41:07 PM Scheduler request completed: got 1 new tasks
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Server version 601
World Community Grid 03/01/12 05:41:07 PM Message from server: Resent lost result c4cw_target05_147767977_0
World Community Grid 03/01/12 05:41:07 PM Project requested delay of 11 seconds
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] estimated total CPU job duration: 5637 seconds
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] estimated total NVIDIA GPU job duration: 0 seconds
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Deferring communication for 1 min 0 sec
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Reason: Unrecoverable error for result c4cw_target05_147381400_0 (aborted by project - no longer usable)
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Deferring communication for 1 min 38 sec
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Reason: Unrecoverable error for result c4cw_target05_147430025_0 (aborted by project - no longer usable)
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Deferring communication for 5 min 36 sec
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Reason: Unrecoverable error for result c4cw_target05_147493259_0 (aborted by project - no longer usable)
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Deferring communication for 14 min 16 sec
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Reason: Unrecoverable error for result c4cw_target05_147495463_0 (aborted by project - no longer usable)
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Deferring communication for 44 min 21 sec
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Reason: Unrecoverable error for result c4cw_target05_147543057_0 (aborted by project - no longer usable)
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Deferring communication for 11 sec
World Community Grid 03/01/12 05:41:07 PM [sched_op_debug] Reason: requested by project
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147381400_0 finished
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147439986_0 finished
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147439228_0 finished
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147436136_0 finished
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147430025_0 finished
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147493259_0 finished
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147495463_0 finished
World Community Grid 03/01/12 05:41:08 PM Computation for task c4cw_target05_147543057_0 finished
World Community Grid 03/01/12 05:41:09 PM Starting c4cw_target05_147767977_0
World Community Grid 03/01/12 05:41:09 PM Starting task c4cw_target05_147767977_0 using c4cw version 641
World Community Grid 03/01/12 05:41:22 PM [sched_op_debug] Starting scheduler request
World Community Grid 03/01/12 05:41:22 PM Sending scheduler request: To report completed tasks.
World Community Grid 03/01/12 05:41:22 PM Reporting 21 completed tasks, requesting new tasks for CPU and GPU
World Community Grid 03/01/12 05:41:22 PM [sched_op_debug] CPU work request: 63519.87 seconds; 7.00 CPUs
World Community Grid 03/01/12 05:41:22 PM [sched_op_debug] NVIDIA GPU work request: 8640.86 seconds; 1.00 GPUs
World Community Grid 03/01/12 05:41:28 PM Scheduler request completed: got 12 new tasks
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] Server version 601
World Community Grid 03/01/12 05:41:28 PM Project requested delay of 11 seconds
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] estimated total CPU job duration: 67639 seconds
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] estimated total NVIDIA GPU job duration: 0 seconds
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147381400_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147439986_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147439228_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147436136_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147430025_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147493259_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147495463_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147543057_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147659968_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147656171_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147653216_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147652302_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147651151_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147655200_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147657183_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147673629_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147709821_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147707254_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147725350_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147721659_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] handle_scheduler_reply(): got ack for result c4cw_target05_147741608_0
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] Deferring communication for 11 sec
World Community Grid 03/01/12 05:41:28 PM [sched_op_debug] Reason: requested by project

----------------------------------------
Bill P

[Mar 2, 2012 3:43:03 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: "aborted by project - no longer usable" WUs Have Detached Status

Virtual timemachine simulation: Wonder back in mind and check if an Account Management System [AMS] was recently attached to the clients e.g. BAM (BOINCStats Account Manager). If not configured (on the website of BAM), there is the tendency to detach from WCG, when WCG was already attached.

Back today, some members have reported that this is not the case for them. The client somehow has some administrative foul-up, that looses downloaded tasks, talks to the server, and this one tries to send any lost information, verification fails and the tasks get declared "detached" http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=6105, which to the client is executed as an abort. The "detached" status indication on the Result Status pages actually do not occur until the client connects again. Not sure how or why, maybe LAN networking issues. These somehow screw up the connecting of the core client with running tasks which then crash.

One thought: Any chance of IP address collisions on your LAN?

ttyl

--//--
----------------------------------------
[Edit 1 times, last edit by Former Member at Mar 2, 2012 9:27:14 AM]
[Mar 2, 2012 9:26:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
wplachy
Senior Cruncher
Joined: Sep 4, 2007
Post Count: 423
Status: Offline
Reply to this Post  Reply with Quote 
Re: "aborted by project - no longer usable" WUs Have Detached Status

Thank you for the response!

snip... Wonder back in mind and check if an Account Management System [AMS] was recently attached to the clients e.g. BAM (BOINCStats Account Manager).

I haven't ever used an AMS.
snip..One thought: Any chance of IP address collisions on your LAN?

All devices have a unique IP. I did recently upgrade to a Cisco gigabit router that got less than stellar reviews, perhaps that may be the problem. If it occurs again I'll switch back and see what develops.
Thanks again :-)
----------------------------------------
Bill P

[Mar 2, 2012 2:59:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread