| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 3593
|
|
| Author |
|
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 1293 Status: Offline Project Badges:
|
Another one to add to the list. This one is "too late"
0003468_148 https://www.worldcommunitygrid.org/contribution/workunit/737134676 |
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
ARP are flowing again.
Mike ![]() |
||
|
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 2173 Status: Offline Project Badges:
|
ARP are flowing again. Well, that might be a bit of an overstatement. So far, it's a bit of a light drizzle rather than a flowing stream... Mike ![]() Ralf ![]() |
||
|
|
Mike.Gibson
Ace Cruncher England Joined: Aug 23, 2007 Post Count: 12594 Status: Offline Project Badges:
|
4+2 is a raging torrent compared with the Atacama that was last week.
Mike ![]() |
||
|
|
MJH333
Senior Cruncher England Joined: Apr 3, 2021 Post Count: 300 Status: Offline Project Badges:
|
4+2 is a raging torrent compared with the Atacama that was last week. Agreed!Mike ![]() I’ve hit my project limits for ARP1, which is the first time this has happened (or got anywhere near) in months. Let’s hope this is not just a one-off! Cheers, Mark |
||
|
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 2173 Status: Offline Project Badges:
|
4+2 is a raging torrent compared with the Atacama that was last week. Got a light summer rain in the evening, just enough to get a couple more pipes wet... Mike ![]() Ralf ![]() |
||
|
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2346 Status: Offline Project Badges:
|
A remarkable workunit, I found:
The task with suffix _1 below was aborted 4 days after being distributed, after reaching 2 checkpoints, reason: "no longer usable". The task with suffix _2 was distributed within the original deadlines (2025-07-09T16:50) of the tasks with suffixes _0 and _1. workunit 738775281 ARP1_0014328_148_0 Linux Fedora P. Validation 2025-07-03T16:50:27 2025-07-04T23:32:45 7.74/7.80 683.3/0.0Details: ---------------------------------------------------------------------------------------------------------------- ARP1_0014328_148_0 Linux Fedora P. Validation 2025-07-03T16:50:27 2025-07-04T23:32:45 7.74/7.80 683.3/0.0 Adri |
||
|
|
phytell
Cruncher Joined: Sep 8, 2014 Post Count: 39 Status: Offline |
If you're going to put a significant amount of effort into identifying stuck units, it may be worth noting that Boinc keeps a job log of all completed tasks, which could be processed to identify the highest generation a specific unit has reached. However, I suspect that even if we combined logs from everyone on this thread we'd still be missing a significant amount of the total units. It might be possible to identify those stuck in the first few generations, but anything more ambitious than that would be struggling against a lot of missing data.
|
||
|
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 1317 Status: Offline Project Badges:
|
Using the job log is a reasonable suggestion except for WUs like those recent Darwin ones that couldn't achieve validation. They'd be in the job log because the client thinks they are "success" tasks...
----------------------------------------The best way to track stuck units would have been for folks to always announce problem tasks; we have seen some of that both in the [distant] past and recently, but I suspect that a lot of tasks have either been "lost" by the system or failed without anyone mentioning them here (often because the user is in "fire and forget" mode and never checks!). We only know about the Darwin issue because Unixchick has been flagging up tasks with validation issues, and I rather suspect there are [many?] more we don't know about! As for whether the whole thing is an exercise in futility, that might depend on how many of the users who handle (say) 100+ ARP1 tasks a day might provide data. (My average of 15 or so returns a day would be hardly a drop in the ocean, but I would always report any WU that got stuck!) Cheers - Al. [Edit 1 times, last edit by alanb1951 at Jul 10, 2025 2:14:13 AM] |
||
|
|
catchercradle
Senior Cruncher England Joined: Jan 16, 2009 Post Count: 167 Status: Offline Project Badges:
|
Interesting,
With just one machine even though with VMs it sometimes pretends to be up to 4 machines, I struggle to get as many as 15 tasks a day. I check my results most days and have been involved with crunching ARP since it came out and have still to notice a stuck work unit be that native Linux, Linux in a VM or Windows in a VM. So we have any figure for the percentage of stuck units? |
||
|
|
|