Thread Status: Active
Total posts in this thread: 661
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1341
Status: Offline
Re: Project Status (First Post Updated)

Good to see they're going to set an early warning system on the WAS thread pool management! Touchy software, isn't it ;-)

And now we know why getting work signed off seemed to be going backwards. If they could find a way of snuffing out that bad parsing of scheduler requests (presumably a BOINC issue rather than a WCG one) at source, the "fake" Linux jobs (and associated non-HR validation issues) would eventually stop being a problem! And if they can solve that (or BOINC fix whatever it is in the (Docker-aware?) client and/or the server code), validation would be simplified again -- however, no mention of that in the update...

Recently I've been working at ways of digging out examples of stuck "2 valid, 1 pending" cases and was about to post some examples when this update appeared -- no need now, as that case appears on their list of issues and they almost certainly don't (and didn't) need examples from the end users anyway...

Hopefully, this week will see some of the issues sorted -- I don't expect them to get the lot in one go, but sorting out one part will release some of the pressure elsewhere and might make it easier to track down and resolve those awkward validation edge cases!

Cheers - Al.
[Dec 16, 2025 6:19:41 AM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1316
Status: Offline
Re: Project Status (First Post Updated)

Thank you Al for spotting the update and posting it to the thread. I really like how they are letting us know the issues and what they are working on.
[Dec 16, 2025 6:31:44 AM]
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2557
Status: Recently Active
Re: Project Status (First Post Updated)

No wonder that it could be double(ish) credits for some, when the stats look like this. The global history stats can't possibly be higher than those of the only project (MCM1) that we are running now. ARP hasn't been run since before the migration, and there's no new stats history for any project other than MCM1. And with the extremely low validation rate we have now, I doubt that any of the "Results returned" numbers below are anywhere near the true values. They look much higher than the low validation rate suggests.

Statistics history (Mapping Cancer Markers History)

Statistics date   Total run time (y:d:h:m:s)   Points generated   Results returned
12/15/2025        200:226:01:27:38             571,830,332        955,054


Global statistics history (All Projects)

Statistics date   Total run time (y:d:h:m:s)   Points generated   Results returned
12/15/2025        300:339:02:11:27             857,745,498        1,432,581

----------------------------------------
[Edit 5 times, last edit by Grumpy Swede at Dec 16, 2025 10:51:49 AM]
[Dec 16, 2025 10:43:10 AM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2361
Status: Offline
Re: Project Status (First Post Updated)

Latest Operational Status update...

December 15, 2025

  • Forum service restored, after degraded service starting roughly 03:00 UTC, December 15th, 2025 led to a crash at roughly 12:30 UTC same day - service was restored at approximately 20:00 UTC Dec 15th, 2025.

They finally get it and are now posting in UTC, bravo! :-) Thank you, thank you, thank you.

Adri
----------------------------------------
[Edit 1 times, last edit by adriverhoef at Dec 16, 2025 11:20:47 AM]
[Dec 16, 2025 11:20:15 AM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1341
Status: Offline
Re: Project Status (First Post Updated)

Regarding the discrepancies between Global daily stats and MCM1 daily stats...

If you look at the last 30 days of both and consider the post-migration days only, you'll find a few more cases where Global > MCM1 and several where MCM1 = 2*Global!

I suspect that replaying of validation datasets causes these confusions, and I'm not sure how reliable either data set is likely to be now, especially given that the Global>MCM1 examples all seem to work out as 2*Global = 3*MCM1, suggesting that the correct value in all cases could well be the difference between them!
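
For anyone who wants to check the 2*Global = 3*MCM1 pattern themselves, here is a minimal Python sketch using only the 12/15/2025 figures quoted earlier in this thread. The names (MCM1, GLOBAL, run_time_seconds, check) are made up for illustration, and the 365-day year used to convert the y:d:h:m:s run times is an assumption, not anything WCG documents. It only tests that one day's row, so it says nothing about why the numbers are off.

```python
from fractions import Fraction

# Figures copied from the 12/15/2025 rows quoted earlier in the thread.
MCM1 = {"points": 571_830_332, "results": 955_054}
GLOBAL = {"points": 857_745_498, "results": 1_432_581}


def run_time_seconds(text, days_per_year=365):
    """Convert a 'y:d:h:m:s' string to seconds.

    days_per_year=365 is an assumption; the stats page does not say
    which year length it uses.
    """
    y, d, h, m, s = (int(part) for part in text.split(":"))
    return (((y * days_per_year + d) * 24 + h) * 60 + m) * 60 + s


MCM1["run_time"] = run_time_seconds("200:226:01:27:38")
GLOBAL["run_time"] = run_time_seconds("300:339:02:11:27")


def check(metric):
    """Return (Global/MCM1 as an exact fraction, Global - MCM1)."""
    ratio = Fraction(GLOBAL[metric], MCM1[metric])
    diff = GLOBAL[metric] - MCM1[metric]
    return ratio, diff


if __name__ == "__main__":
    for metric in ("points", "results", "run_time"):
        ratio, diff = check(metric)
        print(f"{metric}: Global/MCM1 = {ratio}, Global - MCM1 = {diff:,}")
```

For that day all three ratios come out as exactly 3/2, and the differences are 285,915,166 points and 477,527 results, i.e. half the MCM1 figure and a third of the Global one.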

Ah, well, it'll either sort out eventually or there will be some people with lots of extra [unearned] credit in perpetuity ;-)

Cheers - Al.
[Dec 16, 2025 11:40:15 AM]
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2557
Status: Recently Active
Re: Project Status (First Post Updated)

@alanb1951
Yeah, I have also seen the same discrepancy before. One thing is sure though: for now, one cannot rely on either the MCM1 stats or the Global stats.

We'll see what happens in the future, when the validations start working reliably.
[Dec 16, 2025 3:27:18 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1316
Status: Offline
Re: Project Status (First Post Updated)

This is an interesting match up. Mine is the error. I'm assuming this is a download error. https://www.worldcommunitygrid.org/contribution/workunit/792354903
[Dec 16, 2025 6:50:58 PM]
TLD
Veteran Cruncher
USA
Joined: Jul 22, 2005
Post Count: 863
Status: Offline
Re: Project Status (First Post Updated)

Unixchick wrote: "This is an interesting match up. Mine is the error. I'm assuming this is a download error. https://www.worldcommunitygrid.org/contribution/workunit/792354903"

Does your computer have an Intel CPU or an Apple one?
[Dec 16, 2025 7:35:28 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1316
Status: Offline
Re: Project Status (First Post Updated)

Apple M4
----------------------------------------
[Edit 1 times, last edit by Unixchick at Dec 16, 2025 8:34:20 PM]
[Dec 16, 2025 8:34:02 PM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1341
Status: Offline
Re: Project Status (First Post Updated)

Unixchick's "interesting match up" job did indeed get the dreaded Permanent HTTP download error :-( -- at present, retries can go almost anywhere because of the Homogeneous Redundancy changes in place to avoid the "other platforms" issue.

I, for one, will be glad when they can turn HR back on again, if only because users seem to end up with the much slower 32-bit retries at present. And yes, that seems to happen to Windows users as well as Linux users if the log files for successful retries that went to so-called "Alpine Linux" clients are anything to go by...

Perhaps some users who have deliberately disabled alt_platform in their client could say whether they actually get any retries at present and, if so, whether those retries respected the "64-bit only" constraint? There are certainly still enough retries available, with download errors and [too many] missed deadline tasks...

Cheers - Al.

P.S. That download error was yet another "it happens around 16:00 to 18:00 UTC" case. I wonder what's happening early afternoon local time over there...
----------------------------------------
[Edit 1 times, last edit by alanb1951 at Dec 17, 2025 1:30:21 AM]
[Dec 17, 2025 1:25:31 AM]