Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Active Research Forum: Smash Childhood Cancer Thread: SCC computation results (or validation process) for MacOS (Darwin) seems OS-rev dependent |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 24
|
Author |
|
Taurus Oldbull
Advanced Cruncher US Joined: Nov 26, 2020 Post Count: 53 Status: Offline Project Badges: |
Yes, Disk Utility & SMC reset - no change.
|
||
|
Jeff17
Cruncher Joined: Aug 10, 2007 Post Count: 13 Status: Offline Project Badges: |
Wanted to report that I am seeing a similar pattern of Invalid SCC results when verification is done with an earlier MacOS version.
----------------------------------------My host - Darwin 22.4.0 - Mac Mini - i7-8700B When I am paired with an older OS, which seems to be most of the time, the initial verification fails and a third task is issued. This is typically also an older OS, they validate, and I get marked Invalid. |
||
|
Ian C
Cruncher Joined: Jul 28, 2022 Post Count: 27 Status: Offline |
Seeing pretty much the same - huge proportion of invalids (although credits still granted).
I have slightly given up trying to understand and am just letting it get on with it. |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 859 Status: Offline Project Badges: |
https://www.worldcommunitygrid.org/contribution/workunit/314105617
Yet another one invalid only due to OS issues. I wish the SCC team read the boards and noticed this issue. |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7581 Status: Recently Active Project Badges: |
https://www.worldcommunitygrid.org/contribution/workunit/314105617 Yet another one invalid only due to OS issues. I wish the SCC team read the boards and noticed this issue. From looking at the result posted above, it looks like upgrading to Darwin 22.5.0 might fix the problem. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 859 Status: Offline Project Badges: |
1. I can't upgrade the OS on this old machine.
2. I'd get more invalids if I upgraded as 22.5.0 only matches to 22.5.0 while all the other MacOSes match just fine. |
||
|
Ian C
Cruncher Joined: Jul 28, 2022 Post Count: 27 Status: Offline |
Rather than being an OS revision issue, I suspect that this is much rather a problem of dissimilar CPU types. On the 17.x system, you are most likely running straight x86 code on an x86 CPU. On the 22.x system(s), there is most likely x86 code running on Apple's Rosetta x86 CPU emulator, with the physical CPU being an ARM based CPU (Apple's M1 or M2). And in that case, the problem is likely a peculiarity/bug in that Rosetta emulation that is causing this. Highly unlikely that this is anything that can be fixed on the WCG/BOINC side, but much rather a problem that Apple would have admit to and fix on their end. Which might be equally unlikely... Ralf No. I run mine of an intel Mac mini running 22.5.0 (Ventura 13.4 at the mo, but pattern has been the same for several updates - but rosetta never involved) and am seeing the same pattern: with 97 results listed, 38 are invalid (similar number pending), with only 4 valid. A couple of errors from the known bad batches too. Not rosetta related. OPN runs flawlessly with or without GPU enabled, as does ARF on the odd occasions I get units. MCM has never worked properly for me. |
||
|
Unixchick
Veteran Cruncher Joined: Apr 16, 2020 Post Count: 859 Status: Offline Project Badges: |
It is a bit frustrating knowing we are screaming in the wind. Krembil won't acknowledge the issue. SCC won't know what we do about the system. Is the data valid anymore?? If the output from the different OSes is so different to get one set marked invalid, then what does this do to the whole data set? Will they get a wrong conclusion? Will they have to run the whole data set over??
|
||
|
Taurus Oldbull
Advanced Cruncher US Joined: Nov 26, 2020 Post Count: 53 Status: Offline Project Badges: |
"If the output from the different OSes is so different to get one set marked invalid, then what does this do to the whole data set? Will they get a wrong conclusion? Will they have to run the whole data set over??" This has been my primary concern also, since first starting this thread. |
||
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 873 Status: Offline Project Badges: |
Unixchick, TaurusOldBull;
----------------------------------------Frustration understood (see my P.S.) -- perhaps the following might reassure you a little (even though it doesn't make this problem go away...) Any application (such as SCC1 or OPN1/G) that uses adaptive replication has to be looking for results that are "good enough", not the perfect match that is necessary for, say, ARP1. Validation is then useful for spotting systems that produce wildly different results, on the assumption that big differences are down to hardware/run-time issues on the individual system(s), given that applications for all platforms should have been through the Beta test system and tasks are supposed to go to broadly similar systems (32-bit vs. 64-bit; choice of O/S)... The real problem then is choosing a validation threshold that catches real problems but passes minor variations. The validation method might not allow a great deal of flexibility there :-( We'd need the SCC scientists to confirm this, but I suspect that the only real issue would be if they were getting excessive false positives or false negatives, rather than some fairly minor variance in how well a ligand docks (or how badly it fails...) With the above in mind, I suspect this specific issue isn't likely to be a show-stopper, unlike those accursed MyoD1-C batches. We know they are working on that, although I wouldn't expect any real news on that until they reckon they've resolved it [more or less] completely :-) Cheers - Al. P.S. Over at Einstein@home, something like 7 or 8% of my GPU tasks for one project are declared invalid because the initial wingman and the verification wingman both happened to be on Windows where the applications are built with different libraries for one reason or another; as it doesn't happen all the time the tech folks regard it [rightly, in my view] as lower priority than all the other stuff they have to do to keep the project(s) running. It isn't all one-sided, though; another 3% or so cause a Windows wingman to be declared invalid because the verification wingman uses Linux :-) [Edit: to make the P.S. more clear] [Edit 3 times, last edit by alanb1951 at Jun 16, 2023 12:57:02 AM] |
||
|
|