Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 15
Posts: 15   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 354 times and has 14 replies
savas
Cruncher
Joined: Sep 21, 2021
Post Count: 30
Status: Recently Active
Reply to this Post  Reply with Quote 
Re: invalid

After reviewing the compiler flags and validator code, we have determined that compliance with IEEE-754 should not be expected, which was probably a deliberate choice for the sake of performance. It is common in HPC to lean towards performance if possible. Most notably, the use of -fp-model precise as opposed to -fp-model strict and optimization level 3 with -O3 suggest there would be variance in floating point arithmetic between different architectures in the production build of ARP1.

The validation logic for the ARP1 validators includes a comparison of the md5sum of the result.out files produced by the ARP1 workunits being compared, and we would therefore certainly expect that differences could cause these invalids. It is also possible that this stricter check acts as a heuristic to keep the accumulation of rounding error low, so that subsequent generations and the final dataset are within tighter tolerances to the potential floating point error - but this is just speculation as we did not write the code.
[Jan 24, 2025 11:51:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
cliviafreak
Cruncher
Joined: Jan 13, 2025
Post Count: 16
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: invalid

Thanks. That is good to hear!
[Jan 25, 2025 3:09:57 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: invalid

Thank you Savas.
[Jan 25, 2025 6:16:15 AM]   Link   Report threatening or abusive post: please login first  Go to top 
fufu
Cruncher
Joined: Mar 20, 2006
Post Count: 40
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: invalid

I have the same problem on my new Mac mini with M4 chip. Around 50% invalids with ARP. No invalids at all with MCM.
[Jan 25, 2025 12:57:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
catchercradle
Advanced Cruncher
Joined: Jan 16, 2009
Post Count: 103
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: invalid

My understanding is that the rounding algorithms vary slightly between AMD and Intel chips. This is why CPDN use a statistical analysis to decide which results to include in the scientific research rather than having more than one computer complete the task and looking for exact matches.

To test another variable on the same machine I ran some tasks both using WINE and using Windows in a VM. Both operating systems produced identical results. Currently CPDN tasks are all single OS,i.e. windows, Linux or MacOS so it is not possible to do comparison on whether the executables for the different operating system make a difference.

I would suggest the invalid results are most likely to be related to the rounding differences However as noted above WCG doesn't let you see your wingmen's computers to check this. I will try and find on the CPDN boards what Glenn wrote about the code used for the ARP tasks as he knows something about it.
[Jan 25, 2025 3:30:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 15   Pages: 2   [ Previous Page | 1 2 ]
[ Jump to Last Post ]
Post new Thread