World Community Grid - View Thread

World Community Grid Forums

Category: Active Research

Forum: Africa Rainfall Project

Thread: invalid

Quick Go »

No member browsing this thread

Thread Status: Active
Total posts in this thread: 15

[ ]

Author

This topic has been viewed 1592 times and has 14 replies

savas
Cruncher
Joined: Sep 21, 2021
Post Count: 34
Status: Offline


Re: invalid

After reviewing the compiler flags and validator code, we have determined that compliance with IEEE-754 should not be expected, which was probably a deliberate choice for the sake of performance. It is common in HPC to lean towards performance if possible. Most notably, the use of -fp-model precise as opposed to -fp-model strict and optimization level 3 with -O3 suggest there would be variance in floating point arithmetic between different architectures in the production build of ARP1.

The validation logic for the ARP1 validators includes a comparison of the md5sum of the result.out files produced by the ARP1 workunits being compared, and we would therefore certainly expect that differences could cause these invalids. It is also possible that this stricter check acts as a heuristic to keep the accumulation of rounding error low, so that subsequent generations and the final dataset are within tighter tolerances to the potential floating point error - but this is just speculation as we did not write the code.

[Jan 24, 2025 11:51:29 PM]

cliviafreak
Cruncher
Joined: Jan 13, 2025
Post Count: 21
Status: Offline
Project Badges:

45 day badge for Africa Rainfall Project


Re: invalid

Thanks. That is good to hear!

[Jan 25, 2025 3:09:57 AM]

Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1087
Status: Offline
Project Badges:

180 day badge for Smash Childhood Cancer

45 day badge for Microbiome Immunity Project

1 year badge for Africa Rainfall Project

1 year badge for OpenPandemics - COVID-19


Re: invalid

Thank you Savas.

[Jan 25, 2025 6:16:15 AM]

fufu
Cruncher
Joined: Mar 20, 2006
Post Count: 44
Status: Offline
Project Badges:

2 year badge for Human Proteome Folding - Phase 2

90 day badge for Discovering Dengue Drugs - Together

2 year badge for Nutritious Rice for the World

1 year badge for The Clean Energy Project

2 year badge for Help Fight Childhood Cancer

1 year badge for Influenza Antiviral Drug Search

2 year badge for Help Cure Muscular Dystrophy - Phase 2

1 year badge for Discovering Dengue Drugs - Together - Phase 2

2 year badge for The Clean Energy Project - Phase 2

2 year badge for Computing for Clean Water

1 year badge for Drug Search for Leishmaniasis

2 year badge for GO Fight Against Malaria

2 year badge for Computing for Sustainable Water

20 year badge for Mapping Cancer Markers

2 year badge for Uncovering Genome Mysteries

20 year badge for Outsmart Ebola Together

10 year badge for FightAIDS@Home - Phase 2

20 year badge for Smash Childhood Cancer

10 year badge for Microbiome Immunity Project

5 year badge for Africa Rainfall Project

5 year badge for OpenPandemics - COVID-19


Re: invalid

I have the same problem on my new Mac mini with M4 chip. Around 50% invalids with ARP. No invalids at all with MCM.

[Jan 25, 2025 12:57:43 PM]

catchercradle
Senior Cruncher
England
Joined: Jan 16, 2009
Post Count: 154
Status: Offline
Project Badges:

14 day badge for Drug Search for Leishmaniasis

14 day badge for OpenPandemics - COVID-19


Re: invalid

My understanding is that the rounding algorithms vary slightly between AMD and Intel chips. This is why CPDN use a statistical analysis to decide which results to include in the scientific research rather than having more than one computer complete the task and looking for exact matches.

To test another variable on the same machine I ran some tasks both using WINE and using Windows in a VM. Both operating systems produced identical results. Currently CPDN tasks are all single OS,i.e. windows, Linux or MacOS so it is not possible to do comparison on whether the executables for the different operating system make a difference.

I would suggest the invalid results are most likely to be related to the rounding differences However as noted above WCG doesn't let you see your wingmen's computers to check this. I will try and find on the CPDN boards what Glenn wrote about the code used for the ARP tasks as he knows something about it.

[Jan 25, 2025 3:30:26 PM]

[ ]