Total posts in this thread: 13
This topic has been viewed 211 times and has 12 replies
luc.n.allard@gmail.com
Cruncher
Joined: Feb 5, 2018
Post Count: 12
Computation Error in clusters?

I usually keep 3 WUs running and have recently noticed that not only do those 3 WUs stop, but up to 7 or 8 more fail as well, all with "Computation Error" displayed. BOINC then seems to recover and proceeds as it normally does. The failed WUs are usually ARP, followed by MCMs that were queued up. The event log shows that output files are absent for the ARP WUs. Thanks!
[Jan 22, 2025 4:52:14 PM]
luc.n.allard@gmail.com
Cruncher
Joined: Feb 5, 2018
Post Count: 12
Re: Computation Error in clusters?

I checked the 'results' file on the host site and it shows a few of these:
rsl_malloc failed allocating 24911668 bytes, called ..\external\RSL_LITE\rsl_bcast.c, line 270, try 3
: Not enough space
[Jan 22, 2025 5:40:25 PM]
MJH333
Senior Cruncher
England
Joined: Apr 3, 2021
Post Count: 240
Re: Computation Error in clusters?

Not an expert, but that sounds like a memory error to me. Does your machine meet the minimum requirements for ARP1? See this page.
If it does, perhaps you should run some memory diagnostics.
Cheers,
Mark
[Jan 22, 2025 6:13:27 PM]
Bryn Mawr
Senior Cruncher
Joined: Dec 26, 2018
Post Count: 337
Re: Computation Error in clusters?

I checked the 'results' file on the host site and it shows a few of these:
rsl_malloc failed allocating 24911668 bytes, called ..\external\RSL_LITE\rsl_bcast.c, line 270, try 3
: Not enough space


As MJH says, your machine ran out of memory and swap space, so the WU could not allocate the roughly 24 MB it was asking for.
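To see how often this happens and how large the failed requests are, the result log can be scanned for these lines. Here is a minimal Python sketch; the pattern is inferred only from the single line quoted above (not from the RSL_LITE source), and the function name `failed_allocations` is made up for illustration.

```python
import re

# Pattern for the rsl_malloc failure line quoted in this thread. The field
# layout is an assumption based on that one example line.
MALLOC_FAIL = re.compile(
    r"rsl_malloc failed allocating (?P<bytes>\d+) bytes, "
    r"called (?P<file>\S+), line (?P<line>\d+), try (?P<attempt>\d+)"
)


def failed_allocations(text):
    """Return (bytes, MiB, source file, line, try) for each failure line."""
    found = []
    for m in MALLOC_FAIL.finditer(text):
        size = int(m.group("bytes"))
        found.append((
            size,
            round(size / 2**20, 2),  # bytes -> MiB
            m.group("file"),
            int(m.group("line")),
            int(m.group("attempt")),
        ))
    return found


# The two lines posted above, used as sample input.
sample = (
    "rsl_malloc failed allocating 24911668 bytes, "
    r"called ..\external\RSL_LITE\rsl_bcast.c, line 270, try 3" "\n"
    ": Not enough space\n"
)

for row in failed_allocations(sample):
    print(row)
```

Pasting the full text of a failed result into `sample` would list every failed allocation, which helps tell a one-off failure from a task that was starved of memory repeatedly.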
[Jan 22, 2025 7:25:58 PM]
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2092
Re: Computation Error in clusters?

Also, the computer in question may not be able to run 3 ARP tasks at the same time, perhaps because of too little memory or other restrictions.

Check this page for the System Requirements of the different projects. Each running ARP task needs 1 GB of available memory.

https://www.worldcommunitygrid.org/help/topic.s?shortName=minimumreq
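As a rough sanity check, the 1 GB per ARP task figure can be turned into a task count. The sketch below is only arithmetic: the function name `max_arp_tasks`, the `reserve_gb` headroom parameter, and the free-memory values in the loop are made up for illustration, and real usage varies from task to task.

```python
def max_arp_tasks(free_gb, per_task_gb=1.0, reserve_gb=0.0):
    """Estimate how many ARP tasks fit into the memory that is free.

    per_task_gb defaults to the 1 GB per running ARP task mentioned above.
    reserve_gb is a hypothetical safety margin left for the OS and other
    programs; it is not a BOINC setting.
    """
    usable = free_gb - reserve_gb
    if usable <= 0:
        return 0
    return int(usable // per_task_gb)


# Illustrative free-memory values (in GB), not measurements from any host.
for free in (1.5, 2.0, 2.5, 3.0):
    print(free, "GB free ->", max_arp_tasks(free, reserve_gb=0.5), "ARP task(s)")
```

Leaving some reserve matters because free memory drops whenever a browser or other program grows, and a task that fit a minute ago may then fail to allocate.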
[Jan 22, 2025 9:42:09 PM]
luc.n.allard@gmail.com
Cruncher
Joined: Feb 5, 2018
Post Count: 12
Re: Computation Error in clusters?

Thanks for responding. I usually have between 2 and 3 GB of memory available at all times. Also, wouldn't Windows 11 just use/expand the pagefile? I also have the Compute Options set to use 99% of available memory.

So I then ran extensive HP laptop diagnostics and it was all clean.

So I then did a Project Reset and this appears to have cleared it.

Thanks again!
[Jan 24, 2025 5:55:57 PM]
luc.n.allard@gmail.com
Cruncher
Joined: Feb 5, 2018
Post Count: 12
Re: Computation Error in clusters?

Memory is okay. Did a Project Reset.
[Jan 24, 2025 6:01:19 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1932
Re: Computation Error in clusters?

Thanks for responding. I usually have between 2 and 3 GB of memory available at all times. Also, wouldn't Windows 11 just use/expand the pagefile? I also have the Compute Options set to use 99% of available memory.
2-3 GB of RAM is tight even for a single ARP1 WU, let alone multiple ones. You are pretty much asking for problems...
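One way to stop several ARP tasks from starting at once is an `app_config.xml` file in the project's folder inside the BOINC data directory, using BOINC's `max_concurrent` option. Below is a minimal Python sketch that generates such a file's contents. The application name `arp1` is an assumption here (check the name shown in your own client), and `build_app_config` is just an illustrative helper.

```python
import xml.etree.ElementTree as ET


def build_app_config(app_name, max_concurrent):
    """Build app_config.xml text limiting one app to max_concurrent tasks."""
    root = ET.Element("app_config")
    app = ET.SubElement(root, "app")
    ET.SubElement(app, "name").text = app_name
    ET.SubElement(app, "max_concurrent").text = str(max_concurrent)
    ET.indent(root)  # pretty-print; needs Python 3.9 or newer
    return ET.tostring(root, encoding="unicode")


# "arp1" is an assumed application name; 1 means one ARP task at a time.
print(build_app_config("arp1", 1))
```

After saving the output as `app_config.xml`, the client should pick it up via Options > Read config files in BOINC Manager, or after a restart. Other projects' tasks are not affected by this limit.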


Ralf
[Jan 24, 2025 6:15:38 PM]
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7579
Re: Computation Error in clusters?

I see you are running this on a laptop. I don't know if heat is an issue for you, but excessive heat can cause memory to malfunction, among other things. However, it does look like you did not have enough memory allocated.

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Jan 24, 2025 6:25:50 PM]
luc.n.allard@gmail.com
Cruncher
Joined: Feb 5, 2018
Post Count: 12
Re: Computation Error in clusters?

8 GB total; I usually have 2-3 GB free.
[Jan 26, 2025 6:20:10 PM]