Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Support Forum: BOINC Agent Support Thread: Computation Error in clusters? |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 13
|
Author |
|
luc.n.allard@gmail.com
Cruncher Joined: Feb 5, 2018 Post Count: 12 Status: Offline Project Badges: |
I usually keep 3 WUs running and have recently noticed that I can see not only those 3 WUs stop but up to 7 or 8 more all failing with Computation Error displayed? Then Boinc seems to recover and proceeds as it normally does. The failed WUs are usually ARP followed by MCMs that were q'd up. The event log shows that output files are absent for the ARP WUs. tks!
|
||
|
luc.n.allard@gmail.com
Cruncher Joined: Feb 5, 2018 Post Count: 12 Status: Offline Project Badges: |
I checked the 'results' file on the host site and it shows a few of these:
rsl_malloc failed allocating 24911668 bytes, called ..\external\RSL_LITE\rsl_bcast.c, line 270, try 3 : Not enough space |
||
|
MJH333
Senior Cruncher England Joined: Apr 3, 2021 Post Count: 240 Status: Offline Project Badges: |
Not an expert, but that sounds like a memory error to me. Does your machine meet the minimum requirements for ARP1? See this page.
If it does, perhaps you should run some memory diagnostics. Cheers, Mark |
||
|
Bryn Mawr
Senior Cruncher Joined: Dec 26, 2018 Post Count: 337 Status: Offline Project Badges: |
I checked the 'results' file on the host site and it shows a few of these: rsl_malloc failed allocating 24911668 bytes, called ..\external\RSL_LITE\rsl_bcast.c, line 270, try 3 : Not enough space As MJH says, your machine ran out of memory and swap space so the WU could not allocate the 24mb it was asking for. |
||
|
Grumpy Swede
Master Cruncher Svíþjóð Joined: Apr 10, 2020 Post Count: 2092 Status: Offline Project Badges: |
And, the computer in question may not be able to run 3 ARP tasks at the same time. Too little memory, or other restrictions perhaps.
Check this page for System Requirement for the different projects. Each running ARP, need 1 GB Memory Available. https://www.worldcommunitygrid.org/help/topic.s?shortName=minimumreq |
||
|
luc.n.allard@gmail.com
Cruncher Joined: Feb 5, 2018 Post Count: 12 Status: Offline Project Badges: |
thanks for responding. I usually have between 2-3 GB mem available at all times. Also, wouldn't Windows 11 just use/expand the pagefil? I also have the Compute Options set use use 99 % of available mem.
So I then ran extensive HP laptop diags and it was all clean So I then did a Project Reset and this appears to have cleared it. thanks again! |
||
|
luc.n.allard@gmail.com
Cruncher Joined: Feb 5, 2018 Post Count: 12 Status: Offline Project Badges: |
mem is okay. did a Project Reset
|
||
|
TPCBF
Master Cruncher USA Joined: Jan 2, 2011 Post Count: 1932 Status: Offline Project Badges: |
thanks for responding. I usually have between 2-3 GB mem available at all times. Also, wouldn't Windows 11 just use/expand the pagefil? I also have the Compute Options set use use 99 % of available mem. 2-3GB RAM is even tight to just run one single ARP1 WU, let alone multiple ones. You are pretty much calling for problems... Ralf |
||
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7579 Status: Offline Project Badges: |
I see you are running this on a laptop. I don't know if heat is an issue for you, but excessive heat can cause some memory to malfunction among other things. However, it does look like you did not enough allocated.
----------------------------------------Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
luc.n.allard@gmail.com
Cruncher Joined: Feb 5, 2018 Post Count: 12 Status: Offline Project Badges: |
8 GB - usually have 2 - 3 GB free
|
||
|
|