Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: FightAIDS@Home Phase 2 Thread: FAAH2 extremely hungry for RAM (ubuntu) |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 12
|
Author |
|
Magiceye04
Cruncher Joined: Jul 5, 2008 Post Count: 36 Status: Offline Project Badges: |
Hi,
on the project home page, FAAH2 is stated to use 250MB of RAM. Some hours ago i noticed that my Ryzen7 with 16GB RAM was pausing some WUs "waiting for RAM". Boinc Manager told me, the WUs need 2,09GB of RAM! (Ubuntu18) Also on my small PC with 8GB and Ubuntu16, they claim 1GB which is also far above the stated limit. In top i see also 2GB reserved, 512MB used on both PCs. Is that huge RAM hunger normal for the project? If so, could you please update the home page? If not, what i going wrong on my PCs? Best regards MagicEye |
||
|
Magiceye04
Cruncher Joined: Jul 5, 2008 Post Count: 36 Status: Offline Project Badges: |
OK, there is a new version of the app available since yesterday and i found the hint, that this one might use "a bit more" memory.
This seems to be the explanation. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Not for me... 457MB RAM for each task, not so much for me. |
||
|
TonyEllis
Senior Cruncher Australia Joined: Jul 9, 2008 Post Count: 258 Status: Recently Active Project Badges: |
Just a tad more for me...
----------------------------------------
Run Time Stats https://grassmere-productions.no-ip.biz/
----------------------------------------[Edit 1 times, last edit by TonyEllis at Jan 28, 2019 2:21:36 PM] |
||
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline Project Badges: |
Boinc Manager told me, the WUs need 2,09GB of RAM! (Ubuntu18) Also on my small PC with 8GB and Ubuntu16, they claim 1GB which is also far above the stated limit. In top i see also 2GB reserved, 512MB used on both PCs. It looks like they require 2 GB to download, but much less to run. It is common for the jobs to reserve a lot more memory than they typically use; there could be larger ones also. [Edit 1 times, last edit by Jim1348 at Jan 28, 2019 2:43:37 PM] |
||
|
Magiceye04
Cruncher Joined: Jul 5, 2008 Post Count: 36 Status: Offline Project Badges: |
No, i have not seen any 2GB download.
Yes, they reserve more than needed during calculation - but this means that some WUs wait until the reserved memory is free. |
||
|
FrancisNH
Cruncher Joined: Apr 2, 2015 Post Count: 1 Status: Offline Project Badges: |
Problem Report #1: Five (5) AMD Threadripper CPU’s lockup running Fight Aids at Home (FAH2) > 22 threads.
Severity: Must avoid 100% FAH2 World Community Grid (WCG) project on BOINC using AMD Threadripper CPU’s. 100% Zika projects are okay. Mix of projects are okay including a few FAH2 work units in the mix. Environment: OS: Ubuntu 16.04.6 LTS (Unity) on all twenty (20) machines. Linux Version 4.15.0-46 generic. Boinc: Version 7.6.31 from Ubuntu repository. CPU’s 5 - AMD Threadrippers 16 Cores 32 Threads Machine fails, Locks up > 22 threads of FAH2 9 - AMD Ryzens 8 Cores 16 Threads Machine is good, all 16 FAH2 threads running. 6 - Intel i7-4790K 4 Cores 8 Threads Machine is good, all 8 FAH2 threads running. Symptons: 1.) After about a minute following reset, the machine locks up with no mouse or keyboard input accepted. 2.) Low heat output seems to indicate number crunching has also stopped after the lockup. 3.) The only exit is a hard power off by holding the power-on button depressed several seconds. Keyboard CTRL-ALT-DEL does nothing. 4.) Fortunately, the machine operates normally for about a minute after completing power-on startup. Otherwise, a disk OS re-installation would be necessary. 5.) In that first minute of normal operation, one must quickly open BOINC manager and “suspend” World Community Grid (running 32 FAH2 threads). Now the machine will run normally indefinitely without locking up. 6.) FAH2 is then deselected at WCG web site, “My Projects”. All FAH2 work units on five (5) Threadripper machines must be manually “Aborted” by user. However, each of the five (5) Threadripper machines successfully resume number crunching doing 32 queued (“waiting”) Zika work units without locking up. 7.) After having “suspended” WCG in the BOINC console, FAH2 is seen to be running on 22 of 32 threads. #1 Longest running FAH2 thread has run over 4 hours. #22 shortest running good thread has run 1 hour 22 minutes. The last 10 threads out of 32 have run equally just 14 Seconds. 8.) Unfinished Zika threads are “waiting” because the FAH2 threads have a shorter deadline (4 April FAH2 vs 12 April Zika). The five (5) Threadripper machines are each manually recovered as above by aborting all FAH2 work units. All fifteen (15) other machines are untouched, and operate normally throughout the Threadripper recovery. Those good 15 machines have run eight (8) and sixteen (16) FAH2 threads at 100% loading, like Threadrippers normally do. 9.) I have run all 20 machines where all projects are selected at WCG web site, including FAH2 in the mix. FAH2 work units processed good on Threadripper, except when the FAH2 load is 100%. Comments: 1.) There is a lot if unused memory on all machines, including the Threadripper 1950X which has 16 GB. I wonder if the memory allocation should be allowed to grow larger than is set in BOINC? 2.) The delay before lockup at startup is puzzling (though fortunate)? There is a similar delay period, a window of time in which the BOINC Manager must be opened after startup (and left open), else it will never open afterward. 3.) I wonder if FAH2 software or BOINC could detect this problem and avoid it? |
||
|
ErikaT
Former World Community Grid Admin USA Joined: Apr 27, 2009 Post Count: 912 Status: Offline Project Badges: |
Problem Report #1: Five (5) AMD Threadripper CPU’s lockup running Fight Aids at Home (FAH2) > 22 threads. FrancisNH,Severity: Must avoid 100% FAH2 World Community Grid (WCG) project on BOINC using AMD Threadripper CPU’s. 100% Zika projects are okay. Mix of projects are okay including a few FAH2 work units in the mix. Environment: OS: Ubuntu 16.04.6 LTS (Unity) on all twenty (20) machines. Linux Version 4.15.0-46 generic. Boinc: Version 7.6.31 from Ubuntu repository. CPU’s 5 - AMD Threadrippers 16 Cores 32 Threads Machine fails, Locks up > 22 threads of FAH2 9 - AMD Ryzens 8 Cores 16 Threads Machine is good, all 16 FAH2 threads running. 6 - Intel i7-4790K 4 Cores 8 Threads Machine is good, all 8 FAH2 threads running. Symptons: 1.) After about a minute following reset, the machine locks up with no mouse or keyboard input accepted. 2.) Low heat output seems to indicate number crunching has also stopped after the lockup. 3.) The only exit is a hard power off by holding the power-on button depressed several seconds. Keyboard CTRL-ALT-DEL does nothing. 4.) Fortunately, the machine operates normally for about a minute after completing power-on startup. Otherwise, a disk OS re-installation would be necessary. 5.) In that first minute of normal operation, one must quickly open BOINC manager and “suspend” World Community Grid (running 32 FAH2 threads). Now the machine will run normally indefinitely without locking up. 6.) FAH2 is then deselected at WCG web site, “My Projects”. All FAH2 work units on five (5) Threadripper machines must be manually “Aborted” by user. However, each of the five (5) Threadripper machines successfully resume number crunching doing 32 queued (“waiting”) Zika work units without locking up. 7.) After having “suspended” WCG in the BOINC console, FAH2 is seen to be running on 22 of 32 threads. #1 Longest running FAH2 thread has run over 4 hours. #22 shortest running good thread has run 1 hour 22 minutes. The last 10 threads out of 32 have run equally just 14 Seconds. 8.) Unfinished Zika threads are “waiting” because the FAH2 threads have a shorter deadline (4 April FAH2 vs 12 April Zika). The five (5) Threadripper machines are each manually recovered as above by aborting all FAH2 work units. All fifteen (15) other machines are untouched, and operate normally throughout the Threadripper recovery. Those good 15 machines have run eight (8) and sixteen (16) FAH2 threads at 100% loading, like Threadrippers normally do. 9.) I have run all 20 machines where all projects are selected at WCG web site, including FAH2 in the mix. FAH2 work units processed good on Threadripper, except when the FAH2 load is 100%. Comments: 1.) There is a lot if unused memory on all machines, including the Threadripper 1950X which has 16 GB. I wonder if the memory allocation should be allowed to grow larger than is set in BOINC? 2.) The delay before lockup at startup is puzzling (though fortunate)? There is a similar delay period, a window of time in which the BOINC Manager must be opened after startup (and left open), else it will never open afterward. 3.) I wonder if FAH2 software or BOINC could detect this problem and avoid it? We have responded to you via the email you sent to Contact Us. Thank you for your support, ErikaT |
||
|
Jean-David Beyer
Senior Cruncher USA Joined: Oct 2, 2007 Post Count: 335 Status: Offline Project Badges: |
I noticed some of these work units took 8% or a little more of RAM on my 4-core processor. This was not running me out of RAM. I had 8 GBytes RAM in my machine with another 8 GBytes on order. When the RAM came, I installed it, and changed the CR2032 clock battery that was a little over 7 years old.
----------------------------------------So now those work units each take only 4% of my RAM (not surprising). |
||
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline Project Badges: |
It looks like they are taking only 200 MB on my Win7 64-bit machine. Since there are tons of Windows machines out there, but relatively few Linux ones, I never put FAH2 on the Linux ones. The VINA ones (among others) do better on Linux.
|
||
|
|