| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 33
|
|
| Author |
|
|
Speedy51
Veteran Cruncher New Zealand Joined: Nov 4, 2005 Post Count: 1326 Status: Offline Project Badges:
|
@Bobby B
----------------------------------------Exactly. My point is that some people want longer run times some people want shorter. I am not wanting to shift this thread anymore off-topic than it is all I will say is if I was able to use Linux on my daily machine I would but unfortunately I require windows to many tasks ![]() |
||
|
|
!evil
Cruncher Joined: Aug 3, 2023 Post Count: 4 Status: Offline |
From what I can see, SVMlight calls clock for timing/verbosity, so those calls should be easy to avoid. I tried LD_PRELOAD using the wrapper approach, but the tasks simply fail. If I had to guess, boinc probably protects us from malicious code.
Regarding long vs short vs not running a task: the sys calls are wasted CPU cycles, that is, the research team could potentially get results significantly faster/more results within their time budget. |
||
|
|
thunder7
Senior Cruncher Netherlands Joined: Mar 6, 2013 Post Count: 238 Status: Offline Project Badges:
|
This problem seems to have disappeared on my linux systems crunching MCM.
48 cores - 2.3% system time (2696 V2) 32 cores - 0.1% system time (7950X) 40 cores - 0.8% system time (2680 V2) There's been no announcement, but how are other systems running now? |
||
|
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2346 Status: Offline Project Badges:
|
I don't know how you measure that, thunder7.
One thing that has changed is that we no longer need to download the 102 MB file mcm1.dataset-sarc1.txt; instead there is now a 48 MB curatedOvarian_EarlyLate_v1.0 file: -rw-r--r--. 1 boinc boinc 48177891 Oct 20 20:23 ~boinc/projects/www.worldcommunitygrid.org/e55b6bdba4ed0b4b6e315c6767d68e3f.txt Adri |
||
|
|
thunder7
Senior Cruncher Netherlands Joined: Mar 6, 2013 Post Count: 238 Status: Offline Project Badges:
|
I use de 'top' command, which shows
Tasks: 436 total, 33 running, 403 sleeping, 0 stopped, 0 zombie %Cpu(s): 0.0 us, 0.1 sy, 99.9 ni, 0.0 id, 0.0 wa, 0.0 hi, 0.0 si, 0.0 st MiB Mem : 63432.7 total, 59847.3 free, 3010.1 used, 1265.1 buff/cache MiB Swap: 123567.0 total, 123567.0 free, 0.0 used. 60422.6 avail Mem PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 6405 boinc 39 19 78552 39452 2392 R 100.3 0.1 20:08.60 wcgrid_mcm1_map 6400 boinc 39 19 78552 39460 2392 R 100.0 0.1 21:22.31 wcgrid_mcm1_map 6414 boinc 39 19 78848 40544 2392 R 100.0 0.1 18:05.74 wcgrid_mcm1_map 6418 boinc 39 19 78552 38920 2392 R 100.0 0.1 17:03.67 wcgrid_mcm1_map etc. 0.1 sy means 0.1% system time 99.9 ni means 99.9% time running low priority process(es) like WCG. |
||
|
|
adriverhoef
Master Cruncher The Netherlands Joined: Apr 3, 2009 Post Count: 2346 Status: Offline Project Badges:
|
Intel:
Tasks: 400 total, 9 running, 383 sleeping, 8 stopped, 0 zombie Another one, AMD: Tasks: 467 total, 20 running, 447 sleeping, 0 stopped, 0 zombie And yet another one, Intel: Tasks: 266 total, 9 running, 257 sleeping, 0 stopped, 0 zombie |
||
|
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 1317 Status: Offline Project Badges:
|
Regarding "This problem seems to have disappeared on my linux systems crunching MCM" (thunder7):
I think all the Ovarian work units use VMethod=LOO (which, from doing some reading about SVMlight, I think stands for Leave One Out...), and LOO tasks run better than NFCV (N-Fold?) on Linux, especially on older CPUs[*1] I've not looked at the library code (perhaps !evil has, and can comment again?) but I'd hazard a guess that the Leave One Out training mechanism doesn't result in as much timer activity overall as does NFCV... Whatever the case regarding LOO versus NFCV as training method, it should be noted that the Ovarian dataset is quite a lot smaller than the Sarcoma one, so tasks will run a fair bit faster anyway :-) Cheers - Al. [*1] On Sarcoma tasks, my Intel boxes tend to take between 40% and 60% longer to run if NFCV rather than LOO; on my Ryzen boxes (which have more FPU capability per core!) the difference is down to between 10% and 20%... I don't think the performance difference has that much to do with timers :-) |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7846 Status: Offline Project Badges:
|
Whatever the case regarding LOO versus NFCV as training method, it should be noted that the Ovarian dataset is quite a lot smaller than the Sarcoma one, so tasks will run a fair bit faster anyway :-) I had noticed the first couple of days of the ovarian work units the run time was smaller than I previously for the previous units. The units must be getting bigger because the time for each unit seems to have grown from about 1.7 hours to about 3 hours. They must have started with the easy ones. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 1317 Status: Offline Project Badges:
|
Sgt, Joe,
[I should probably have said "probably run a fair bit faster" :-)] Interesting that you've seen such significant increases -- it doesn't tally with my experience (even on my Intel systems, which are more prone to swings!) , but I don't doubt your assessment -- I'd just like to know what's going on (and why!) I've started poking around in my gathered statistics and parameter logs to see if there's an obvious link that will differentiate my slowest and quickest results. A quick scan with the Mark One Eyeball shows that (unlike Sarcoma units) there are considerable variations in task parameters, some of which may well play into runtime consumed! I'm going to have to write scripts to match up task parameters with task run-times, but I have done a sweep of CPU times for Ovarian tasks from start (very late September) to 20th October (as up-to-date as my easiest-to-query database, which only contains results for days where everything has validated!), counting tasks returned per day and average CPU hours. The total lists would make for a long post so I'll just show a sample from the results for one of my Ryzens (the 3700X),,, Date NTasks Ave. CPUhours I've had a look at the results not yet entered into that database, and the times for that system all seem to be in the range 1.17..1.22 hours with an occasional outlier at around 1.4 hours or 1.1 hours, so there's still some variability -- I'll keep looking, and if I spot anything interesting when I start matching parameters to run times, I'll report :-) Cheers - Al. P.S. Of course, it could just be that they send my "2 or 3 concurrent task" systems the easy stuff :-) |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7846 Status: Offline Project Badges:
|
Alan:
----------------------------------------Just a little further information: I have just used the results from my T520 with dual Xeon E5-2665. Starting on Oct. 18 through Oct 23 the units ranged from 1.8 hours to 2.1 hours. Late on Oct 23 to early on Oct.24 the units fairly quickly ramped from 2.1 hours to 3.0 hours. From then until the present the units have gradually ramped up to a very consistent 3.2 hours. This spans a total of about 800 units. This is on a machine running Linux. I have one machine running Windows (I7-3770) which has been consistent over all these days at 2.2 to 2.3 hours. Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
|