| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 34
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I had no idea the heartbeat thing existed. I've been running three Phenom II X6 machines with just a single hard drive and no SSD and computing six CEP2 work units simultaneously for a while now. My FX-8120 machine though is running HCC since it has a Radeon HD 7950. I think the only time I saw the heartbeat issue (a long time ago) was when I had failed to tell my antivirus to ignore the BOINC data and executables directories/folders...although I could also see it happening if you're running Windows and the search indexer was indexing the BOINC data or executables directory and/or the entire disk one or the other (or both) are running from.I've returned to WCG after 9 months of distributed X264 10-bit encoding of my personal archive of TV series and movies. (Yes, it is O/T...but CEP2's high amount of disk I/O makes it important to stress the need for telling your antivirus [many of which scan/temporarily lock any file run as an executable and any file opened for read or write to include unzipping, an act which can drive a cascade of scans/temporary file locks on the unzipped files - as a part of their "real time protection"] to keep its hands to itself. Antivirus scanning is typically opening a file and reading the whole file in an attempt to match one or more virus "signatures" from an ever-growing database of virus signatures...so if your antivirus does that whenever a file is "touched" in a BOINC data directory, the other-than-minimal I/O of CEP2 could be doubled or more.) |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks SekeRob, I was tired of micromanaging my little quad which can't handle more than one CEP2 concurrently, so this was perfect news for me :)
----------------------------------------Actually I went as far as bringing up a virtual linux just to run one also, so I can have 2 concurrent as long as it's not 2 windows concurrent. The efficiency is so poor in both cases that I net maybe a cpu day/day, but that's better than it was managing before. Starting a CEP2 (windows) job still freezes the box for a minute while the disk pegs, and forces a restart on remaining boinc tasks which is annoying, but after playing with VM and memory and disk settings ad nauseum I gave up and just live with it. This project is close to my heart, otherwise it would be too annoying to run. Anyway your news and clear explanation was very helpful so thanks. [Edit 1 times, last edit by Former Member at Mar 6, 2013 11:28:01 PM] |
||
|
|
mmstick
Senior Cruncher Joined: Aug 19, 2010 Post Count: 151 Status: Offline Project Badges:
|
Thanks SekeRob, I was tired of micromanaging my little quad which can't handle more than one CEP2 concurrently, so this was perfect news for me :) Actually I went as far as bringing up a virtual linux just to run one also, so I can have 2 concurrent as long as it's not 2 windows concurrent. The efficiency is so poor in both cases that I net maybe a cpu day/day, but that's better than it was managing before. Starting a CEP2 (windows) job still freezes the box for a minute while the disk pegs, and forces a restart on remaining boinc tasks which is annoying, but after playing with VM and memory and disk settings ad nauseum I gave up and just live with it. This project is close to my heart, otherwise it would be too annoying to run. Anyway your news and clear explanation was very helpful so thanks. Why not just ditch Windows? All of my systems run Ubuntu 12.04 besides my GPU HCC crunching gaming machine. [Edit 1 times, last edit by mmstick at Mar 7, 2013 1:46:04 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Well, there is no golden bullet in terms of operating systems - different people have different demands. That's why CEP is provided for all the major operating systems.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
why not ditch windows: in a word: work. Using windows as a virtual OS doesn't cut it for work, I did run Vista on my mac for the miserable time I was forced to deal with that stinking excuse for an OS -- but for XP while it lasts, or W7, better to run it native. Linux is great but little use at work beyond what I can do on the mac.
----------------------------------------Meanwhile, happy to be able to run 2 CEP2 tasks 24/7 now, on the one host that can handle it. ETA: However -- and apologies in advance for not researching this question, flu zapped my energy -- can someone please point me to a link where I can read why the flow of CEP2 seems reduced? My work profile is set to CEP2 and DSFL, and a lot of the time I'm getting only DSFL even when I'm 2-3 wus below the max-cep2-wu profile setting, to the point where it risks running dry on cep2 overnight. This has only been true for the last week or less; started a little before I made the app_config change. [Edit 2 times, last edit by Former Member at Mar 7, 2013 7:54:04 PM] |
||
|
|
Rickjb
Veteran Cruncher Australia Joined: Sep 17, 2006 Post Count: 666 Status: Offline Project Badges:
|
CEP2 and app_config.xml: WCG website is broken!!
----------------------------------------I am running a machine with max_concurrent for CEP2 = 3 (of 8 threads), and 1 other project selected (FAAH), with Minimum work buffer = 1.15 and Max additional work buffer = 0.70. If I set the Number of workunits per host for the CEP Phase 2 in its Device Profile to the max exact number in the dropdown list (16), the pool of Ready-to-Start CEP2 WUs in the next-to-run part of the buffer sometimes disappears, and the number of CEP2 WUs that are running drops below my specified 3. At least the vacated threads run extra FAAH WUs. Today I tried setting Number of workunits per host for the CEP Phase 2 = Unlimited. Since the change, the client has received 13 CEP2 WUs and 7 FAAH WUs. Since CEP2 WUs take about 5-7 hrs and FAAH take about 3-5 hrs, and less than 50% of the cores are assigned to CEP2, the machine will gobble up all of the FAAH WUs, and then 5 CPU threads will sit idle while the excess CEP2 WUs are crunched 3 at a time, until the work cache is reduced anough to allow downloading more WUs. I don't really want to reduce the work cache, due to the occasional major WCG server outages. Yes, they've fixed all the computing problems, but hey I've heard that before. Then there are the sabotage attempts carried on by the North American climate. Yes, you passed through Sandy and the recent ice storm OK, but what about Boulder CO at Christmas several years back? Please provide a way of increasing the maximum exact setting for Number of workunits per host for the CEP Phase 2, either by lengthening the drop-down list or by allowing us to simply type in the number. ================== @[B.S] sTrey and others experiencing problems with systems freezing at CEP2 startup: You can reduce this effect with a bit of micro-management, if you are at the machine when the CEP2 WUs first start. The trick is to force the next CEP2 WU to run for just a few seconds - say 5 - then suspend it and run a non-CEP2 WU or another CEP2 WU that has passed its initial startup phase. After a short while - say 20 sec, or after the HDD activity LED goes off - you can run the new CEP2 WU some more. Pausing your new CEP2 WU like this allows the HDD to catch up. You may have to repeat this cycle another time or 2. More precise time estimates depend on the speed of your CPU and disc system. LAIM must be ON of course. This sounds onerous, but with app_config.xml limiting the max_concurrent for CEP2, you can pre-run as many CEP2 WUs as you like, subject to the amount of RAM and disc space available. [Edit 2 times, last edit by Rickjb at Mar 10, 2013 9:47:52 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Dear Rickjb,
this is something our friends at IBM-WCG have to look into. Best wishes Your Harvard CEP team |
||
|
|
Jason1478963
Senior Cruncher United States Joined: Sep 18, 2005 Post Count: 295 Status: Offline Project Badges:
|
I have found the app_config file to be very useful for this project as well. This sure beats only a few in the cue at a time. I am trying to get 16 at a time an run between 2 and 4 on a machine with the rest being filled with additional work from other projects. This allows me to get hcc gpu work and fill in extra slots on the 12 core. I am trying to run a min work buffer of 1.45 an a max additional work buffer of 1.5 days set at the client preferences. I would like to see the WCG website updated so these settings don't need to be set at the client. The worst part for me has been updating linux clients without creating new client ID's all the time.
----------------------------------------![]() [Edit 1 times, last edit by Jason1478963 at Mar 10, 2013 7:05:55 PM] |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
Rickjb,
We have added this to our list of website enhancements. It will require some thinking as there are positives to having default and unlimited options from a pull down. Thank you for the suggestion, -Uplinger |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
why not ditch windows: in a word: work. Using windows as a virtual OS doesn't cut it for work, I did run Vista on my mac for the miserable time I was forced to deal with that stinking excuse for an OS -- but for XP while it lasts, or W7, better to run it native. Linux is great but little use at work beyond what I can do on the mac. Meanwhile, happy to be able to run 2 CEP2 tasks 24/7 now, on the one host that can handle it. ETA: However -- and apologies in advance for not researching this question, flu zapped my energy -- can someone please point me to a link where I can read why the flow of CEP2 seems reduced? My work profile is set to CEP2 and DSFL, and a lot of the time I'm getting only DSFL even when I'm 2-3 wus below the max-cep2-wu profile setting, to the point where it risks running dry on cep2 overnight. This has only been true for the last week or less; started a little before I made the app_config change. There was an inbalance in DSFL which was causing it to send out more than other projects. We have readjusted the weights on the feeder to help resolve this issue. Thanks, -Uplinger |
||
|
|
|