| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 23
|
|
| Author |
|
|
_Absinthe_
Cruncher Joined: Aug 10, 2010 Post Count: 26 Status: Offline Project Badges:
|
have just uninstalled 6.12.34 and installed 6.12.33 as advised. Here is a copy/paste of the beginning of the log files from the 6.12.33 client:
22/09/2011 22:59:43 | | Starting BOINC client version 6.12.33 for windows_x86_64 22/09/2011 22:59:43 | | log flags: file_xfer, sched_ops, task 22/09/2011 22:59:43 | | Libraries: libcurl/7.19.7 OpenSSL/0.9.8l zlib/1.2.5 22/09/2011 22:59:43 | | Data directory: D:\_TEMP\BOINC 22/09/2011 22:59:43 | | Running under account JamesR 22/09/2011 22:59:43 | | Processor: 8 GenuineIntel Intel(R) Core(TM) i7 CPU 920 @ 2.67GHz [Family 6 Model 26 Stepping 5] 22/09/2011 22:59:43 | | Processor: 256.00 KB cache 22/09/2011 22:59:43 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 syscall nx lm vmx tm2 popcnt pbe 22/09/2011 22:59:43 | | OS: Microsoft Windows 7: Ultimate x64 Edition, Service Pack 1, (06.01.7601.00) 22/09/2011 22:59:43 | | Memory: 11.99 GB physical, 23.98 GB virtual 22/09/2011 22:59:43 | | Disk: 1.82 TB total, 800.40 GB free 22/09/2011 22:59:43 | | Local time is UTC +1 hours 22/09/2011 22:59:43 | | NVIDIA GPU 0: GeForce GTX 260 (driver version 27550, CUDA version 4000, compute capability 1.3, 838MB, 605 GFLOPS peak) 22/09/2011 22:59:43 | | NVIDIA GPU 1 (not used): GeForce GTS 250 (driver version 27550, CUDA version 4000, compute capability 1.1, 967MB, 415 GFLOPS peak) 22/09/2011 22:59:43 | | Version change (6.12.34 -> 6.12.33) 22/09/2011 22:59:43 | GPUGRID | URL http://www.gpugrid.net/; Computer ID 107950; resource share 100 22/09/2011 22:59:43 | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 1307766; resource share 100 22/09/2011 22:59:43 | World Community Grid | General prefs: from World Community Grid (last modified 01-Jan-1970 00:00:01) 22/09/2011 22:59:43 | World Community Grid | Host location: none 22/09/2011 22:59:43 | World Community Grid | General prefs: using your defaults 22/09/2011 22:59:43 | | Reading preferences override file 22/09/2011 22:59:43 | | Preferences: 22/09/2011 22:59:43 | | max memory usage when active: 6139.54MB 22/09/2011 22:59:43 | | max memory usage when idle: 11051.17MB 22/09/2011 22:59:43 | | max disk usage: 40.00GB 22/09/2011 22:59:43 | | max CPUs used: 7 22/09/2011 22:59:43 | | suspend work if non-BOINC CPU load exceeds 25 % 22/09/2011 22:59:43 | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager) 22/09/2011 22:59:43 | | Not using a proxy 22/09/2011 22:59:44 | | Running CPU benchmarks 22/09/2011 22:59:44 | | Suspending computation - CPU benchmarks in progress 22/09/2011 23:00:15 | | Benchmark results: 22/09/2011 23:00:15 | | Number of CPUs: 7 22/09/2011 23:00:15 | | 2592 floating point MIPS (Whetstone) per CPU 22/09/2011 23:00:15 | | 8148 integer MIPS (Dhrystone) per CPU 22/09/2011 23:00:16 | World Community Grid | Started download of hcc1_image04_6.42.tga 22/09/2011 23:00:16 | World Community Grid | Started download of hcc1_image01_6.42.tga |
||
|
|
_Absinthe_
Cruncher Joined: Aug 10, 2010 Post Count: 26 Status: Offline Project Badges:
|
(the downloads specified in the last two entries didn't succeed, it's going thru the 'retry' then backoff process as before)
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Reading preferences override file This means that you have created a special disk file containing your BOINC preferences overriding your website preferences. suspend work if non-BOINC CPU load exceeds 25 % This means if if 2 threadas are running, you will suspend the 7 BOINC threads you are allowing. I hope you have selected 'Leave Application in Memory if Suspended' so your multi-GB virtual memory will save everything instead of dumping it back to the last checkpoint. 22/09/2011 23:00:16 | World Community Grid | Started download of hcc1_image04_6.42.tga I think these are general data files downloaded before you run your first HCC work unit. Does your My Grid page show any time for the HCC project? What does your Results Status page show for WCG projects? Lawrence |
||
|
|
_Absinthe_
Cruncher Joined: Aug 10, 2010 Post Count: 26 Status: Offline Project Badges:
|
I've been managing the preferences thru the client, instead of via the website, I guess from your comment that this isn't the preferred method?
re: the memory question, I work from home a lot, and when I am I usually have numerous VM's running and need as much memory as I can get my hands on, so no, I don't let it leave app in memory if suspended. However, the preferences are set to dump to disk every 60secs, so it can't be losing that much processing time? Just checked my My Grid page, and I'm showing zero runtime since 20th Sept, and last result returned 64+ hrs ago... |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just to be sure, did you *do/try all* of my last post in this thread?
--//-- |
||
|
|
Ingleside
Veteran Cruncher Norway Joined: Nov 19, 2005 Post Count: 974 Status: Offline Project Badges:
|
I've been managing the preferences thru the client, instead of via the website, I guess from your comment that this isn't the preferred method? Managing preferences client-side isn't normally a problem, except to remember if you've changed one thing client-side, any changes server-side won't have any effects. This doesn't apply to the project-specific preferences like example selection of WCG-sub-projects to run. re: the memory question, I work from home a lot, and when I am I usually have numerous VM's running and need as much memory as I can get my hands on, so no, I don't let it leave app in memory if suspended. However, the preferences are set to dump to disk every 60secs, so it can't be losing that much processing time? 60 seconds is the shortest time between checkpoints, in reality the actual time can be much larger, depending on application. Especially CEP2 can be many hours between checkpoints. Also, even if you do have an application that basically can checkpoint every second, and therefore waits for the 60-second preference-setting before checkpoints, the application can still have a multi-second initializing-time, so wastes much time on re-initializing for each re-start as long as isn't kept in memory. Just checked my My Grid page, and I'm showing zero runtime since 20th Sept, and last result returned 64+ hrs ago... Then it comes to problems with connection, appart for re-starting any routers, modems and affected computers, enabling more detailed logging can be an idea. This is done by adding/editing cc_config.xml located in your BOINC data-directory, and including some/all of these: <cc_config.xml> Not all options will be so informative, but atleast the 1st. option about <file_xfer_debug> I'll recommend to keep even during "normal" operations. After adding these log-flags, in BOINC Manager, select menu-option to re-read config-file. Some of these logging-options can generate lots of info, so to disable them again, just change the 1 with a zero for all things you wants to disable again, and re-read config-file. edit - removed <file_xfer>, since this is default on it's little point to add this option. ![]() "I make so many mistakes. But then just think of all the mistakes I don't make, although I might." [Edit 2 times, last edit by Ingleside at Sep 23, 2011 4:17:22 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi _Absinthe_,
I've been managing the preferences thru the client, instead of via the website, I guess from your comment that this isn't the preferred method? It is a perfectly fine method as long as you remember that you are doing it. I normally look askance at it because so many members 'set and forget'. The idea is that BOINC should 'just work' so people do not need to remember choices made years ago.Something is going wrong with your network commo that only seems to affect WCG. I think that while you are trying to resolve it, you should allow BOINC to run continuously. To do this, allow BOINC to run unless non-BOINC programs exceed 90% (not 25%) of CPU capacity. At the same time, run only one BOINC project. If you have a hyperthreading quad-core, then this means no more than 13% (or 15% or 20%) of CPU capacity. This should keep things simple while you examine your network. Check out your network BOINC preferences. Let them run wide open without limitations. About saving BOINC check points - this is a little different than you think. A check point is not a mirror image copy of the computer state. It is a special point that a project algorithm reaches which allows a relatively small table or group of tables to save the current progress of the task. But this is highly dependent on the algorithm. Some projects reach this point several times a second or several times a minute. Saving at each point would involve a lot of unnecessary disk I/O, irritating many people. So BOINC allows them to set a minimum delay. The default is 1 minute. Who minds missing a minute of computation due to a power outage? After powering up, the project starts at the last saved check point. On the other hand, some projects - such as CEP - might have an hour ( or 2 or 3 hours) between some check points. If members lose this much time just because the task was superseded by a higher priority email - well, some members complain. Which is why we have a 'Leave Application in Memory' option. With big Virtual Memory disk files, it seldom causes a problem. So far, I have no sage advice to give you, so I am just advising you on ways to simplify your situation while you investigate. Computer problems are so complicated and frustrating, it makes a lot of sense to keep everything we can control very simple while we try to figure it out.Lawrence |
||
|
|
_Absinthe_
Cruncher Joined: Aug 10, 2010 Post Count: 26 Status: Offline Project Badges:
|
wow, thanks for all the suggestions folks, really really appreciate it :)
Re: the checkpoint saving, that's some very good info to know, thanks. The description in the preferences obviously implies it's a lot simpler than it actually is. I'll try setting "leave apps in memory when suspended" and see just how much that impacts things when I'm working from home... (When I've configured BOINC before I went for the 'safest' option from my work perspective since if you run out of memory when running numerous VM's in VMware Workstation, things can just completely grind to a halt and make it very difficult to get things moving again.) I apologise SekeRob, I missed the link you posted re: adding "198.20.8.241 download.worldcommunitygrid.org" to hosts file in order to target primary DL servers instead of the cache ones. I've done that now and it's immediately resolved the problem. Which prompts the question as to why the client can't resolve the names for the cache servers? (I'm assuming it would be bad practice to leave it pointed at the primary's indefinitely?) As an aside, if you don't mind one more question, how do I ensure the client is getting preference settings from the web as opposed to local config file? I can't find anything in the menus to specify a preference either way... |
||
|
|
_Absinthe_
Cruncher Joined: Aug 10, 2010 Post Count: 26 Status: Offline Project Badges:
|
oh, and I will be enabling some of the additional logging options and removing the 'fix' entry from my hosts file, to try and find the exact cause of the problem, but that'll have to wait til later as have work to do now :(
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I apologise SekeRob, I missed the link you posted re: adding "198.20.8.241 download.worldcommunitygrid.org" to hosts file in order to target primary DL servers instead of the cache ones. I've done that now and it's immediately resolved the problem. Which prompts the question as to why the client can't resolve the names for the cache servers? (I'm assuming it would be bad practice to leave it pointed at the primary's indefinitely?) No worries, it's really not something you can resolve, but let that hosts entry be for now, as something is not working in the download cloud... something the WCG techs have to fix, me thinketh, therefore I ammeth. --//-- |
||
|
|
|