World Community Grid Forums
Category: Completed Research Forum: The Clean Energy Project - Phase 2 Forum Thread: New Project Setting |
Thread Status: Active Thread Type: Sticky Thread Total posts in this thread: 124
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Elapsed or CPU time? There has been no testing, announcement, or new science app release to lift the hard cut-off at 12 hours. Frankly, cleanenergy was delighted that the porting to the new Q-Chem engine has started, so I'm not sure whether the WCG programmers want to do this in two steps: first a test to allow running to 24 hours, then another test for Q-Chem 4 (?).
Project average run time continues to sit near the 8 hour mark. See http://bit.ly/WCGCE1 (on a fresh look, it seems to be going into a steady decline again), so there is no hint there either that the cut-off was changed. --//-- |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Not sure if this is a new problem, but I've noticed it first on the current (now past "due") Clean Energy Project, so I'm posting it here. Every time I turn off my computer, I keep losing all of my time on this project, even when I suspend the work before shutting down. I've never had this problem before. What's going on? I would have finished before the deadline if not for losing all of that time.
Thanks! |
||
|
Jim1348
Veteran Cruncher USA Joined: Jul 13, 2009 Post Count: 1066 Status: Offline Project Badges: |
"Every time I turn off my computer, I keep losing all of my time on this project, even when I suspend the work before shutting down."
Do you really lose all the work? It sounds more like a checkpoint problem: when you shut down, you lose all the work done since the last checkpoint. You can use BOINC Tasks to find out when the last checkpoint occurred. http://www.efmer.eu/boinc/boinc_tasks/ |
||
|
rbotterb
Senior Cruncher United States Joined: Jul 21, 2005 Post Count: 401 Status: Offline Project Badges: |
I'm having the same problem in the past week. 2 out of the last 3 CEP2 WUs seemed to run fine during a session, but generally I'm only able to run 8-9 hours at a time (the CEP2 sessions generally run to completion at the 12 hour mark on my laptop). In two cases, when I shut down, then restarted the next time, the CEP2 WU was running like it was restarted from scratch. I had one WU do this to me three times in a row. I figured it was maybe just a bad WU and aborted it. The next CEP2 WU worked fine (across 3 sessions), but then the latest one just did the same thing for me. Has me scratching my head since I've been running a number of these WUs this year and up until the past 3 WUs I've never had any issues with CEP2 at all.
|
||
|
rbotterb
Senior Cruncher United States Joined: Jul 21, 2005 Post Count: 401 Status: Offline Project Badges: |
By the way, I'm running on a quad-core CPU laptop. When I was losing my time on the CEP2 WUs, the WUs from other projects shut down just fine, without any lost time in the next session.
|
||
|
armstrdj
Former World Community Grid Tech Joined: Oct 21, 2004 Post Count: 695 Status: Offline Project Badges: |
Can you post the stderr from one of these runs? To get the log, go to My Grid, select Result Status, find the result that showed this behavior, and click on its status.
Thanks, armstrdj |
||
|
rbotterb
Senior Cruncher United States Joined: Jul 21, 2005 Post Count: 401 Status: Offline Project Badges: |
Here are the details of the first WU that restarted several times from scratch for me last weekend before I gave up on it:
**************************************************
Result Log
Result Name: E208639_ 131_ C.28.C23H13NOSSeSi.01424705.4.set1d06_ 0--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[08:49:13] Number of jobs = 16
[08:49:13] Starting job 0,CPU time has been restored to 0.000000.
[08:56:31] Finished Job #0
[08:56:31] Starting job 1,CPU time has been restored to 348.927437.
[09:20:13] Finished Job #1
[09:20:13] Starting job 2,CPU time has been restored to 1479.965887.
[23:33:15] Number of jobs = 16
[23:33:15] Starting job 2,CPU time has been restored to 1479.965887.
Application exited with RC = 0xc000013a
[09:08:52] Number of jobs = 16
[09:08:52] Starting job 2,CPU time has been restored to 1479.965887.
[20:53:53] Number of jobs = 16
[20:53:53] Starting job 2,CPU time has been restored to 1479.965887.
[23:55:58] Number of jobs = 16
[23:55:58] Starting job 2,CPU time has been restored to 1479.965887.
Application exited with RC = 0xc000013a
[06:32:35] Number of jobs = 16
[06:32:35] Starting job 2,CPU time has been restored to 1479.965887.
Application exited with RC = 0xc000013a
[16:59:59] Number of jobs = 16
[16:59:59] Starting job 2,CPU time has been restored to 1479.965887.
Abort requested: Exiting
</stderr_txt>
]]> |
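For anyone who wants to check their own stderr the same way, here is a minimal Python sketch that counts how often each job start appears in a log like the one above. It is not a WCG or BOINC tool; the function names `summarize_starts` and `scratch_restarts` are made up for illustration, and the regular expression simply assumes the line format seen in this post.

```python
import re
from collections import Counter

# Assumed line format, taken from the stderr posted above:
# "[09:20:13] Starting job 2,CPU time has been restored to 1479.965887."
START_RE = re.compile(r"Starting job (\d+),CPU time has been restored to (\d+\.\d+)")

# stderr text copied from the post above (timestamps kept as posted).
LOG = """INFO: No state to restore. Start from the beginning.
[08:49:13] Number of jobs = 16
[08:49:13] Starting job 0,CPU time has been restored to 0.000000.
[08:56:31] Finished Job #0
[08:56:31] Starting job 1,CPU time has been restored to 348.927437.
[09:20:13] Finished Job #1
[09:20:13] Starting job 2,CPU time has been restored to 1479.965887.
[23:33:15] Number of jobs = 16
[23:33:15] Starting job 2,CPU time has been restored to 1479.965887.
Application exited with RC = 0xc000013a
[09:08:52] Number of jobs = 16
[09:08:52] Starting job 2,CPU time has been restored to 1479.965887.
[20:53:53] Number of jobs = 16
[20:53:53] Starting job 2,CPU time has been restored to 1479.965887.
[23:55:58] Number of jobs = 16
[23:55:58] Starting job 2,CPU time has been restored to 1479.965887.
Application exited with RC = 0xc000013a
[06:32:35] Number of jobs = 16
[06:32:35] Starting job 2,CPU time has been restored to 1479.965887.
Application exited with RC = 0xc000013a
[16:59:59] Number of jobs = 16
[16:59:59] Starting job 2,CPU time has been restored to 1479.965887.
Abort requested: Exiting
"""


def summarize_starts(stderr_text):
    """Count how often each (job number, restored CPU seconds) pair occurs."""
    starts = [(int(job), float(cpu)) for job, cpu in START_RE.findall(stderr_text)]
    return Counter(starts)


def scratch_restarts(stderr_text):
    """Count the 'No state to restore' messages, i.e. starts from zero."""
    return stderr_text.count("No state to restore")


if __name__ == "__main__":
    for (job, cpu), count in sorted(summarize_starts(LOG).items()):
        print(f"job {job}: started {count}x at restored CPU time {cpu:.1f} s")
    print("starts from zero:", scratch_restarts(LOG))
```

On this log, the count shows job 2 starting seven times at the same restored CPU time and only one "No state to restore" message. That suggests, though the log alone cannot confirm it, that the task kept resuming at the start of job 2 rather than from zero.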
||
|
rbotterb
Senior Cruncher United States Joined: Jul 21, 2005 Post Count: 401 Status: Offline Project Badges: |
Here is the second WU that eventually restarted itself too and never could get to a finish:
**********************************************
Result Log
Result Name: E208699_ 376_ C.29.C23H14N4SSi.01554253.2.set1d06_ 0--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[08:34:29] Number of jobs = 16
[08:34:29] Starting job 0,CPU time has been restored to 0.000000.
[08:41:05] Finished Job #0
[08:41:05] Starting job 1,CPU time has been restored to 316.541629.
[08:59:41] Finished Job #1
[08:59:41] Starting job 2,CPU time has been restored to 1205.716129.
[17:05:41] Finished Job #2
[17:05:41] Starting job 3,CPU time has been restored to 23284.522059.
[17:25:56] Finished Job #3
[17:25:56] Starting job 4,CPU time has been restored to 24254.661078.
[18:10:21] Finished Job #4
[18:10:21] Starting job 5,CPU time has been restored to 24941.689482.
[18:25:53] Finished Job #5
[18:25:53] Starting job 6,CPU time has been restored to 25680.510218.
[18:40:47] Finished Job #6
[18:40:47] Starting job 7,CPU time has been restored to 26400.158431.
[19:01:10] Finished Job #7
[19:01:10] Starting job 8,CPU time has been restored to 27370.843453.
Application exited with RC = 0xc000013a
[19:15:22] Finished Job #8
[19:15:22] Starting job 9,CPU time has been restored to 28056.109046.
INFO: No state to restore. Start from the beginning.
[20:10:19] Number of jobs = 16
[20:10:19] Starting job 0,CPU time has been restored to 0.000000.
Abort requested: Exiting
</stderr_txt>
]]> |
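The "restored to" values in a log like this one can also be used to estimate how much CPU time each job took, by subtracting consecutive start values. The Python sketch below does that for the "Starting job" lines copied from the post above. The helper name `per_job_cpu_seconds` is made up for illustration, and the approach assumes the restored value at each job start is the cumulative CPU time so far, which the log suggests but does not state.

```python
import re

# Assumed line format, taken from the stderr posted above.
START_RE = re.compile(r"Starting job (\d+),CPU time has been restored to (\d+\.\d+)")

# "Starting job" lines copied from the post above.
LOG = """[08:34:29] Starting job 0,CPU time has been restored to 0.000000.
[08:41:05] Starting job 1,CPU time has been restored to 316.541629.
[08:59:41] Starting job 2,CPU time has been restored to 1205.716129.
[17:05:41] Starting job 3,CPU time has been restored to 23284.522059.
[17:25:56] Starting job 4,CPU time has been restored to 24254.661078.
[18:10:21] Starting job 5,CPU time has been restored to 24941.689482.
[18:25:53] Starting job 6,CPU time has been restored to 25680.510218.
[18:40:47] Starting job 7,CPU time has been restored to 26400.158431.
[19:01:10] Starting job 8,CPU time has been restored to 27370.843453.
[19:15:22] Starting job 9,CPU time has been restored to 28056.109046.
[20:10:19] Starting job 0,CPU time has been restored to 0.000000.
"""


def per_job_cpu_seconds(stderr_text):
    """Estimate CPU seconds per job from consecutive job-start lines.

    Only pairs where the next line starts the following job are used, so a
    resume of the same job or a restart from job 0 is skipped.
    """
    starts = [(int(job), float(cpu)) for job, cpu in START_RE.findall(stderr_text)]
    durations = {}
    for (job, cpu), (next_job, next_cpu) in zip(starts, starts[1:]):
        if next_job == job + 1 and next_cpu >= cpu:
            durations[job] = next_cpu - cpu
    return durations


if __name__ == "__main__":
    durations = per_job_cpu_seconds(LOG)
    for job, seconds in sorted(durations.items()):
        print(f"job {job}: {seconds:9.1f} s ({seconds / 3600:.2f} h)")
    longest = max(durations, key=durations.get)
    print("longest job:", longest)
```

By this estimate, job 2 is by far the longest step in this WU, at roughly 22,079 CPU seconds (a little over 6 hours), while every other completed job took under 20 minutes. If progress is only saved between jobs, which is an assumption here, a session shorter than job 2 would lose that whole step on shutdown.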
||
|
Dagon
Cruncher Poland Joined: May 28, 2007 Post Count: 4 Status: Offline Project Badges: |
Hello All,
I have a question. Would it be possible to limit the number of CEP WUs crunched at one time? When crunching 2-3 CEP WUs plus 1 or 2 other WUs, everything goes smoothly on my quad. When crunching 4 at one time, the tasks very often restart, which is very irritating. I tried to set the maximum number of cores in the settings, but it doesn't work. Opting in to more projects doesn't help because the WUs are downloaded in batches of 4-6. Suspending the WUs is not always possible, as the CPU sometimes works for a while without my direct control. Is there any way to solve it? |
||
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3294 Status: Offline Project Badges: |
"Hello All, I have a question. Would it be possible to limit the number of CEP WUs crunched at one time? When crunching 2-3 CEP WUs plus 1 or 2 other WUs, everything goes smoothly on my quad. When crunching 4 at one time, the tasks very often restart, which is very irritating. I tried to set the maximum number of cores in the settings, but it doesn't work. Opting in to more projects doesn't help because the WUs are downloaded in batches of 4-6. Suspending the WUs is not always possible, as the CPU sometimes works for a while without my direct control. Is there any way to solve it?"
Go to Device Manager, select the profile you wish to change, scroll down till you find "Project Specific Settings" and choose the number you like. Then save :D
AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
AMD Ryzen 7 7730U 8C/16T 3.0 GHz |
||
|
|