| World Community Grid Forums
Thread Status: Active Total posts in this thread: 253
Mumak
Senior Cruncher Joined: Dec 7, 2012 Post Count: 477 Status: Offline Project Badges:
|
This CPU time problem is still present:
BETA_OET1_0000040_xEBGP_0317_1-- Pending Validation 12/1/14 07:39:21 12/2/14 01:16:45 0.00 / 1.96 49.6 / 0.0
BETA_OET1_0000040_xEBGP_0322_1-- Pending Validation 12/1/14 07:39:21 12/2/14 01:06:09 0.00 / 1.88 47.6 / 0.0
BETA_OET1_0000040_xEBGP_0361_0-- Valid 12/1/14 07:39:21 12/1/14 23:53:58 0.00 / 1.82 46.2 / 51.4
BETA_OET1_0000038_xEBGP_0651_0-- Valid 12/1/14 07:13:33 12/1/14 23:24:32 0.00 / 2.05 51.8 / 45.5

EDIT: Sorry, those above are probably not 7.07.

v7.07 seems to have several issues with estimated time reporting.

Name: BETA_OET1_0000307_xEBGP-OM_rig_0001_0
Application: Beta Test 7.07
Workunit name: BETA_OET1_0000307_xEBGP-OM_rig_0001
State: Running
Received: 12/2/2014 8:37:41 AM
Report deadline: 12/6/2014 8:36:54 AM
Estimated app speed: 3.04 GFLOPs/sec
Estimated task size: 7,367 GFLOPs
CPU time at last checkpoint: 00:00:00
CPU time: 02:44:08
Elapsed time: 02:44:28
Estimated time remaining: 00:14:48
Fraction done: 10.000%
Virtual memory size: 56.39 MB
Working set size: 57.84 MB

Name: BETA_OET1_0000307_xEBGP-OM_rig_1848_0
Application: Beta Test 7.07
Workunit name: BETA_OET1_0000307_xEBGP-OM_rig_1848
State: Running
Received: 12/2/2014 8:37:41 AM
Report deadline: 12/6/2014 8:36:54 AM
Estimated app speed: 3.04 GFLOPs/sec
Estimated task size: 7,367 GFLOPs
CPU time at last checkpoint: 00:00:00
CPU time: 02:45:17
Elapsed time: 02:45:37
Estimated time remaining: --
Fraction done: 98.338%
Virtual memory size: 58.80 MB
Working set size: 60.18 MB

After almost 4h this one reached ~99.9%, then went back to 3.333%.

[Edit 3 times, last edit by Mumak at Dec 2, 2014 12:02:54 PM] |
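For readers trying to make sense of the figures in the first task's properties, here is a minimal Python sketch of the arithmetic. The static estimate simply divides the reported task size by the reported app speed. The thread does not say how the client derives "Estimated time remaining"; the blend in `blended_remaining_s` is an assumption (a progress-based projection weighted by the square of the fraction done), chosen because it reproduces the 00:14:48 shown above. All function names are hypothetical.

```python
def hms_to_seconds(text: str) -> int:
    """Convert 'HH:MM:SS' to whole seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def seconds_to_hms(seconds: float) -> str:
    """Format seconds as 'HH:MM:SS', truncating fractions of a second."""
    total = int(seconds)
    return f"{total // 3600:02d}:{total % 3600 // 60:02d}:{total % 60:02d}"


def static_estimate_s(task_gflops: float, speed_gflops_per_s: float) -> float:
    """Up-front runtime estimate: task size divided by app speed."""
    return task_gflops / speed_gflops_per_s


def blended_remaining_s(elapsed_s: float, fraction_done: float,
                        static_total_s: float) -> float:
    """ASSUMED blend of the static estimate and a progress-based projection.

    The weight on the progress-based projection is fraction_done squared.
    This is an illustration, not a confirmed description of the client.
    """
    if fraction_done >= 1.0:
        return 0.0
    # A static estimate that has already been exceeded is raised to the
    # elapsed time, so it never contributes negative remaining time.
    static_total_s = max(static_total_s, elapsed_s)
    if fraction_done <= 0.0:
        return static_total_s - elapsed_s
    progress_total_s = elapsed_s / fraction_done
    weight = fraction_done ** 2
    total_s = weight * progress_total_s + (1.0 - weight) * static_total_s
    return total_s - elapsed_s


# Values copied from the first task's properties (7,367 read as 7367 GFLOPs).
static_s = static_estimate_s(7367, 3.04)
elapsed_s = hms_to_seconds("02:44:28")
print(seconds_to_hms(static_s))                                        # 00:40:23
print(seconds_to_hms(blended_remaining_s(elapsed_s, 0.10, static_s)))  # 00:14:48
```

Under these assumptions the static estimate is about 40 minutes, so a task that has already run for 2h44m has left it far behind, and a progress value that falls back to 10% yields a short remaining time that has little to do with the real workload.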
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Got one of the 7.07 betas with a single job in the task. Its current appearance will make many volunteers nervous: after several hours the remaining time drops to --- and is then likely to sit there for several more hours.

The task properties as of now, pulled with boinctasks:

Name: BETA_OET1_0000307_xEBGP-OM_rig_0579_0
Application: beta20 7.07
Workunit name: BETA_OET1_0000307_xEBGP-OM_rig_0579
State: Running
Received: 12/2/2014 8:58:37 AM
Report deadline: 12/6/2014 8:58:37 AM
Estimated app speed: 2,32 GFLOPs/sec
Estimated task size: 7.367 GFLOPs
CPU time at last checkpoint: 00:00:00
CPU time: 02:00:01
Elapsed time: 02:01:39
Estimated time remaining: --
Fraction done: 0,000%
Virtual memory size: 53,09 MB
Working set size: 54,44 MB
Directory: slots/3
Process ID: 6604

The only thing that serves as positive feedback is the CPU time continuing to increment, but a looping task could do that too. Less joyful is the fraction done showing as 0.000 percent when the tasks view prints 100 percent progress.

One of the multi-job tasks finished and logged CPU time properly at close, both locally and on the result status page (Windows 8.1 with the 7.2.33 agent):

7.07 beta20 BETA_OET1_0000054_xEBGP_0111_1 02:26:36 (02:24:02) 12/2/2014 11:16:28 AM 12/2/2014 11:18:54 AM 98,25 Reported: OK + 45.39 MB 43.87 MB

[Edit 1 times, last edit by Former Member at Dec 2, 2014 10:29:49 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Got four of the 7.07s. Those on the lappie finished normally, the deskside is still going as it's slower. All behaviour looks normal. CPU time is recorded properly for those that finished, and at least the two on the deskside started together and are running: no -148s this time.
[edit]Just checked the log and found that both pairs actually started two seconds apart, so I guess this doesn't prove that the -148 problem is fixed after all.[/edit]

One of the 7.05s finished yesterday with a time-out after many hours, but that too seems to have behaved as expected (apart from sizing issues, maybe).

[Edit 1 times, last edit by Former Member at Dec 2, 2014 10:32:06 AM] |
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1404 Status: Offline Project Badges:
|
No Beta 7.07 finished yet.
On my Linux VM from 7 tasks 5 dropped their displayed progress from a higher value to 10%.

BETA_OET1_0000307_xEBGP-OM_rig_0702_1 7.07 Beta Test 02:21:58 (02:21:36) 10,000 00:12:46 06 Dec 08:59:43 Running 99,7 [0] 02:21:36 53.00 MB 49.82 MB VM3
BETA_OET1_0000307_xEBGP-OM_rig_0952_0 7.07 Beta Test 02:21:28 (02:21:09) 10,000 00:12:43 06 Dec 08:59:43 Running 99,8 [0] 02:21:09 55.59 MB 52.41 MB VM3
BETA_OET1_0000307_xEBGP-OM_rig_1470_1 7.07 Beta Test 02:03:19 (02:02:45) 10,000 00:11:05 06 Dec 09:01:48 Running 99,5 [0] 02:02:45 55.38 MB 52.15 MB VM3
BETA_OET1_0000307_xEBGP-OM_rig_1670_0 7.07 Beta Test 02:06:26 (02:06:07) 10,000 00:11:22 06 Dec 09:01:49 Running 99,7 [0] 02:06:07 58.74 MB 55.55 MB VM3
BETA_OET1_0000307_xEBGP-OM_rig_1456_1 7.07 Beta Test 02:02:13 (02:02:06) 10,000 00:11:00 06 Dec 09:01:49 Running 99,9 [0] 02:02:06 55.34 MB 52.14 MB VM3

Edit: 6 now.

[Edit 1 times, last edit by Crystal Pellet at Dec 2, 2014 10:29:34 AM] |
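As a sanity check on the listing above, the percentage column (99,7 and so on) can be reproduced as CPU time divided by elapsed time. This short Python sketch assumes the time pair is printed as elapsed first and CPU in parentheses; that reading of the columns is an assumption, and the function names are hypothetical.

```python
def hms_to_seconds(text: str) -> int:
    """Convert 'HH:MM:SS' to whole seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def cpu_efficiency_percent(cpu_hms: str, elapsed_hms: str) -> float:
    """CPU time as a percentage of elapsed (wall-clock) time."""
    return 100.0 * hms_to_seconds(cpu_hms) / hms_to_seconds(elapsed_hms)


# (task suffix, elapsed, CPU) copied from the listing above.
rows = [
    ("rig_0702_1", "02:21:58", "02:21:36"),
    ("rig_0952_0", "02:21:28", "02:21:09"),
    ("rig_1470_1", "02:03:19", "02:02:45"),
    ("rig_1456_1", "02:02:13", "02:02:06"),
]

for name, elapsed, cpu in rows:
    print(name, round(cpu_efficiency_percent(cpu, elapsed), 1))
```

If that reading is right, these tasks are using nearly a full core throughout, so the fallback to 10% is a progress-reporting effect and not a sign that the tasks have stalled.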
||
|
|
deltavee
Ace Cruncher Texas Hill Country Joined: Nov 17, 2004 Post Count: 4894 Status: Offline Project Badges:
|
[quote]This CPU time problem is still present:
BETA_OET1_0000040_xEBGP_0317_1-- Pending Validation 12/1/14 07:39:21 12/2/14 01:16:45 0.00 / 1.96 49.6 / 0.0
BETA_OET1_0000040_xEBGP_0322_1-- Pending Validation 12/1/14 07:39:21 12/2/14 01:06:09 0.00 / 1.88 47.6 / 0.0
BETA_OET1_0000040_xEBGP_0361_0-- Valid 12/1/14 07:39:21 12/1/14 23:53:58 0.00 / 1.82 46.2 / 51.4
BETA_OET1_0000038_xEBGP_0651_0-- Valid 12/1/14 07:13:33 12/1/14 23:24:32 0.00 / 2.05 51.8 / 45.5[/quote]

Those look like yesterday's 7.05 WUs, not the 7.07s in the latest batch. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
7.07 Beta Test BETA_OET1_0000306_xEBGP-F_rig_0771_1 02:59:11 (02:52:58) [0] 02:52:58 96,53 40,000 00:43:00 06-12-2014 08:21 Running win7_32-tmc 84,3 °C 0 39.22 MB 51.39 MB
7.07 Beta Test BETA_OET1_0000307_xEBGP-OM_rig_0613_0 02:35:35 (02:30:34) [0] 02:30:34 96,77 94,822 - 06-12-2014 08:43 Running win7_32-tmc 84,3 °C 0 32.55 MB 49.57 MB

I missed them starting. Looks like these are stuck once again. No checkpoints after 2.5 and 3 hours.

oet = one endless task? ;-)

edit: Both tasks have cpu and wall clocks counting up. The first one doesn't budge from 40.000% progress. The latter one is progressing very slowly, 0.001 'percents per second', while the remaining time counter shows nil since I've noticed the Betas.

[Edit 1 times, last edit by Former Member at Dec 2, 2014 11:05:19 AM] |
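To put the 0.001 percent per second figure into perspective, a rough extrapolation is sketched below in Python. It assumes the observed rate stays constant until the task ends, which nothing in the thread guarantees, and the function names are hypothetical. A task whose progress does not move at all (like the one sitting at 40.000%) has no usable rate, so the sketch returns `None` for it.

```python
from typing import Optional


def extrapolated_remaining_s(progress_percent: float,
                             rate_percent_per_s: float) -> Optional[float]:
    """Seconds left if progress keeps advancing at a constant rate.

    Returns None when the rate is zero or negative, because no finite
    estimate can be extrapolated from a stalled progress value.
    """
    if rate_percent_per_s <= 0:
        return None
    return (100.0 - progress_percent) / rate_percent_per_s


def format_hms(seconds: float) -> str:
    """Format seconds as 'HH:MM:SS', rounded to the nearest second."""
    total = round(seconds)
    return f"{total // 3600:02d}:{total % 3600 // 60:02d}:{total % 60:02d}"


slow = extrapolated_remaining_s(94.822, 0.001)   # the task at 94.822%
stuck = extrapolated_remaining_s(40.000, 0.0)    # the task stuck at 40.000%

print(format_hms(slow))   # 01:26:18
print(stuck)              # None
```

At that rate the slower task would need roughly another hour and a half, on top of the 2.5 hours already spent.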
||
|
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1404 Status: Offline Project Badges:
|
[quote]On my Linux VM from 7 tasks 5 dropped their displayed progress from a higher value to 10%.[/quote]

On 2 Windows machines the drop to 10% progress also shows up after 2-3 hours of runtime. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
[quote]7.07 Beta Test BETA_OET1_0000306_xEBGP-F_rig_0771_1 02:59:11 (02:52:58) [0] 02:52:58 96,53 40,000 00:43:00 06-12-2014 08:21 Running win7_32-tmc 84,3 °C 0 39.22 MB 51.39 MB
7.07 Beta Test BETA_OET1_0000307_xEBGP-OM_rig_0613_0 02:35:35 (02:30:34) [0] 02:30:34 96,77 94,822 - 06-12-2014 08:43 Running win7_32-tmc 84,3 °C 0 32.55 MB 49.57 MB

I missed them starting. Looks like these are stuck once again. No checkpoints after 2.5 and 3 hours.

oet = one endless task? ;-)

edit: Both tasks have cpu and wall clocks counting up. The first one doesn't budge from 40.000% progress. The latter one is progressing very slowly, 0.001 'percents per second', while the remaining time counter shows nil since I've noticed the Betas.[/quote]

As uplinger stated, there are single-task work units that are hard to compute [and whose run time is very hard to estimate]. They are still looking to introduce intermediate checkpointing, of the kind that follows the "Write to Disk at most" setting. With this approach of forced checkpointing, multiple tasks started at the same time will likely also checkpoint simultaneously, as is the case with UGM at this time. I don't know what the impact on the user would be with such a solution. |
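The point about simultaneous checkpoints can be illustrated with a small Python simulation. It is only a sketch under stated assumptions: `CheckpointThrottle` and `checkpoint_times` are hypothetical names, not part of the BOINC API, and the model assumes each task is always willing to checkpoint and is held back only by a minimum interval (standing in for the "Write to disk at most" preference).

```python
class CheckpointThrottle:
    """Allows a checkpoint only after a minimum interval has passed."""

    def __init__(self, min_interval_s: int, start_time_s: int = 0) -> None:
        self.min_interval_s = min_interval_s
        self.last_checkpoint_s = start_time_s

    def due(self, now_s: int) -> bool:
        return now_s - self.last_checkpoint_s >= self.min_interval_s

    def mark_done(self, now_s: int) -> None:
        self.last_checkpoint_s = now_s


def checkpoint_times(start_s: int, run_for_s: int, min_interval_s: int,
                     step_s: int = 1) -> list:
    """Simulate one task and return the times at which it checkpoints."""
    throttle = CheckpointThrottle(min_interval_s, start_s)
    times = []
    now_s = start_s
    while now_s <= start_s + run_for_s:
        if throttle.due(now_s):
            times.append(now_s)       # the task writes its state to disk
            throttle.mark_done(now_s)
        now_s += step_s
    return times


# Two tasks started together hit the disk at the same moments ...
task_a = checkpoint_times(start_s=0, run_for_s=300, min_interval_s=60)
task_b = checkpoint_times(start_s=0, run_for_s=300, min_interval_s=60)
print(task_a == task_b)   # True

# ... while a task started two seconds later stays offset by two seconds.
task_c = checkpoint_times(start_s=2, run_for_s=300, min_interval_s=60)
print(task_c)             # [62, 122, 182, 242, 302]
```

In this simplified model the disk writes of tasks started together line up exactly, which is the clustering effect described above; how noticeable that would be on a real machine depends on checkpoint size and disk speed, which the thread does not cover.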
||
|
|
Mumak
Senior Cruncher Joined: Dec 7, 2012 Post Count: 477 Status: Offline Project Badges:
|
[quote]oet = one endless task? ;-)[/quote]

[Edit 1 times, last edit by Mumak at Dec 2, 2014 12:00:54 PM] |
||
|
|
rbotterb
Senior Cruncher United States Joined: Jul 21, 2005 Post Count: 401 Status: Offline Project Badges:
|
Of the two 7.07 WUs I had last night, one finished OK and seemed to clock its CPU time more normally. The other one is still running: its clock is now progressing (50% done), but last night the % complete sat at zero for periods of time before it started moving up. The one in progress still looks a bit odd:

BETA_OET1_0000306_xEBGP-F_rig_1527_1-- Pavilion-dv7 In Progress 12/2/14 07:14:12 12/6/14 07:14:12 0.00 / 0.00 0.0 / 0.0
BETA_OET1_0000051_xZAGP_0606_1-- Pavilion-dv7 Pending Validation 12/2/14 06:39:02 12/2/14 12:26:44 3.39 / 3.41 61.7 / 0.0

Running on Win 7, HP dv7, 4 cores, 1.6 GHz, 6 GB memory. |
||
|
|
|