Thread Status: Locked
Total posts in this thread: 296
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3715
Re: New Beta Starting 2011/07/22

I am watching an ace80 for the first time...
1. It is progressing at a rate of about 1h20 in total (12.5% after 10 min elapsed), but
2. CPU time is stuck at 17 seconds from the beginning

I have checked the CPU time via Properties, BoincTasks (CPU time and Checkpoints fields) and the anomaly is consistent in all places.

Later...
First checkpoint has been taken now.
Elapsed: 14:27
CPU time at last checkpoint: 9:49
CPU time stuck again at 10:06
CPU time since last checkpoint still showing 17 seconds.

Puzzled...

Edit: OK. After waiting for the 3rd checkpoint it seems that CPU time is accounted correctly at each checkpoint only.
No significant loss of CPU time, just lousy reporting. :(
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
----------------------------------------
[Edit 1 times, last edit by JmBoullier at Jul 29, 2011 8:28:42 PM]
[Jul 29, 2011 8:12:22 PM]
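The 1h20 figure in the post above comes from extrapolating percent complete against elapsed time. A minimal sketch of that arithmetic, assuming progress is linear over the whole work unit (which may not hold on every machine); the function name `projected_total_minutes` is made up for illustration:

```python
def projected_total_minutes(elapsed_minutes: float, percent_complete: float) -> float:
    """Linearly extrapolate total runtime from the progress reported so far."""
    if percent_complete <= 0:
        raise ValueError("percent_complete must be positive")
    return elapsed_minutes * 100.0 / percent_complete


# 12.5% after 10 minutes elapsed, as reported in the post above
print(projected_total_minutes(10, 12.5))  # 80.0 minutes, i.e. 1h20
```

The estimate is only as good as the linearity assumption, so it is best treated as a rough guide until a few checkpoints have passed.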
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Re: New Beta Starting 2011/07/22

coolstream, depends on your OS. But they are located in the slots directory in your BOINC Data directory. BOINC data directory is shown at the very beginning of your Messages in the BOINC agent.

On Windows it's probably in C:\ProgramData\BOINC

You'll see directories with numbers... go into one of those and you'll see stderr.txt

-Uplinger
[Jul 29, 2011 8:16:12 PM]
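For anyone who would rather script the lookup described above, here is a minimal sketch. It assumes the layout from the post (a `slots` directory inside the BOINC data directory, with numbered subdirectories that may each hold a `stderr.txt`); the function name `find_slot_stderr` is hypothetical, and the Windows path is only the "probably" default mentioned in the post.

```python
from pathlib import Path


def find_slot_stderr(data_dir: str) -> list[Path]:
    """Return stderr.txt files under <data_dir>/slots/<number>/, ordered by slot number."""
    slots = Path(data_dir) / "slots"
    if not slots.is_dir():
        return []
    found = []
    for slot in slots.iterdir():
        # Only numbered slot directories are of interest.
        if slot.is_dir() and slot.name.isdigit():
            candidate = slot / "stderr.txt"
            if candidate.is_file():
                found.append(candidate)
    return sorted(found, key=lambda p: int(p.parent.name))


if __name__ == "__main__":
    # Assumed default location on Windows; check the start of the BOINC
    # Messages tab for the real data directory on your machine.
    for path in find_slot_stderr(r"C:\ProgramData\BOINC"):
        print(path)
```

If the data directory is somewhere else, pass that path instead; the function simply returns an empty list when no `slots` directory is found.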
kateiacy
Veteran Cruncher
USA
Joined: Jan 23, 2010
Post Count: 1027
Re: New Beta Starting 2011/07/22

I have one running at 60% CPU usage on a Linux box. In "Properties" it just showed:

CPU time at last checkpoint 16:28
CPU time 16:51
Elapsed time 11:29
Percent complete 12.5%

Weird elapsed time...

Edit: just checked the other laptop that's running at 60% CPU usage, and it's also showing shorter elapsed time than CPU time for its beta.

The two desktops running at 100% CPU usage have elapsed time larger than CPU time for the betas. Either the efficiency is lower than with other WUs, or CPU time isn't updating very often.

I now have 5 Beta-Beta's spread across 4 Linux machines -- 2 moderately fast machines and 2 much slower ones. On all 4, the percent complete is consistently looking as if the total time will be 80 minutes. Edit: that is not holding. The slower machines now are incrementing percent completed more slowly.
[Edit 2 times, last edit by kateiacy at Jul 29, 2011 8:42:16 PM]
[Jul 29, 2011 8:20:14 PM]
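The comparison in the post above boils down to the ratio of CPU time to elapsed time. A small sketch of that calculation, using the Properties values quoted above; the helper names `to_seconds` and `cpu_efficiency` are made up here, and the reading that a ratio above 1 is anomalous assumes the task runs on a single thread:

```python
def to_seconds(clock: str) -> int:
    """Convert 'MM:SS' or 'H:MM:SS' into a number of seconds."""
    seconds = 0
    for part in clock.split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


def cpu_efficiency(cpu_time: str, elapsed_time: str) -> float:
    """Ratio of CPU time to elapsed (wall-clock) time."""
    elapsed = to_seconds(elapsed_time)
    if elapsed == 0:
        raise ValueError("elapsed time must be non-zero")
    return to_seconds(cpu_time) / elapsed


# CPU time 16:51 against elapsed time 11:29, as shown in Properties above
ratio = cpu_efficiency("16:51", "11:29")
print(f"{ratio:.2f}")  # 1.47
```

A value this far above 1 points at how the two clocks are being reported rather than at real efficiency.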
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: New Beta Starting 2011/07/22

The CEP2 vanishing time phenomenon revisited? I'll wait until completion to see how wallclock, elapsed and CPU times relate before offering a first opinion.

I'm on Linux Ubuntu 11.04 kernel 2.6.38.10 bld 47. It's a good build... getting almost consistently 99.8-99.9% efficiency for DDDT2 on all 4 cores, sometimes it even shows a round 100% i.e. 99.95% or better... all viewed via BOINCTasks.

--//--
[Jul 29, 2011 8:32:00 PM]
mfbabb2
Senior Cruncher
USA
Joined: Feb 18, 2011
Post Count: 361
Re: New Beta Starting 2011/07/22

Only got 1 Beta (out of 9 cores). :'(
BETA_BETA_ace80_0000000_2478
1.250% after 6 minutes -- est is 8:40:00 remaining.

3.750% -- 00:20:00 -- 08:28:32 (no checkpoint yet) working mem 143.13MB

Got another (same machine) -- est 08:43:15.

Checkpoints at 45:27 and 36:29 minutes.
12.500% and 13.750%.
01:11:27 and 00:52:09 elapsed.
07:31:19 and 06:44:05 remaining.

4 of 5 machines have not received any of these Betas.
----------------------------------------
Murphy

----------------------------------------
[Edit 1 times, last edit by mfbabb2 at Jul 29, 2011 8:43:50 PM]
[Jul 29, 2011 8:43:01 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Re: New Beta Starting 2011/07/22

There are still 8000 results waiting to be sent.

Thanks,
-Uplinger
[Jul 29, 2011 8:45:18 PM]
mfbabb2
Senior Cruncher
USA
Joined: Feb 18, 2011
Post Count: 361
Re: New Beta Starting 2011/07/22

Send 'em my way, pls.
----------------------------------------
Murphy

[Jul 29, 2011 8:46:31 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: New Beta Starting 2011/07/22

Ok, just a heads up...on initial inspection, these look good.

The science does a calculation against ligands of different sizes. The size of the ligand determines the runtime per job. We are still working to estimate this properly, but these workunits were not estimated. Some of the smaller ligands in the quick runs checkpointed every 40 seconds, which would come out to about 6-7 minutes with these workunits. There are, however, ligands that ran for 10 minutes on the quicker workunits, which would cause them to checkpoint every 1.5 hours. As you can see, there is a large difference in speed based on the ligand.

In your stderr, you should see pdbqt = # # asdfasdfadf

These two numbers represent the size of the ligand. The first number has so far varied between 10 and 38, the second between 0 and 15. The smaller the combination of the two, the shorter the job.

Thanks,
-Uplinger

This is what I see in a Windows job, with 8 of these jobs packaged.

INFO: No state to restore. Start from the beginning.
[21:01:01] Number of tasks = 8
[21:01:01] Starting job 0,CPU time is 0.000000.
[21:01:01] ZINC04935517.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0
[21:52:59] Finished Job #0 cpu time used 3003.718750
[21:52:59] Starting job 1,CPU time is 3003.718750.
[21:52:59] ZINC04935517.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0

97% efficiency, 25% after 2 of these tasks inside the work unit.

--//--
[Jul 29, 2011 8:47:06 PM]
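The stderr lines quoted above carry enough information to pull out the ligand size and a per-job efficiency. A rough sketch, under several assumptions: that every stderr.txt follows the exact line formats shown in the post, that "cpu time used" is cumulative across the work unit (suggested by job 1 starting at the same value job 0 finished on), and that the timestamps do not cross midnight. The regular expressions and the function name `summarize_jobs` are illustrative only.

```python
import re
from datetime import datetime

# Sample lines copied from the post above.
LOG = """\
[21:01:01] Number of tasks = 8
[21:01:01] Starting job 0,CPU time is 0.000000.
[21:01:01] ZINC04935517.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0
[21:52:59] Finished Job #0 cpu time used 3003.718750
[21:52:59] Starting job 1,CPU time is 3003.718750.
[21:52:59] ZINC04935517.pdbqt size = 31 5 ../../projects/www.worldcommunitygrid.org/beta13.target_ace.pdbqt size = 4660 0
"""

START = re.compile(r"^\[(\d\d:\d\d:\d\d)\] Starting job (\d+),CPU time is (\d+\.\d+)\.")
SIZE = re.compile(r"^\[(\d\d:\d\d:\d\d)\] (\S+\.pdbqt) size = (\d+) (\d+)")
FINISH = re.compile(r"^\[(\d\d:\d\d:\d\d)\] Finished Job #(\d+) cpu time used (\d+\.\d+)")


def parse_clock(stamp: str) -> datetime:
    """Parse an HH:MM:SS log timestamp (the date part is irrelevant here)."""
    return datetime.strptime(stamp, "%H:%M:%S")


def summarize_jobs(log_text: str) -> dict:
    """Collect ligand size, wall-clock seconds and CPU seconds per job number."""
    jobs = {}
    current = None
    for line in log_text.splitlines():
        if m := START.match(line):
            current = int(m.group(2))
            jobs[current] = {"start": m.group(1), "cpu_start": float(m.group(3))}
        elif (m := SIZE.match(line)) and current is not None:
            # The first .pdbqt on the line is the ligand; its two numbers are the size.
            jobs[current]["ligand"] = m.group(2)
            jobs[current]["size"] = (int(m.group(3)), int(m.group(4)))
        elif m := FINISH.match(line):
            job = jobs.get(int(m.group(2)))
            if job is None:
                continue
            wall = parse_clock(m.group(1)) - parse_clock(job["start"])
            job["wall_seconds"] = wall.total_seconds()
            job["cpu_seconds"] = float(m.group(3)) - job["cpu_start"]
    return jobs


for number, job in summarize_jobs(LOG).items():
    if "wall_seconds" in job:
        efficiency = job["cpu_seconds"] / job["wall_seconds"]
        print(number, job["ligand"], job["size"], f"{efficiency:.1%}")
```

Jobs that have started but not yet finished (job 1 in the sample) are kept in the result without timing fields, so the summary can be rerun as the file grows.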
pirogue
Veteran Cruncher
USA
Joined: Dec 8, 2008
Post Count: 685
Re: New Beta Starting 2011/07/22

There are still 8000 results waiting to be sent.

Thanks,
-Uplinger

Send 'em my way, pls.

No kidding. I'm getting one every 15 minutes or so. At that rate, it's going to take around a week to get all my machines topped off.
[Jul 29, 2011 8:51:08 PM]
Dataman
Ace Cruncher
Joined: Nov 16, 2004
Post Count: 4865
Re: New Beta Starting 2011/07/22

There are still 8000 results waiting to be sent.

Thanks,
-Uplinger

Thanks uplinger. I have not seen any in an hour.

First one completed at 1:34:51 clock & 1:34:43 CPU (if you can believe BOINCTasks ;) ). WUs where GPUs are idle are running at 99.60% or above, and ones with twin GPUs are running at 98.10% or above. Looks very good from my point of view!
[Jul 29, 2011 8:53:31 PM]