Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 30
Posts: 30   Pages: 3   [ Previous Page | 1 2 3 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2703 times and has 29 replies Next Thread
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

I don't suspect the dialup situation is an issue, as the machine along with 3 others have been in this mode for several years...
Of course! Since this setup is unusual for me I had forgotten that it is your normal mode of operation and that other machines are running fine.

Doing a clean re-install is what I tried to avoid you (together with trying to understand what has gone wrong) but since you are ready for it that is certainly the simplest way.
However before going for it I would appreciate if you could go to your BOINC data directory (C:\Documents and Settings\All Users\Application Data\BOINC) and copy paste the <time_stats> and <net_stats> sections of file "client_state.xml" as some kind of post-mortem.

Next, uninstall BOINC again and completely erase the whole C:\Documents and Settings\All Users\Application Data\BOINC directory before re-installing.

After re-installing you may need to attach your client to WCG (I don't remember if the WCG installation module does it automatically). If so, from BOINC Manager use Tools->Attach to project... and when requested to identify you use your normal username and password.

Good luck. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
----------------------------------------
[Edit 1 times, last edit by JmBoullier at Jan 2, 2010 11:36:40 PM]
[Jan 2, 2010 11:35:42 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Two Left Feet
Cruncher
Joined: Mar 29, 2009
Post Count: 8
Status: Offline
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

I don't suspect the dialup situation is an issue, as the machine along with 3 others have been in this mode for several years...
Of course! Since this setup is unusual for me I had forgotten that it is your normal mode of operation and that other machines are running fine.

Doing a clean re-install is what I tried to avoid you (together with trying to understand what has gone wrong) but since you are ready for it that is certainly the simplest way.
However before going for it I would appreciate if you could go to your BOINC data directory (C:\Documents and Settings\All Users\Application Data\BOINC) and copy paste the <time_stats> and <net_stats> sections of file "client_state.xml" as some kind of post-mortem.

I'm willing to dissect the patient!

<time_stats>
<on_frac>1.000000</on_frac>
<connected_frac>1.000000</connected_frac>
<active_frac>0.000076</active_frac>
<cpu_efficiency>0.977400</cpu_efficiency>
<last_update>1293258597.968750</last_update>
</time_stats>
<net_stats>
<bwup>3792.407780</bwup>
<avg_up>564740.115592</avg_up>
<avg_time_up>1262484459.843750</avg_time_up>
<bwdown>9814.780478</bwdown>
<avg_down>770666.854625</avg_down>
<avg_time_down>1262469224.953125</avg_time_down>
</net_stats>
Next, uninstall BOINC again and completely erase the whole C:\Documents and Settings\All Users\Application Data\BOINC directory before re-installing.

After re-installing you may need to attach your client to WCG (I don't remember if the WCG installation module does it automatically). If so, from BOINC Manager use Tools->Attach to project... and when requested to identify you use your normal user name and password.
Noted, but I'll wait... until you've "picked my brain clean" (does that translate?) smile
Thanks and Cheerful Regards! Bruce

Good luck. Jean.

[Jan 3, 2010 5:14:19 AM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

Bruce,
Thanks again for your excellent feedback, it's a pleasure to work with you.
<active_frac>0.000076</active_frac>
This is definitely the reason why BOINC is not fetching work It's equivalent to saying that this client is active 6.5 seconds per day!
Now, why this parameter has become so ridiculous, we will probably never know. BOINC has a few bugs in its time tracking area, and when one of these parameters is going off like this one it's usually hopeless.

If you like to experiment before re-installing you might edit the file (while BOINC is stopped) and set active_frac to 1.000000. In case BOINC is checking that users are not playing too much with its parameters you might do it too in file client_state_prev.xml, although I am not sure it's necessary..
When you restart BOINC it should start fetching work normally again.

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Jan 3, 2010 10:56:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

Bruce,
Thanks again for your excellent feedback, it's a pleasure to work with you.
<active_frac>0.000076</active_frac>
This is definitely the reason why BOINC is not fetching work It's equivalent to saying that this client is active 6.5 seconds per day!
Now, why this parameter has become so ridiculous, we will probably never know. BOINC has a few bugs in its time tracking area, and when one of these parameters is going off like this one it's usually hopeless.

You've overlooked one very important parameter:
<last_update>1293258597.968750</last_update>

Since 1290* is November 2010, would guess 1293* most likely is in January 2011, or maybe December 2010.

In any case, it's a completely wrong date, and the very low active_frac is due to this. Also, since is running v6.2.xx, the time-stats won't be updated until after the "last updated"-date (meaning around 2011)...

To fix, the easiest is just to shut-down BOINC, making sure BOINC isn't running (use task manager). Afterwards, open-up client_state.xml in something like notepad, and just remove the whole <time_stats> </time_stats> -part of the file, and save it. On next startup, all parameters will be back to 1 again, and working from there.


It's also possible to edit the active_frac, but if so you'll also need to edit <last_updated> to something like 12626*. But, this gives more room for errors, so my recommendation is to just delete the whole time_stats-block.

As for client_state_prev.xml, it's not neccessary to edit this file, since it's only used in cases can't read the whole client_state.xml-file.
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Jan 4, 2010 3:37:59 AM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

Welcome back Ingleside!
I was sure that you would have interesting things to say about this case but the holiday season is for everybody... smile
I thought of checking the last update date, unfortunately I did not feel motivated enough. For debugging purpose It would be nice if this date were also stored in human readable format in client_state.xml...

So now we know why the active_frac was so low. Maybe Bruce will remember a possible reason why the last_update date was wrong, otherwise that will remain a mystery.

Last, thanks for the tip for resetting the time stats safely. I'll try to not forget it.
Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Jan 4, 2010 7:40:47 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

Welcome back Ingleside!
I was sure that you would have interesting things to say about this case but the holiday season is for everybody... smile

Thanks, and a happy new year to you and everyone else. cool

I thought of checking the last update date, unfortunately I did not feel motivated enough. For debugging purpose It would be nice if this date were also stored in human readable format in client_state.xml...

It would be nice, but not very important after the various changes included in later BOINC-clients. It was probably v6.4.x that started detecting if the date has been set in the future, and starts updating time_stats again. And v6.10.xx doesn't decay the time_stats if client has been off for more than 10 days, since this most likely is due to a holiday and not due to user running client very infrequently.

The deadlines will most likely get screwed-up, but these is human readable in the manager.

So now we know why the active_frac was so low. Maybe Bruce will remember a possible reason why the last_update date was wrong, otherwise that will remain a mystery.

Can only guess, but these errors most commonly is due to user-error, by user manually changing the date and afterwards correcting it again. Seeing it's just active_frac that's very low, this has been done while BOINC-client was running.

Another possibility is a bad battery so losing the clock on power-off, but this normally resets the date somewhere in the past and not in the future. Also, this would give very low on_frac.

It's possible, but not very likely, that an automatic time-synch has been severely wrong.
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Jan 4, 2010 1:02:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Two Left Feet
Cruncher
Joined: Mar 29, 2009
Post Count: 8
Status: Offline
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

So now we know why the active_frac was so low. Maybe Bruce will remember a possible reason why the last_update date was wrong, otherwise that will remain a mystery.

Can only guess, but these errors most commonly is due to user-error, by user manually changing the date and afterwards correcting it again. Seeing it's just active_frac that's very low, this has been done while BOINC-client was running.

User error! I confess to the clock changes.

I'll be more careful in the future.

I removed "time_stats" section from client_state.xml, restarted and have my typical work buffer restored.

Thanks for everyone's help! Bruce
[Jan 4, 2010 11:28:25 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

This latest code insert, the underlying details I don't know, infers that work will be fetched when cores are idle... but then, cores should not have been idle to begin with on mitrichr's I7, no matter what cache setting
David  12 Jan 2010
350 - client: small tweak to work fetch:
351 if project has crazy DCF, don't automatically request 1 sec;
352 only request work if there's a shortfall.

This is coming in a next release for Windows I suppose, i.e. after the latest 6.10.25 alpha and curious if it does fix the I7 issue.

Edit: And this one, well, I take code knowledgeable assertions how it should work versus how it does work anyway with reservation... I believe it when I see it. The last line is telling, and that's from the chief developer ;>)
David  5 Jan 2010
92 - client: scheduling problem:
93 - a project overestimates job FLOP counts
94 - the client starts jobs in EDF mode
95 - as job progresses and fraction done increases,
96 its completion time estimate decreases until
97 it's no longer a deadline miss.
98 - job gets preempted by other job from that project;
99 you end up with lots of partly completed jobs.
100 Solution (I hope): if an app version has running jobs,
101 compute a "temp DCF" for the app version,
102 which is the min of dynamic/static estimates for its jobs.
103 Apply this scaling factor to completion time estimates
104 for unstarted jobs in RR simulation
105 - client: the estimation of remaining time of running jobs was wrong
106 (how did this bug survive so long?)

Maybe we get now more accurate and quicker adjustment of the TTC's :D
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Jan 14, 2010 11:54:01 AM]
[Jan 14, 2010 11:44:15 AM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

compute a "temp DCF" for the app version,
One step toward DCFs per application? Let's keep hoping...
(how did this bug survive so long?)
Go figure... laughing
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Jan 14, 2010 6:20:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: "got 0 new tasks"

Well, the temp_DCF was removed again with a comment that it did not work.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Jan 24, 2010 8:08:36 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 30   Pages: 3   [ Previous Page | 1 2 3 ]
[ Jump to Last Post ]
Post new Thread