Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 27
Posts: 27   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3504 times and has 26 replies Next Thread
trongnguyen_82
Cruncher
Joined: Aug 10, 2006
Post Count: 10
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

From lawrencehardin:

The progress bar moves backwards if an 'attempt' fails. The program starts over again with a slightly different 'attempt'.


From Van Fanel:

As a possible way around it, I would advice to only update the progress bar when the step has been successful. In other words, instead of letting the progress bar advance in small bits, only update it after a successful iteration. The only set back is that the progress bar would seem static for most of the time...


If it's the nature of the new FAAH to try several attempts until it finds a good solution to get out of the 'lengthy loop', I would prefer to see the progress bar moving back and forth instead of sticking at a number for some 'long' times. At least, it differentiate this behaviour from the bug when 'a workunit stuck at xx,xx% for several hours', in which we might have to suspend/reset BOINC to fix it.
----------------------------------------

[Apr 19, 2008 4:59:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

Well I'm puzzled as I haven't seen this progress issue as yet on the few I've crunched - wondering if its dependent on boinc version?
[Apr 19, 2008 5:19:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
breathesgelatin
Advanced Cruncher
Joined: Aug 5, 2006
Post Count: 117
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

Hello trongnguyen_82,
Yes, the progress indicator on these FAAH units proves in a very annoying way that we are running a non-deterministic algorithm. It is an irritating mismatch to the standard BOINC framework. I foresee lots of forum queries. I wonder why the new FAAH program seems so much more blatant about it? Is this primarily caused by the application change or by a new set of molecules?

Lawrence


You're probably going to have lots of people also aborting the new WUs in confusion. I really think that you need to somehow solve the problem if possible. I agree that only making it move might be a successful move - especially for the regular cruncher that might not check the forums that often. For people who crunch test WUs, seeing the progress bar move back and forth is OK once we know why, but I fear it would be too disconcerting for general users.
----------------------------------------

[Apr 19, 2008 7:05:56 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

We will take a good look at this issue before we release these versions into production. If you see my post here: https://secure.worldcommunitygrid.org/forums/wcg/viewthread?thread=19803 you will see that we have begun the process of preparing to release the new version. This process will take several days and hopefully we can get this cleared up by then.
[Apr 20, 2008 1:25:24 AM]   Link   Report threatening or abusive post: please login first  Go to top 
breathesgelatin
Advanced Cruncher
Joined: Aug 5, 2006
Post Count: 117
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

We will take a good look at this issue before we release these versions into production. If you see my post here: https://secure.worldcommunitygrid.org/forums/wcg/viewthread?thread=19803 you will see that we have begun the process of preparing to release the new version. This process will take several days and hopefully we can get this cleared up by then.


I think that's a good idea to review everything.

I also want to note that you lose time crunched when the progress bar goes back... sometimes, not always. For example when it jumps back to 25% or 50% or wherever (depending on where it is in the process), you sometimes also lose the time crunched--sometimes as much as 45 minutes. While this doesn't really bother me in terms of helping out WCG, it will probably bother general users hoping to get full credits etc.
----------------------------------------

[Apr 20, 2008 2:25:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

My beta also reacts in the same way, bouncing back and fourth between 23.8% and 24.9%. The CPU time is at 15 minutes, so I guess it will continue to loop for a while.

I am sure I have seen this same phenomenon several weeks ago, although I am unable to find anything related in the forum (but that only means that I have not found the right search key smile ).

As far as I remember it was around the same percentages, and after a few minutes it passed the 25 % mark and was completing as usual.

Apparently that has not triggered a big rush of questions or problem reports... smile

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Apr 20, 2008 2:36:46 AM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

I also want to note that you lose time crunched when the progress bar goes back... sometimes, not always. For example when it jumps back to 25% or 50% or wherever (depending on where it is in the process), you sometimes also lose the time crunched--sometimes as much as 45 minutes. While this doesn't really bother me in terms of helping out WCG, it will probably bother general users hoping to get full credits etc.


I need more information about this. The only reason your cpu time would go backwords is if the science app was stopped (i.e. if you shut down your client, rebooted your computer etc). In that case your cpu time would revert to the time at the last checkpoint.

Can you please create a cc_config.xml file with the following options in your BOINC installation directory?

<cc_config>
<log_flags>
<checkpoint_debug>1</checkpoint_debug>
<task_debug>1</task_debug>
</log_flags>
</cc_config>

Restart your client to start it logging. Once it has re-occurred, then please post all of the messages.

Anyone experiencing this issue, please do the same.

Also please make sure you are using the 5.10.45 client (you can get it here: https://secure.worldcommunitygrid.org/ms/viewDownloadAgain.do )

thanks,
Kevin
[Apr 20, 2008 2:00:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

Think the percent alternation from 24.9 to 23.x and on is just the known multi docking attempts on finding best energy (graphics green en red dotted lines) as commented already by Lawrence, and the reverting indeed a client restart.

The Checkpoint FAQ provides information how combined with the checkpoint log flag, post knreed, to minimize that loss, if interested to micro-manage.

http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=11332

sample of the old UD agent graph which in it's BOINC graphics incarnation is much smaller:



This has always been with FA@H and particular on the short units will appear amplified.... I looked at the few beta's and saw this too. But, the little green line was ever extending until near the end where it goes into the 'best energy or move on to next section' routine.

[edit: Best Energy Graphic illustration refreshed showing new and old agent with explanation of red and green line]
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Apr 23, 2008 5:17:08 PM]
[Apr 20, 2008 2:14:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
breathesgelatin
Advanced Cruncher
Joined: Aug 5, 2006
Post Count: 117
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

I also want to note that you lose time crunched when the progress bar goes back... sometimes, not always. For example when it jumps back to 25% or 50% or wherever (depending on where it is in the process), you sometimes also lose the time crunched--sometimes as much as 45 minutes. While this doesn't really bother me in terms of helping out WCG, it will probably bother general users hoping to get full credits etc.


I need more information about this. The only reason your cpu time would go backwords is if the science app was stopped (i.e. if you shut down your client, rebooted your computer etc). In that case your cpu time would revert to the time at the last checkpoint.

Can you please create a cc_config.xml file with the following options in your BOINC installation directory?

<cc_config>
<log_flags>
<checkpoint_debug>1</checkpoint_debug>
<task_debug>1</task_debug>
</log_flags>
</cc_config>

Restart your client to start it logging. Once it has re-occurred, then please post all of the messages.

Anyone experiencing this issue, please do the same.

Also please make sure you are using the 5.10.45 client (you can get it here: https://secure.worldcommunitygrid.org/ms/viewDownloadAgain.do )

thanks,
Kevin


I am using the 5.10.45 client on all my boxes.

I will get you the info you need, but I don't know how to change the XML file. You're going to have to explain that more to me.
----------------------------------------

[Apr 20, 2008 4:26:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Beta_faah4000 Workunits Problem

Just open it with a flat text editor like notepad. The file may already exist in either the BOINC Program or Data directory. When saving, make sure to flip the default txt extension, so it retains it's proper cc_config.xml name.

You don't need to restart the client. Simply visit the Advanced menu and take the 'read config file' option to add the checkpoint message recording.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Apr 20, 2008 4:33:10 PM]
[Apr 20, 2008 4:31:49 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 27   Pages: 3   [ Previous Page | 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread