Total posts in this thread: 149
pacerintl
Cruncher
USA
Joined: Nov 7, 2006
Post Count: 47
Re: this is a really long work unit

I've had quite a few of these jobs run long too. One ran for over ninety hours and hit a computation error just before finishing, so I got 0 credit for over 90 hours of work. This has happened a few times; sometimes I get partial credit, sometimes nothing. Should I just kill jobs that are running over 20 hours?
[Aug 7, 2008 11:22:36 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: this is a really long work unit

Hello pacerintl,
knreed runs a script every few days to award credit to long FAAH units that have timed out. Hasn't he caught any of your over-long units?

Lawrence
[Aug 7, 2008 11:26:46 PM]
mikaado
Cruncher
Joined: Dec 3, 2007
Post Count: 14
Re: this is a really long work unit

"Should I just kill jobs that are running over 20 hours?"

I had two of these monster WUs, and one of them went all right. But the other was at 98% after 42 hours. Then I shut down my computer, and now it has started calculating from the beginning again. Well, let's see how far it goes this time :D
[Aug 8, 2008 7:05:41 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: this is a really long work unit

hmmm, that's no good,

The recommended shutdown, particularly when working on monster tasks, is to open the BOINCmgr Advanced View, go into the Advanced menu, and choose "Shut Down Connected Client...". The reason for using this option rather than exiting BOINC directly is that it also stops the core client (BOINC.exe), even when it is running as a service, provided you have permission.

Why shut down manually? Because some OSes can be too quick and not allow BOINC to finish writing the checkpoint save file from which a task could resume. Vista in particular, with pre-5.10.35 BOINC versions, suffers from this problem, which includes the risk of job corruption and complete loss.
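
To illustrate why a half-written checkpoint file is a problem, here is a generic sketch of a common way to make such a write all-or-nothing: write to a temporary file, flush it to disk, then rename it over the old file. This is not taken from BOINC or from the science applications; the function name `write_checkpoint` and the file names are made up for the example.

```python
import os
import tempfile


def write_checkpoint(path: str, data: bytes) -> None:
    """Replace `path` with `data` so a reader never sees a partial file.

    Hypothetical helper for illustration only, not BOINC's own code.
    """
    directory = os.path.dirname(os.path.abspath(path))
    # Create the temporary file in the same directory so the final
    # rename stays on one volume.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".ckpt-")
    try:
        with os.fdopen(fd, "wb") as tmp:
            tmp.write(data)
            tmp.flush()
            os.fsync(tmp.fileno())  # ask the OS to push the bytes to disk
        # Swap the finished file into place in a single step.
        os.replace(tmp_path, path)
    except BaseException:
        # Clean up the temporary file if anything went wrong.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```

If the machine goes down before the final rename, the previous checkpoint is still intact. With very large files the write and flush can take a long time, which is why a hurried OS shutdown is risky.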

ciao

Note to self: A candidate for a short and separate FAQ Index item and cross reference to the Checkpoint Save article.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Aug 8, 2008 8:38:23 AM]
mikaado
Cruncher
Joined: Dec 3, 2007
Post Count: 14
Re: this is a really long work unit

This is the first time I've had any problems with the BOINC client. I'm running the 5.10.45 client on Vista x64. But I think that the shutdown might be the issue. Just thought the techs might like to know.
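
As a side note on the version numbers: a small sketch of how one might check whether a client is older than the 5.10.35 threshold mentioned earlier in the thread. The helper names are made up, and the only fact taken from the thread is the threshold itself. Comparing the parts as numbers matters, because plain string comparison gets multi-digit parts wrong.

```python
def parse_version(text: str) -> tuple:
    """Turn a dotted version such as '5.10.45' into (5, 10, 45)."""
    return tuple(int(part) for part in text.split("."))


# Threshold quoted in the thread; everything older is "pre 5.10.35".
THRESHOLD = parse_version("5.10.35")


def is_pre_threshold(version: str) -> bool:
    """True if `version` is older than the 5.10.35 threshold."""
    return parse_version(version) < THRESHOLD


print(is_pre_threshold("5.10.45"))  # the client reported in this post
```

By this check 5.10.45 is not in the "pre 5.10.35" group, so the version alone would not explain the lost progress.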
[Aug 8, 2008 9:07:30 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Re: this is a really long work unit

"This is the first time I've had any problems with the BOINC client. I'm running the 5.10.45 client on Vista x64. But I think that the shutdown might be the issue. Just thought the techs might like to know."

That's what I thought, and know... Follow the link in the Checkpoint FAQ, which has a link to the Vista FAQ at the bottom. The data location is important and should NOT be in the C:\Program Files structure. BOINC 6 will put it in the C:\ProgramData structure.

Very large files require substantial time to be written and closed, and experience showed that even a forced delay of 60 seconds was not enough for some, hence the manual shutdown recommendation.
[Aug 8, 2008 10:07:08 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: this is a really long work unit

So what should I do with queued work that has no way of making the deadline?
[Aug 9, 2008 3:57:23 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: this is a really long work unit

Hello Questar,
knreed extended the deadline some time ago. If you already had these units, then the server knows about the new deadline, even if your client does not. If these are recent work units that you cannot complete, then go ahead and abort them.

Lawrence
[Aug 9, 2008 11:22:36 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: this is a really long work unit

Thanks for the reply.
[Aug 10, 2008 12:06:56 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Re: Second really long work unit received

Credit has now been awarded to all results for FightAIDS@Home that were running the 'monster' workunits.

If the cpu_time reported was more than 10,000 seconds, then credit and runtime were awarded based upon what the computer claimed and reported.

If the cpu time reported was less than 10,000 seconds, then credit and runtime were awarded based upon the averages of 550 BOINC Credits and a runtime of 230699 seconds (~64.1 hours).

This second case was just run for the first time. We awarded the following:

BOINC Credit: 239250
WCG Points: 1674750
Results: 435
Run Time: 100354065 seconds (3.2 years)
Hosts: 348
Users: 284
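
For anyone who wants to check the figures above, here is a rough sketch of the award rule as it is described in the quoted announcement, plus the arithmetic behind the totals. It is not the actual server script. The function name is made up, and how a cpu_time of exactly 10,000 seconds is handled is an assumption, since the announcement does not say.

```python
CPU_TIME_THRESHOLD_S = 10_000   # from the announcement
AVG_CREDIT = 550                # average BOINC credit per result
AVG_RUNTIME_S = 230_699         # average runtime per result, in seconds


def award(cpu_time_s: float, claimed_credit: float) -> tuple:
    """Return (credit, runtime_s) under the rule as described.

    Assumption: exactly 10,000 seconds falls into the "average" case.
    """
    if cpu_time_s > CPU_TIME_THRESHOLD_S:
        # First case: use what the computer claimed and reported.
        return claimed_credit, cpu_time_s
    # Second case: use the averages.
    return AVG_CREDIT, AVG_RUNTIME_S


# Totals for the second case, which covered 435 results.
results = 435
total_credit = results * AVG_CREDIT
total_runtime_s = results * AVG_RUNTIME_S
total_years = total_runtime_s / (365.25 * 24 * 3600)

print(total_credit)            # 239250
print(total_runtime_s)         # 100354065
print(round(total_years, 1))   # 3.2
print(1674750 / total_credit)  # 7.0
```

The totals match the announcement: 435 results at 550 credits each give 239250 BOINC credits, and the WCG Points figure works out to exactly 7 times that.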


Can we assume you will be running this again?

I've got work units aborting all over the place and reporting 0 time.

Thanks.
[Aug 10, 2008 12:08:45 AM]