Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 137
Posts: 137   Pages: 14   [ Previous Page | 4 5 6 7 8 9 10 11 12 13 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 21963 times and has 136 replies Next Thread
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Run dry already?

Sek, As mentioned in my post above, I was manually correcting the issue over the weekend. But I have corrected the issue of the feeder being clogged for FAHB. Members should see a more consistent supply of work. At the moment, we are going to see if we can have 100k results in progress at a single time. So we should start to see over the next 72 hours the number of completed results jump up.

Thanks,
-Uplinger
[Oct 12, 2015 8:55:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Run dry already?

Thanks to gb009761 for the support and to uplinger for your labors!
[Oct 13, 2015 12:27:49 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Speedy51
Veteran Cruncher
New Zealand
Joined: Nov 4, 2005
Post Count: 1290
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Run dry already?


Oh, and noticed my remote when TeamViewing in, has 17 FAHB, 4 of which are with the wcgfahbnnnn sub, 23.5% shock

The tasks you are referring to are they tasks that have not completed the 100,000 steps on the first go? My task names not have a b in them anywhere. I have the following task names
FAH2_ avx101122_ 000075_ 0019_ 007_ 0-- 
I have another 13 tasks like this on my Windows 10 machine.
----------------------------------------

[Oct 13, 2015 5:25:34 AM]   Link   Report threatening or abusive post: please login first  Go to top 
spout23
Advanced Cruncher
Joined: Apr 24, 2012
Post Count: 89
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Run dry already?

Dry again !!

10/15/2015 7:31:53 AM | World Community Grid | Scheduler request completed: got 0 new tasks
10/15/2015 7:31:53 AM | World Community Grid | No tasks sent
10/15/2015 7:31:53 AM | World Community Grid | No tasks are available for FightAIDS@Home - Phase 2
10/15/2015 7:31:53 AM | World Community Grid | No tasks are available for the applications you have selected.
10/15/2015 7:31:56 AM | World Community Grid | work fetch suspended by user
10/15/2015 7:32:05 AM | World Community Grid | work fetch resumed by user

Spout23 sad
[Oct 15, 2015 12:38:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Maxxina
Advanced Cruncher
Joined: Jan 5, 2008
Post Count: 124
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Run dry already?

jezz, there were, are and will be dry times. I dont know why everybody need to report it all the time about it . ( facepalm )
[Oct 15, 2015 8:54:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Reply to this Post  Reply with Quote 
Re: Run dry already?

And since the validator for FAHB has been off since at least 19:29 there are no next generation tasks being generated either, that is if that validation for credit is a requirement for the next gen.
[Oct 15, 2015 9:31:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Run dry already?

Had a hiccup in the validator, it is now catching up.

Spout23, as for it being out of work, we are limiting the total number of results in progress to around 120k. This means that if only 40k get returned in a day, only 40k more will go out. We are hoping to increase this in the future, but since it is still a relatively new project, there are minor issues we are tweaking on the backend.

Thanks,
-Uplinger
[Oct 15, 2015 10:34:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Reply to this Post  Reply with Quote 
Re: Run dry already?

Kay, did you really have to slow down the validator on Thursday, to get many to be recorded on Friday, so you would get closer to your 100K FAHB results in a day [Just kidding, but this morning with 223 years was the highest session total since long long time, 49 years / 30K results for FAHB * ;]

Now I'm going to type this slowly, as to maybe get an answer [3rd try].

I'll do this in double spacing the way Americans and English ** seem to prefer the printed word these days, so maybe they can dump all the 'in between the lines nonsense there, that ain't there.].

Hypothetical: If a result is near it's mature deadline, 96 hours after submitting, and at say 80%, 90%.

Will the next trickle, which puts it -over- the deadline, be allowed to finish in overtime?

This before the next gen result is created.

* That's 14.41 hours average per result. If 60K is being returned today 30K*2, it would imply with the 120K issued at any time [stemming from 1st gen], the 100K step cycle equivalent is back in 2 days. Not shabby.

** Don't know about the Welsh, the Scottish, or the Northern Irish ;]
[Oct 16, 2015 1:25:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Run dry already?

It was not on purpose the slowdown of the validator on Thursday. We had a query inside the validator that was slowing it down, once I modified that query, it caught up quickly. As for getting 100k results returned in a day, that is probably unlikely, since we only have about 120k in progress at any one time. I'm hoping we get about 60k-80k returned daily in a few days.

As for your other question, if a result is at 80% and gets a hard stop, that result is marked for validation at the moment the message was triggered on the server. This means that on your next scheduler request, the result could already be validated and assimilated (ready to send back to the researchers). Once validated, credit is granted.

Something that we have not mentioned in the past, as it could change at any time. The researchers have said, that if a generation gets to 80k, then that is plenty to go onto the next generation. This value may change in the future. Now, there is no reason for members to be stopping work units at the 80k step, as more steps helps the overall science, 80k is considered bare minimum for a generation from the researchers.

Thanks,
-Uplinger
[Oct 16, 2015 2:13:31 PM]   Link   Report threatening or abusive post: please login first  Go to top 
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Reply to this Post  Reply with Quote 
Re: Run dry already?

To expand on my point / query example (others may have expressed the concern differently):

1) Result is at 80 or 90 percent and at 95 hours trickles up
2) Continues on with the 9th or 10th 10K block
3) Deadline is reached at the 96th hour

Will it be soft- or hard-stop instructed?

Is there some call-home maybe that tells client to check in, say every 30 minutes? If hard stop instructed at 96:00 hours, but there's no reconnect until the 97th or 98th hours, some part would go to waste [the feared 9.99%]. Some deadline extension rule would optimize the situation further to make that time of use, even lesser duplication. Maybe there could be code inserted to autonomously stop at 80-90% ** and the next 10K block going over the 96th hour i.e. not wait on the comms to happen.

Anyway, with 4 days standard, running a smaller than 2 day buffer is recommended as else the client goes in EDF state anyhow. So far allow 4-5 tasks in buffer, 2 concurrent app_config, and the 3rd-4th task seem always to be running against the 72 hour clock. All have made it so far, but it's only because I've aborted several before hand as they would not possible have gotten at 70%/72 hours first threshold.

** Similarly, CEP2 goes to 18:00 hours and is hard stopped. Can't there be logic to just skip if by extrapolation of the Nth job time in progress the 18:00 does not lead to completion. Yes we get credit, but the last job running whilst hitting 18:00 is incomplete. Is that piece actually included in the uploaded data?
----------------------------------------
[Edit 1 times, last edit by SekeRob* at Oct 16, 2015 3:06:08 PM]
[Oct 16, 2015 3:01:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 137   Pages: 14   [ Previous Page | 4 5 6 7 8 9 10 11 12 13 | Next Page ]
[ Jump to Last Post ]
Post new Thread