Total posts in this thread: 45
This topic has been viewed 8970 times and has 44 replies.
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Re: And so it begins...

Not all CPUs are equally capable in the FPU/integer unit department, but even after offsetting for cycle efficiency, a 3x speed difference is too steep for a same-platform comparison of CPUs that are not far apart. Sure they're not switching up or down [stepping] due to localized heat spiking [They run ultra cool compared to any WCG science on my laptop's i7-2670QM].
[Oct 1, 2015 4:26:58 PM]
Aperture_Science_Innovators
Advanced Cruncher
United States
Joined: Jul 6, 2009
Post Count: 139
Status: Offline
Re: And so it begins...

What happens to/with computers running FAAH2 on an unstable internet connection? Some of my (quite competent) systems have unreliable internet access.
[Oct 1, 2015 7:13:06 PM]
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Re: And so it begins...

Nothing much happens in the immediate term, per uplinger. At each connect a counter is set, of 3 days or some other rule along the lines of "if no N percent has been completed/reported by sent date plus X, consider the task discontinued and generate a new task from the last good trickle."

Theoretically there could be duplicate steps being computed, but I do not know if the intermittently connecting host's [later] result trickles are acknowledged. For example:

1) Your host reports the 50K trickle
2) Offline computing, trickles accumulating in the transfer queue
3) Offline beyond the critical 'soft stop' point
4) New task generated and sent, covering 50001 to 150000, which starts trickling
5) Original host connects again after 4) and sends more trickles.

The fun starts when WCG goes offline for extended maintenance, which can keep one host or the other from getting the N percent in by the soft stop point.
----------------------------------------
[Edit 1 times, last edit by SekeRob* at Oct 1, 2015 7:32:00 PM]
[Oct 1, 2015 7:28:39 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1930
Status: Offline
Re: And so it begins...

Well, are those FAH2 WUs in short supply? I only got three so far, with two of them starting their trickle by now...
That is with about 30 hosts participating in WCG overall, and MCM currently the only other active WCG project (as I am 100 CPU days/~2 calendar days short of the 20y badge for that one)...
[Oct 1, 2015 10:01:46 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: And so it begins...

On a soft stop, we do not create the next work unit at that point. We let your client stop running at the next checkpoint and return the work unit. Once we get that back, then we create the next work unit.

On a hard stop, we start from the last known spot immediately; we do not give your client the option to complete successfully. If you have some trickle messages returned by this point, you will get credit for those during the validation of the result. Anything returned after validation is unfortunately not used, as another host is already computing those steps.

Thanks,
-Uplinger
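
[Editor's illustration, not part of uplinger's post] A minimal Python sketch of the behavior described above, to make the difference between the two stop types concrete. Every name here (Trickle, credited_trickles, next_work_unit_start, the hour-based timestamps) is hypothetical and chosen for illustration only; this is not WCG server code, and the step numbering simply borrows the 50001 example used earlier in the thread.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Trickle:
    step: int            # last simulation step covered by this trickle
    received_at: float   # hypothetical server receive time, in hours


def credited_trickles(trickles: List[Trickle], validated_at: float) -> List[Trickle]:
    """Assumption: trickles received by validation time earn credit,
    anything arriving later is not used."""
    return [t for t in trickles if t.received_at <= validated_at]


def next_work_unit_start(stop_kind: str, last_good_step: int,
                         result_returned: bool) -> Optional[int]:
    """Return the first step of the follow-on work unit, or None if the
    server would not create it yet."""
    if stop_kind == "hard":
        # Hard stop: restart from the last known spot immediately.
        return last_good_step + 1
    if stop_kind == "soft":
        # Soft stop: wait until the client has stopped at a checkpoint
        # and returned the work unit.
        return last_good_step + 1 if result_returned else None
    raise ValueError(f"unknown stop kind: {stop_kind!r}")
```

Under these assumptions, a host whose 50K trickle arrived before validation keeps credit for it, while a soft-stopped task produces no follow-on work unit until its result comes back.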
[Oct 1, 2015 10:08:41 PM]
deltavee
Ace Cruncher
Texas Hill Country
Joined: Nov 17, 2004
Post Count: 4843
Status: Recently Active
Re: And so it begins...

Well, are those FAH2 WUs in short supply? I only got three so far, with two of them starting their trickle by now...
That is with about 30 hosts participating in WCG overall, and MCM currently the only other active WCG project (as I am 100 CPU days/~2 calendar days short of the 20y badge for that one)...

This is a low-priority project. If you have it selected together with MCM1, you will get almost exclusively MCM1 work units.
https://secure.worldcommunitygrid.org/forums/...ad,38451_offset,40#503549
----------------------------------------
[Edit 1 times, last edit by deltavee at Oct 2, 2015 12:40:09 AM]
[Oct 2, 2015 12:38:52 AM]
KLiK
Master Cruncher
Croatia
Joined: Nov 13, 2006
Post Count: 3108
Status: Offline
Re: And so it begins...

Yesterday I installed one more machine with Ubuntu... lots more FAHB available on Ubuntu, because of the smaller number of machines!

Also, why are those deadlines so short? All my WUs crunch in "high priority" because of that, on both the Win & Linux variants!
----------------------------------------
oldies:UDgrid.org & PS3 Life@home


non-profit org. Play4Life in Zagreb, Croatia
[Oct 2, 2015 9:09:35 AM]
asdavid
Veteran Cruncher
FRANCE
Joined: Nov 18, 2004
Post Count: 521
Status: Offline
Re: And so it begins...

On a soft stop, we do not create the next work unit at that point. We let your client stop running at the next checkpoint and return the work unit. Once we get that back, then we create the next work unit.

On a hard stop, we start from the last known spot immediately; we do not give your client the option to complete successfully. If you have some trickle messages returned by this point, you will get credit for those during the validation of the result. Anything returned after validation is unfortunately not used, as another host is already computing those steps.

Thanks,
-Uplinger



When are soft and hard stops used?
----------------------------------------
Anne-Sophie

[Oct 2, 2015 10:26:27 AM]
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Re: And so it begins...

A Soft Stop occurs if the minimum number of trickles has not been returned by a certain percentage of the total allowed deadline time. It's good to know the process actually waits for that next trickle before a follow-on task is generated... minimizing the chance of duplication.

A Hard Stop occurs when a trickle is bad. Any additional trickle is also considered bad [which is logical: once the trajectory goes off the narrow path, the following steps will be off too].
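
[Editor's illustration] The two trigger conditions above can be sketched as a small Python function. This is only a sketch under stated assumptions: the function name, parameter names and the example threshold values are all hypothetical, since the thread does not give the real minimum trickle count or deadline percentage.

```python
from typing import Optional


def classify_stop(trickles_returned: int,
                  min_trickles: int,
                  elapsed_fraction: float,
                  soft_stop_fraction: float,
                  bad_trickle_seen: bool) -> Optional[str]:
    """Return "hard", "soft", or None (task carries on).

    elapsed_fraction and soft_stop_fraction are fractions of the total
    allowed deadline time (hypothetical parameters, values not given
    in the thread).
    """
    if bad_trickle_seen:
        # A bad trickle invalidates everything computed after it.
        return "hard"
    if elapsed_fraction >= soft_stop_fraction and trickles_returned < min_trickles:
        # Too few trickles reported by the cut-off point.
        return "soft"
    return None
```

For example, with a made-up cut-off at 50% of the deadline and a minimum of 2 trickles, a host that has reported only 1 trickle at the 60% mark would be classified as a soft stop, while any host that reports a bad trickle is a hard stop regardless of timing.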
----------------------------------------
[Edit 1 times, last edit by SekeRob* at Oct 2, 2015 10:35:05 AM]
[Oct 2, 2015 10:33:52 AM]
cjslman
Master Cruncher
Mexico
Joined: Nov 23, 2004
Post Count: 2082
Status: Offline
Re: And so it begins...

Well, I finally finished (valid) my first FAH2 WU: FAH2_avx17285-ls_000083_0005_001_0. The estimated time was over 18 hours, but it finished in 16 hours (on a Win i5). I just wanted to process one of these puppies to see what it looked like. Now back to chasing after the sapphire on OET. I'll come back later and try to put a dent in FAH2.

CJSL

Crunching for a brighter future...
----------------------------------------
I follow the Gimli philosophy: "Keep breathing. That's the key. Breathe."
Join The Cahuamos Team


[Oct 2, 2015 9:05:17 PM]