World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: DDD2 Type B work units going out. |
Thread Status: Active Total posts in this thread: 369
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1313 Status: Offline Project Badges: |
Returned one:
BETA_ erlc_ a138_ pe0000_ 0-- 613 Pending Validation 15-1-10 23:17:09 16-1-10 07:17:50 7.62 131.5 / 0.0
Ran without interruption or restart during the job. 2 wingmen in progress. |
||
|
sk..
Master Cruncher http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif Joined: Mar 22, 2007 Post Count: 2324 Status: Offline Project Badges: |
Got 1 Beta and finished it in just over 5 hours.
For some reason, although Boinc uploaded it (07:00 GMT - UK time), it did not report it or any other tasks. It uploaded at least one WCG task subsequent to that. When I clicked on Update it did report it and the other tasks waiting to be uploaded. The wingmen finished in 6.5 and 9h, and managed to report before me. I am awaiting validation (Edit - now Validated).
Changed "connect about every" from 0.01 days to 0.10 days, but the 0.01 days setting seems to work fine on my other systems.
System details:
- Boinc Version 6.10.18, Win7 Pro 64bit
- Internet is always connected.
- Q9400 CPU
- Drive has 100GB free. Boinc was set to use up to 20GB, and leave 5GB free, so no obvious issues there. I have now allowed up to 50GB, just in case.
- 4GB RAM, using less than 1.7GB.
I did suspend one WCG task so that the Beta would run, and forgot to enable it again. Perhaps that was an issue? However, even when the system finished all other tasks, and had no work, it did not pick up any new WCG tasks; it did not even ask for new WCG tasks. New Project tasks were allowed and no web changes have been made in the last 24h. The only other Boinc project on the systems is GPUGrid, which did pick up new tasks.
[Edit 2 times, last edit by skgiven at Jan 16, 2010 2:07:14 PM] |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
If there was an intermittent condition right when a transaction takes place with the servers, the defer counter kicks in and increments at each failing retry... and then sometimes the file upload gets stuck. That's why there is the Retry Now button, for instance. :D
Suspending a task blocks work fetching for the related project. It also delays the client's urge to contact the servers, so it does not rush to clear the Ready To Report tasks, since the client tries to combine these with work fetches... and then it will wait up to 24 hours to seek contact. Now, look in the mirror and ask "whose fault was this?"
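To make the defer counter idea above a bit more concrete, here is a minimal sketch in Python. It assumes a simple doubling delay with a cap; the function name, the 60 second base and the 24 hour cap are made up for illustration and are not taken from the BOINC client, whose real schedule may well differ.

```python
def next_defer_seconds(failed_retries, base_seconds=60, cap_seconds=24 * 3600):
    """Delay before the next retry, given the number of consecutive failures.

    Hypothetical model: the delay doubles on each failing retry and is
    capped. The base and cap values are illustrative assumptions only.
    """
    if failed_retries <= 0:
        return 0  # nothing has failed yet, so nothing is deferred
    return min(base_seconds * 2 ** (failed_retries - 1), cap_seconds)


if __name__ == "__main__":
    # Show how the deferral grows over the first few failing retries.
    for n in range(6):
        print(n, next_defer_seconds(n))
```

In this model, pressing Retry Now would simply skip whatever delay is still pending.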
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Jan 16, 2010 2:38:39 PM] |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Reported the first 2, now in PV, no wingmen in sight,
BETA_ erlc_ a049_ pe0000_ 1-- 95711 Pending Validation 1/15/10 23:02:09 1/16/10 14:41:04 14.03 150.3 / 0.0
BETA_ erlc_ a048_ pe0000_ 0-- 95711 Pending Validation 1/15/10 23:01:49 1/16/10 14:40:42 13.89 148.9 / 0.0
The last 2 BETA jobs I forced to start, then suspended and resumed them; they are now waiting to run so that 2 suspended rice jobs can finish, and to simulate an interruption. If they succeed, another piece is then known to be properly identified during validation [if bits of this activity get included in the result log or the result files].
||
|
sk..
Master Cruncher http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif Joined: Mar 22, 2007 Post Count: 2324 Status: Offline Project Badges: |
Thanks. Next time I will suspend and then immediately resume!
O/T @Boinc developers and the WCG Techs
It was no more User error than a Boinc/WCG failure to communicate! There was no warning message to say that WCG communication would be stopped after suspending the task: No Text Tip, and nothing in Messages; it just said Task Suspended By User - no mention of communications being suspended.
It is unreasonable to presume that suspending one single task should inherently lead a user to think that all work would be stopped because networking would be stopped. If I wanted to stop networking I can do this, and if I wanted to suspend all WCG tasks I would naturally do this under Projects rather than Tasks.
The bottom line is that the change I made (suspending a single task) also made an unintuitive change that resulted in the WCG losing half a day's work. It is an unwanted and unhelpful Feature! |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
One might also use the logic that, as YOU had suspended a task, you still had work and chose not to do it, and so did not need any more (as opposed to aborting a task). Also, if you suspend/resume to move a task up and you then reboot or stop/restart the manager, it will resume the oldest tasks first.
[Edit 1 times, last edit by Former Member at Jan 16, 2010 5:01:20 PM] |
||
|
Hypernova
Master Cruncher Audaces Fortuna Juvat ! Vaud - Switzerland Joined: Dec 16, 2008 Post Count: 1908 Status: Offline Project Badges: |
I got one which ended in bad shape:
BETA_ erlc_ a209_ pe0000_ 1-- 613 Server Aborted 15.01.10 23:29:03 16.01.10 15:14:11 0.00 0.0 / 0.0
I checked under Server Aborted and got:
""""""""""""""
Result Name: BETA_ erlc_ a209_ pe0000_ 1--
<core_client_version>6.10.18</core_client_version>
""""""""""""""
The two other mirror copies below were validated by wingmen. Here are the three, including mine:
BETA_ erlc_ a209_ pe0000_ 0-- 613 Valid 15.01.10 23:29:12 16.01.10 11:43:38 7.91 115.9 / 186.5
BETA_ erlc_ a209_ pe0000_ 1-- 613 Server Aborted 15.01.10 23:29:03 16.01.10 15:14:11 0.00 0.0 / 0.0
BETA_ erlc_ a209_ pe0000_ 2-- 613 Valid 15.01.10 23:28:58 16.01.10 14:03:14 8.07 186.5 / 186.5
Why was that? Any clue? I have another one In progress, hope it will be ok. |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
@Boinc developers and the WCG Techs It was no more User error than a Boinc/WCG failure to communicate! There was no warning message to say that WCG communication would be stopped after suspending the task: No Text Tip, and nothing in Messages; it just said Task Suspended By User - no mention of communications being suspended. It is unreasonable to presume that suspending one single task should inherently lead a user to think that all work would be stopped because networking would be stopped. If I wanted to stop networking I can do this, and if I wanted to suspend all WCG tasks I would naturally do this under Projects rather than Tasks. The bottom line is that the change I made (suspending a single task) also made an unintuitive change that resulted in the WCG losing half a day's work. It is an unwanted and unhelpful Feature!
You're addressing people at the wrong forum > go here http://boinc.berkeley.edu/dev/index.php, where they, the developers, don't listen either... they even advertise that, so they employ gophers [volunteers like me] to carry problems back and forth.
The task suspend > no work fetch has been around for as long as I can remember... at least since the additional buffer function was introduced. Why send you work if you suspended work in the first place? You made the choice! And result files from other running tasks will still be uploaded.
But I'd agree that if a job is suspended, a status change should show in the project tab, such as "work fetch suspended [indefinitely], because manual task holds are active" or something like that. And a task suspend could e.g. be followed by an additional line in the message log such as "until all project tasks are allowed to process, work fetch is suspended"... but as it is, BOINC was designed for techno hobbyists and not for the masses. I've posted that statement a few times at the developers forum; I consider it one of the root causes of the low retention (WCG's is, though, quite a bit higher than any other project's, due to hard work.)
Anyway, we have several threads in BOINC forum and Chat Room to plant ideas... here in a Beta test thread it's near guaranteed to get lost or forgotten.
PS: And I did not say communications were stopped or suspended. They are delayed, up to 24 hours, where a secondary counter even forces a server check-in every 48 hours (was 72). If computing in general is suspended, then networking does get suspended. There are good reasons for that, such as to facilitate backups and restores without edit: Running backups whilst the core client is loaded is anyhow a bad idea, but a restore of such a backup might actually produce a stable resume point. Dicey.
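For anyone trying to picture the rules described in this post, here is a small Python sketch of them. It only models what is stated above (a held task blocks work fetch, uploads continue, reporting rides along with a work fetch or happens after the delay, and a general suspend also stops networking). The class, field and function names are hypothetical and this is not how the BOINC client is actually written.

```python
from dataclasses import dataclass

REPORT_DELAY_HOURS = 24  # "up to 24 hours", as described in the post above


@dataclass
class ProjectState:
    computing_suspended: bool    # computing in general suspended by the user
    has_suspended_task: bool     # at least one task of the project is on hold
    hours_since_contact: float   # time since the last scheduler contact


def allowed_actions(state):
    """Return which network actions this simplified model would permit."""
    if state.computing_suspended:
        # A general suspend also suspends networking.
        return {"upload": False, "work_fetch": False, "report": False}
    # A manually held task blocks work fetch for the whole project.
    work_fetch = not state.has_suspended_task
    # Reporting is combined with a work fetch, or forced once the delay expires.
    report = work_fetch or state.hours_since_contact >= REPORT_DELAY_HOURS
    return {"upload": True, "work_fetch": work_fetch, "report": report}


if __name__ == "__main__":
    # One task held for 7 hours: uploads continue, no fetch, no report yet.
    print(allowed_actions(ProjectState(False, True, 7.0)))
```

Under these assumptions, a single held task leaves finished results uploaded but unreported until either the hold is lifted or the delay runs out, which matches what was seen earlier in the thread.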
[Edit 2 times, last edit by Sekerob at Jan 16, 2010 5:47:29 PM] |
||
|
USAFA 82
Veteran Cruncher Colorado Springs, Colorado Joined: Jan 20, 2005 Post Count: 1001 Status: Offline Project Badges: |
I actually snagged one! Weeeeee! BETA_ erlc_ a111_ pe0000_ 0 Hello wingman BETA_ erlc_ a111_ pe0000_ 1 4,5 hours to go. Hello wingman! Our third wingman is still in progress. |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Your second wingman could get an instruction to return to base... abort mission, if the first 2 in the quorum hit target! All depends on whether the first rounds were fired or not :-]
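In plainer terms, the "return to base" rule can be sketched like this in Python. It is a simplified reading of the post above, not the server's actual logic: the function name and return strings are invented, and the quorum of 2 is taken from the "first 2 in the quorum" wording.

```python
def extra_copy_action(valid_results, quorum, extra_copy_started):
    """Decide what happens to a copy beyond the quorum (hypothetical model).

    If enough results have already validated and the extra copy has not
    begun crunching, the server can abort it; otherwise it keeps running.
    """
    if valid_results >= quorum and not extra_copy_started:
        return "server_abort"
    return "keep_running"


if __name__ == "__main__":
    # Two valid results, third copy never started: it gets aborted.
    print(extra_copy_action(valid_results=2, quorum=2, extra_copy_started=False))
```

That would fit a copy showing Server Aborted with 0.00 hours of run time while the other two are Valid.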
||
|
|