World Community Grid Forums
Category: Completed Research Forum: AfricanClimate@Home Thread: Is normal WU crunching for almost 70 hours and ask for more 32 hours at least ? |
Thread Status: Active Total posts in this thread: 29
|
GIBA
Ace Cruncher Joined: Apr 25, 2005 Post Count: 5374 Status: Offline |
I just checked: the CPU is running at 2 GHz and working well. Many other WUs were crunched on the second processor during all of this time, including two others from AC@H that were returned with an average time of 8.2 hours. Here is the status of the other related WUs from my results status page:

Workunit Status
Project Name: AfricanClimate@Home
Created: 03/21/2008 02:04:59
Name: ach1_25_12
Minimum Quorum: 10
Initial Replication: 10
The large number of copies sent out for this workunit is due to the unique nature of this project. We encourage you to read the FAQs about this project for more information.

Result Name      Status              Sent Time            Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
ach1_25_12_13--  In Progress         03/24/2008 05:01:19  03/26/2008 05:01:19     0.00              0.0 / 0.0
ach1_25_12_12--  Pending Validation  03/23/2008 21:45:46  03/24/2008 12:47:29     5.96              85.2 / 0.0
ach1_25_12_11--  No Reply            03/21/2008 21:40:46  03/23/2008 21:40:46     0.00              0.0 / 0.0
ach1_25_12_10--  Error               03/21/2008 11:35:48  03/21/2008 21:39:57     6.06              57.9 / 0.0
ach1_25_12_0--   In Progress         03/21/2008 02:07:20  03/31/2008 02:07:20     0.00              0.0 / 0.0
ach1_25_12_2--   Pending Validation  03/21/2008 02:07:12  03/21/2008 11:33:49     4.80              79.0 / 0.0
ach1_25_12_9--   In Progress         03/21/2008 02:06:05  03/31/2008 02:06:05     0.00              0.0 / 0.0
ach1_25_12_8--   In Progress         03/21/2008 02:05:56  03/31/2008 02:05:56     0.00              0.0 / 0.0
ach1_25_12_7--   In Progress         03/21/2008 02:05:56  03/31/2008 02:05:56     0.00              0.0 / 0.0
ach1_25_12_5--   Pending Validation  03/21/2008 02:05:47  03/23/2008 21:24:31     7.73              77.3 / 0.0
ach1_25_12_3--   Error               03/21/2008 02:05:32  03/24/2008 04:59:34     71.46             1,252.1 / 0.0
ach1_25_12_4--   Error               03/21/2008 02:05:28  03/21/2008 11:35:17     7.07              88.4 / 0.0
ach1_25_12_1--   In Progress         03/21/2008 02:05:28  03/31/2008 02:05:28     0.00              0.0 / 0.0
ach1_25_12_6--   In Progress         03/21/2008 02:05:22  03/31/2008 02:05:22     0.00              0.0 / 0.0

Most are in progress, but there are errors, a no reply, and two that finished in a short time, plus one with errors that was crunched for 71 hours... I think this WU has problems. I hate to abort any WU, but I am not sure this huge amount of work will be compensated... since it is an old WU (25th cycle; I would like to hear from the techs whether this WU was distributed before and maybe aborted...). It is a hard decision to just give up on 90 hours of work... Regards and thanks, Sek.
Cheers ! GIB@
----------------------------------------Join BRASIL - BRAZIL@GRID team and be very happy ! http://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=DF99KT5DN1 [Edit 1 times, last edit by GIBA at Mar 24, 2008 9:49:13 PM] |
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Suspend the job, GIBA, and continue with the others. The ach1_25_xx batch is quite new, and seeing a number of results with 5-6 hours of computing time, it seems like the problem is client related.
This sample (71.46 CPU hours, 1,252.1 claimed) is not going to get that credit, even if valid, when all the others are claiming in the 70-90 range.
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All! |
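Sekerob's point about the 71.46-hour result can be checked with a few lines of Python. This is only a sketch: the numbers are copied from the status table earlier in the thread, while `flag_outliers` and its `factor=3.0` cutoff are hypothetical names and values chosen for illustration, not the rule the WCG validator actually applies.

```python
from statistics import median

# (result name, CPU hours, claimed credit), copied from the status table
# in this thread; only results that reported CPU time are listed.
results = [
    ("ach1_25_12_12", 5.96, 85.2),
    ("ach1_25_12_10", 6.06, 57.9),
    ("ach1_25_12_2", 4.80, 79.0),
    ("ach1_25_12_5", 7.73, 77.3),
    ("ach1_25_12_3", 71.46, 1252.1),
    ("ach1_25_12_4", 7.07, 88.4),
]


def flag_outliers(rows, factor=3.0):
    """Return names whose claimed credit exceeds factor times the median claim.

    The factor is a hypothetical illustration, not a WCG rule.
    """
    typical = median(claim for _, _, claim in rows)
    return [name for name, _, claim in rows if claim > factor * typical]


outliers = flag_outliers(results)
print(outliers)
```

With the thread's data, only the 1,252.1 claim lands far above the median of its peers, which all sit below 100.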
||
|
GIBA
Ace Cruncher Joined: Apr 25, 2005 Post Count: 5374 Status: Offline |
Thank you, Sek, I will do it.
||
|
[B-S] Gamma-Ray
Cruncher Joined: Feb 27, 2008 Post Count: 24 Status: Offline |
Yep, I would definitely suspend it, at least at this point, going by how long the other PCs have been running it. As was said, the others either finished around the 5 to 8 hour mark or went over 70 hours, and either way they errored out. Best to cut your time losses when you can, although losing that much time really is sad.
In the future I would keep an eye on any others you may (hopefully) get, and if they go over the 10 hour mark, really watch for it happening again. The only problem now with having it sit there suspended is that it will probably keep you from getting another ACH work unit until it's been aborted and sent back. As for the High Priority status in the client, that's typical for any project that is fast approaching its deadline, so the client will usually ignore the time swaps with other projects and stay focused on that particular one until it's done. Good Luck! G^R |
||
|
GIBA
Ace Cruncher Joined: Apr 25, 2005 Post Count: 5374 Status: Offline |
Thanks, GR.
It was one of my first AC@H WUs. As a beginner in this project, having read some threads about huge WUs and the hard work needed to complete this modeling, and also seeing that the WU was progressing (slowly, it's true), my first feeling was to keep quiet and carpe diem... Now I know that for future AC@H WUs I need to pay more attention... Anyway, I hope the techs take a look at that WU and fix it so that the next colleagues here have the opportunity to run a fair one... Regards. Giba.
||
|
GIBA
Ace Cruncher Joined: Apr 25, 2005 Post Count: 5374 Status: Offline |
Just for reference:
I just checked, and all the WUs related to it finished in error or with some kind of problem... I really hope the time invested by all ten crunchers has some value for the scientists or techs... Regards.
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Some 'error' results have changed status after a second validation pass. I presume a new set of 10 was sent out, but if the 'errors' exceed a certain number **, the WU is taken out of circulation for investigation.
** I think for regular projects the number was 7, but with initial sets of 10, that could be different.
[Edit 1 times, last edit by Sekerob at Mar 31, 2008 8:01:04 AM] |
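The withdrawal rule Sekerob describes could look roughly like the sketch below. It assumes a plain count of 'Error' results compared against a limit; the function name `should_withdraw` is hypothetical, and the default of 7 is only the figure Sekerob recalls for regular projects, which he notes may differ for initial sets of 10.

```python
from collections import Counter


def should_withdraw(statuses, max_errors=7):
    """Return True when the number of 'Error' results exceeds max_errors.

    max_errors=7 is the value recalled in the thread, not a confirmed WCG setting.
    """
    return Counter(statuses)["Error"] > max_errors


# Statuses of ach1_25_12 as posted earlier in the thread (14 results).
snapshot = (
    ["In Progress"] * 7
    + ["Pending Validation"] * 3
    + ["No Reply"]
    + ["Error"] * 3
)

print(should_withdraw(snapshot))        # 3 errors, below the assumed limit
print(should_withdraw(["Error"] * 10))  # hypothetical case: a whole set of 10 fails
```

Under these assumptions the snapshot from the first post would stay in circulation, while a set in which every copy failed would be pulled.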
||
|
GIBA
Ace Cruncher Joined: Apr 25, 2005 Post Count: 5374 Status: Offline |
Hi all,
In the last few hours a crazy thing happened on one of my computers... I received some AC@H WUs, and a few minutes later all of them, plus one more WU (ach1_25_15_3) that was nearly finished, were automatically returned to WCG as errors. See below:

Result Name      Device Name  Status  Sent Time            Time Due / Return Time  CPU Time (hours)  Claimed / Granted BOINC Credit
ach1_27_53_8--   GIBAT61      Error   03/31/2008 08:28:44  03/31/2008 08:37:33     0.00              0.0 / 0.0
ach1_27_8_0--    GIBAT61      Error   03/31/2008 08:27:27  03/31/2008 08:37:33     0.00              0.0 / 0.0
ach1_27_77_5--   GIBAT61      Error   03/31/2008 08:27:27  03/31/2008 08:37:33     0.00              0.0 / 0.0
ach1_27_75_7--   GIBAT61      Error   03/31/2008 08:27:27  03/31/2008 08:37:33     0.00              0.0 / 0.0
ach1_27_6_9--    GIBAT61      Error   03/31/2008 08:27:27  03/31/2008 08:37:33     0.00              0.0 / 0.0
ach1_27_5_8--    GIBAT61      Error   03/31/2008 08:27:26  03/31/2008 08:37:33     0.00              0.0 / 0.0
ach1_25_15_3--   GIBAT61      Error   03/31/2008 01:48:27  03/31/2008 08:37:33     0.00              0.0 / 0.0

Nothing happened in my infrastructure here, and some other WUs in ready-to-start status, from DDDT and FAAH, were returned together with these AC@H ones on this machine. I don't know what happened, but I checked the status of some of these WUs for my WU mates, and the same problem occurred (returned as errors a few minutes after download...). Rgds. Giba.
||
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Anything in the BOINC message log? WCG can remotely cancel redundant work if it appears it is not needed or has, e.g., a known error.
||
|
GIBA
Ace Cruncher Joined: Apr 25, 2005 Post Count: 5374 Status: Offline |
The logs (of all WUs) are completely empty... unfortunately.
||
|
|