| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 115
|
|
| Author |
|
|
adrianxw
Senior Cruncher Denmark Joined: Apr 13, 2008 Post Count: 196 Status: Offline Project Badges:
|
All the wu's today have crashed at the end of computation. WCG suspended for the moment. Example. Failures on different machines and wingman failures also visible.
----------------------------------------<core_client_version>6.6.20</core_client_version> <![CDATA[ <stderr_txt> Calling initGraphics() INFO: No state to restore. Start from the beginning. called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>E000765_042C_003e0620m_0_0</file_name> <error_code>-131</error_code> </file_xfer_error> </message> ]]> [Edit 2 times, last edit by adrianxw at Jun 27, 2009 7:23:00 AM] |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
hmmmm, why do you get that on CEP and I don't and everyone else doesn't? (Lone report)
----------------------------------------ERR_FILE_TOO_BIG -131 One of the output files is bigger than the maximum set by the project for upload. BOINC will not try to upload this file. Solution: Go to the project's forums and report this behavior. Got any quorum detail info? Booted lately? Can we see the start up log of BOINC? Also a piece of the BOINC message log where the job fails (stdoutdae.txt file for old message info)? My last 15 CEP results from different machines: E000758_ 972C_ 00670420c_ 1-- 849476 Pending Validation 6/24/09 15:31:57 6/27/09 03:29:43 7.90 129.8 / 0.0 E000758_ 858C_ 003c0080u_ 0-- 849476 Valid 6/24/09 14:53:48 6/27/09 01:41:51 8.04 132.0 / 128.0 E000761_ 045C_ 003c00502_ 1-- 95711 Pending Validation 6/25/09 07:42:59 6/26/09 23:19:47 12.01 124.3 / 0.0 E000758_ 645C_ 006v05511_ 1-- 849476 Pending Validation 6/24/09 13:43:29 6/26/09 18:59:20 7.39 105.7 / 0.0 E000758_ 625C_ 003z05500_ 0-- 849476 Valid 6/24/09 13:42:57 6/26/09 16:59:03 8.22 136.7 / 132.8 E000758_ 649C_ 006v05911_ 0-- 849476 Valid 6/24/09 13:42:57 6/26/09 10:51:41 7.38 122.7 / 115.2 E000758_ 648C_ 006v05811_ 1-- 849476 Valid 6/24/09 13:42:55 6/26/09 09:56:34 7.31 121.5 / 112.1 E000760_ 045C_ 006w0550w_ 1-- 95711 Valid 6/24/09 22:47:37 6/26/09 06:38:37 10.99 115.4 / 131.1 E000758_ 631C_ 003d09111_ 0-- 849476 Pending Validation 6/24/09 13:42:53 6/26/09 02:42:42 7.90 131.3 / 0.0 E000758_ 640C_ 006v05011_ 0-- 849476 Pending Validation 6/24/09 13:42:49 6/26/09 01:54:26 7.30 121.4 / 0.0 E000758_ 624C_ 003z05400_ 1-- 849476 Valid 6/24/09 13:40:46 6/25/09 17:56:41 8.17 135.7 / 137.9 E000758_ 639C_ 003d09911_ 1-- 849476 Valid 6/24/09 13:42:47 6/25/09 17:45:57 7.96 132.2 / 128.3 E000758_ 815C_ 006z0750z_ 1-- 95711 Valid 6/24/09 14:41:59 6/25/09 15:41:43 11.44 120.2 / 125.5 E000758_ 626C_ 003z05600_ 0-- 849476 Pending Validation 6/24/09 13:40:24 6/25/09 07:24:01 7.97 132.5 / 0.0 E000758_ 642C_ 006v05211_ 0-- 849476 Valid 6/24/09 13:40:24 6/25/09 06:35:48 7.10 117.9 / 104.5
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
adrianxw
Senior Cruncher Denmark Joined: Apr 13, 2008 Post Count: 196 Status: Offline Project Badges:
|
The example I linked to shows the wu failing by another cruncher after running.
Machines are typically booted once a week, don't recall when these ones were last done, usually follows the arrival of Windows updates. From Messages. 27/06/2009 05:40:19 World Community Grid Computation for task E000765_521C_005v0610z_1 finished 27/06/2009 05:40:19 World Community Grid Output file E000765_521C_005v0610z_1_0 for task E000765_521C_005v0610z_1 exceeds size limit. 27/06/2009 05:40:19 World Community Grid File size: 18114921.000000 bytes. Limit: 15000000.000000 bytes 27/06/2009 05:40:20 World Community Grid [sched_op_debug] Deferring communication for 1 min 0 sec 27/06/2009 05:40:20 World Community Grid [sched_op_debug] Reason: Unrecoverable error for result E000765_521C_005v0610z_1 (<file_xfer_error> <file_name>E000765_521C_005v0610z_1_0</file_name> <error_code>-131</error_code></file_xfer_error>) I have six wu's like that now. Where is the other item you asked for located? |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Quorum details:
On My Grid > Result Status:Check & copy paste for the errorring wu's how your wingmen have been doing for that workunit Startup log: You could extract this from the stdoutdae.txt file in the Boinc data folder, but it might be easier to restart Boinc. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
We are not allowed to see result details of others at WCG. Am interested to see the sent dates. Mine are all 758 and a few 760 completed just now, yours for batch 765, that much I can see. Have a 764 in the queue so will pull it ahead with suspend/release to see what it does today.
----------------------------------------
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Jun 27, 2009 8:13:40 AM] |
||
|
|
adrianxw
Senior Cruncher Denmark Joined: Apr 13, 2008 Post Count: 196 Status: Offline Project Badges:
|
One of the failing wu's is a 766, the rest are 765. I have several "Pending" and a "Valid" 764.
----------------------------------------<edit> Do you think it is worth holding on to the suspended wu's with the thought that the allowable size will get increased? They are 766's if that helps. <another edit> Can now see another of the wu's has been failed out by a wingman. [Edit 4 times, last edit by adrianxw at Jun 27, 2009 9:39:24 AM] |
||
|
|
Ardruin
Cruncher Germany Joined: Dec 3, 2007 Post Count: 21 Status: Offline Project Badges:
|
Hello,
----------------------------------------my computer crash every task since the last day. I lost 33h computation time - the error beginn with the end of computation.... This is the Log: 27.06.2009 10:38:47 World Community Grid Computation for task E000766_288C_003e07806_0 finished 27.06.2009 10:38:47 World Community Grid Output file E000766_288C_003e07806_0_0 for task E000766_288C_003e07806_0 exceeds size limit. 27.06.2009 10:38:47 World Community Grid File size: 33266086.000000 bytes. Limit: 15000000.000000 bytes 27.06.2009 11:48:40 World Community Grid Computation for task E000766_345C_003e01500_0 finished 27.06.2009 11:48:40 World Community Grid Output file E000766_345C_003e01500_0_0 for task E000766_345C_003e01500_0 exceeds size limit. 27.06.2009 11:48:40 World Community Grid File size: 32931748.000000 bytes. Limit: 15000000.000000 bytes 27.06.2009 11:51:08 World Community Grid Computation for task E000766_365C_00300350y_0 finished 27.06.2009 11:51:08 World Community Grid Output file E000766_365C_00300350y_0_0 for task E000766_365C_00300350y_0 exceeds size limit. 27.06.2009 11:51:08 World Community Grid File size: 17726492.000000 bytes. Limit: 15000000.000000 bytes My computer ist a Intel Q9450 @3,2 GHz, 4 GB DDR2-1066, Win Vista 64bit. PrimeStable. Bye Markus |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Thanks for confirming. I've already placed a note in the Back Room for the techs to take this on. Follow Up is though still a hours away as it's night time where they are.
----------------------------------------I'll let the 764 job run and see if it also suffers this issue, but since it's been sitting in queue for a good day, doubt it and would then have expected reports to come in before. Thanks for your patience.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
mito7
Advanced Cruncher Slovakia Joined: Oct 12, 2008 Post Count: 58 Status: Offline Project Badges:
|
One of my CEP WU encourted same error (too big file) as mentioned above. My first of batch 765. I have 7 other in queue. Seems like 765 (766) batches have this problem. I report here how other WUs will end.
----------------------------------------WUs: E000765_815C_002x0050l error E000765_817C_002x0070l error E000765_821C_003e0610d computing (stopped until techs say what to do) E000765_824C_003e0640d computing (stopped until techs say what to do) E000765_921C_002y03107 waiting (stopped until techs say what to do) E000765_929C_002y03907 computing (stopped until techs say what to do) E000765_926C_002y03607 waiting (stopped until techs say what to do) E000765_934C_002y07405 waiting (stopped until techs say what to do) Edit: Names of WUs EDIT: Client 5.10.45, OS WinXP SP3 ![]() [Edit 5 times, last edit by mito7 at Jun 27, 2009 6:06:13 PM] |
||
|
|
Greg Lyke
Advanced Cruncher Joined: May 30, 2008 Post Count: 50 Status: Offline |
As there seems to be a problem with batch 765 (& maybe 766), would it be a good idea to temporarily suspend crunching on CEP & move those resources to a different project? (by changing the profile to something else, suspend all pending 765s & then increasing the cache size)
----------------------------------------I have a (dual core) computer dedicated to CEP at the moment that is just starting on it's 765s (both under 3% done with 6 more 765s in the queue). While I don't particularly want to stop working on CEP at the moment (chasing that silly gold badge ), I also don't want my computer's work time to be wasted if I could switch to something else while this issue is resolved (or if you need more 765 test batches I could let them go & report back much later today/early tomorrow).If it's just a matter of the back room people increasing the size of the files that they will accept & then resending the information then my switch to a different project idea isn't needed. However if that time & effort is just going to be thrown out, I would rather devote my resources to a project that is running smoothly at the moment. Edit: I just noticed that a different machine picked up a couple of batch 767s. If it would be useful I can push them to the front of the queue & see if the same problem exists there as well. They however have an estimated completion time of 21 hours so it will take a while to get any results back. [Edit 1 times, last edit by Greg Lyke at Jun 27, 2009 12:59:41 PM] |
||
|
|
|