Total posts in this thread: 115
This topic has been viewed 21723 times and has 114 replies
adrianxw
Senior Cruncher
Denmark
Joined: Apr 13, 2008
Post Count: 196
Status: Offline
All wu's crashing at the end.

All of today's WUs have crashed at the end of computation. I have suspended WCG for the moment. Example. Failures on different machines and wingman failures are also visible.

<core_client_version>6.6.20</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
called boinc_finish

</stderr_txt>
<message>
<file_xfer_error>
<file_name>E000765_042C_003e0620m_0_0</file_name>
<error_code>-131</error_code>
</file_xfer_error>

</message>
]]>
----------------------------------------
[Edit 2 times, last edit by adrianxw at Jun 27, 2009 7:23:00 AM]
[Jun 27, 2009 7:18:52 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: All wu's crashing at the end.

hmmmm, why do you get that on CEP and I don't and everyone else doesn't? (Lone report)
ERR_FILE_TOO_BIG -131

One of the output files is bigger than the maximum set by the project for upload.
BOINC will not try to upload this file.

Solution: Go to the project's forums and report this behavior.
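For anyone who wants to see the rule above in concrete form, here is a minimal sketch of that kind of size check. It is not BOINC's actual code: the function name `check_output_size` and the parameter `limit_bytes` are made up for illustration, and the strict "greater than" comparison is an assumption based on the "exceeds size limit" wording in the client messages. Only the error code -131 (ERR_FILE_TOO_BIG) comes from this thread.

```python
import os
import tempfile

# Error code quoted in this thread for an oversized output file.
ERR_FILE_TOO_BIG = -131


def check_output_size(path, limit_bytes):
    """Return 0 if the file fits within limit_bytes, else ERR_FILE_TOO_BIG.

    Hypothetical helper: the real client logic may differ in detail.
    """
    size = os.path.getsize(path)
    if size > limit_bytes:  # assumption: strictly larger counts as too big
        return ERR_FILE_TOO_BIG
    return 0


if __name__ == "__main__":
    # Write a small throwaway file and check it against two limits.
    with tempfile.TemporaryDirectory() as tmp:
        sample = os.path.join(tmp, "result_0_0")
        with open(sample, "wb") as fh:
            fh.write(b"x" * 20)
        print(check_output_size(sample, 15))
        print(check_output_size(sample, 20))
```

The point of the sketch is only that the result is rejected after the computation has finished, which matches what is being reported here: the task runs to the end and then fails on the output file.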

Got any quorum detail info? Booted lately? Can we see the start up log of BOINC? Also a piece of the BOINC message log where the job fails (stdoutdae.txt file for old message info)?

My last 15 CEP results from different machines:

E000758_ 972C_ 00670420c_ 1-- 849476 Pending Validation 6/24/09 15:31:57 6/27/09 03:29:43 7.90 129.8 / 0.0
E000758_ 858C_ 003c0080u_ 0-- 849476 Valid 6/24/09 14:53:48 6/27/09 01:41:51 8.04 132.0 / 128.0
E000761_ 045C_ 003c00502_ 1-- 95711 Pending Validation 6/25/09 07:42:59 6/26/09 23:19:47 12.01 124.3 / 0.0
E000758_ 645C_ 006v05511_ 1-- 849476 Pending Validation 6/24/09 13:43:29 6/26/09 18:59:20 7.39 105.7 / 0.0
E000758_ 625C_ 003z05500_ 0-- 849476 Valid 6/24/09 13:42:57 6/26/09 16:59:03 8.22 136.7 / 132.8
E000758_ 649C_ 006v05911_ 0-- 849476 Valid 6/24/09 13:42:57 6/26/09 10:51:41 7.38 122.7 / 115.2
E000758_ 648C_ 006v05811_ 1-- 849476 Valid 6/24/09 13:42:55 6/26/09 09:56:34 7.31 121.5 / 112.1
E000760_ 045C_ 006w0550w_ 1-- 95711 Valid 6/24/09 22:47:37 6/26/09 06:38:37 10.99 115.4 / 131.1
E000758_ 631C_ 003d09111_ 0-- 849476 Pending Validation 6/24/09 13:42:53 6/26/09 02:42:42 7.90 131.3 / 0.0
E000758_ 640C_ 006v05011_ 0-- 849476 Pending Validation 6/24/09 13:42:49 6/26/09 01:54:26 7.30 121.4 / 0.0
E000758_ 624C_ 003z05400_ 1-- 849476 Valid 6/24/09 13:40:46 6/25/09 17:56:41 8.17 135.7 / 137.9
E000758_ 639C_ 003d09911_ 1-- 849476 Valid 6/24/09 13:42:47 6/25/09 17:45:57 7.96 132.2 / 128.3
E000758_ 815C_ 006z0750z_ 1-- 95711 Valid 6/24/09 14:41:59 6/25/09 15:41:43 11.44 120.2 / 125.5
E000758_ 626C_ 003z05600_ 0-- 849476 Pending Validation 6/24/09 13:40:24 6/25/09 07:24:01 7.97 132.5 / 0.0
E000758_ 642C_ 006v05211_ 0-- 849476 Valid 6/24/09 13:40:24 6/25/09 06:35:48 7.10 117.9 / 104.5
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Jun 27, 2009 7:34:42 AM]
adrianxw
Senior Cruncher
Denmark
Joined: Apr 13, 2008
Post Count: 196
Status: Offline
Re: All wu's crashing at the end.

The example I linked to shows the WU also failing for another cruncher after running.

The machines are typically rebooted once a week, usually after Windows updates arrive. I don't recall when these ones were last rebooted.

From Messages.

27/06/2009 05:40:19 World Community Grid Computation for task E000765_521C_005v0610z_1 finished
27/06/2009 05:40:19 World Community Grid Output file E000765_521C_005v0610z_1_0 for task E000765_521C_005v0610z_1 exceeds size limit.
27/06/2009 05:40:19 World Community Grid File size: 18114921.000000 bytes. Limit: 15000000.000000 bytes
27/06/2009 05:40:20 World Community Grid [sched_op_debug] Deferring communication for 1 min 0 sec
27/06/2009 05:40:20 World Community Grid [sched_op_debug] Reason: Unrecoverable error for result E000765_521C_005v0610z_1 (<file_xfer_error> <file_name>E000765_521C_005v0610z_1_0</file_name> <error_code>-131</error_code></file_xfer_error>)

I have six wu's like that now. Where is the other item you asked for located?
[Jun 27, 2009 8:01:58 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: All wu's crashing at the end.

Quorum details:
On My Grid > Result Status: for the erroring WUs, check how your wingmen have been doing on that workunit and copy and paste the details here.

Startup log:
You could extract this from the stdoutdae.txt file in the BOINC data folder, but it might be easier to restart BOINC.
[Jun 27, 2009 8:11:08 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: All wu's crashing at the end.

We are not allowed to see result details of others at WCG. Am interested to see the sent dates. Mine are all 758 and a few 760 completed just now, yours for batch 765, that much I can see. Have a 764 in the queue so will pull it ahead with suspend/release to see what it does today.
----------------------------------------
[Edit 1 times, last edit by Sekerob at Jun 27, 2009 8:13:40 AM]
[Jun 27, 2009 8:12:26 AM]
adrianxw
Senior Cruncher
Denmark
Joined: Apr 13, 2008
Post Count: 196
Status: Offline
Re: All wu's crashing at the end.

One of the failing wu's is a 766, the rest are 765. I have several "Pending" and a "Valid" 764.

<edit>
Do you think it is worth holding on to the suspended wu's with the thought that the allowable size will get increased? They are 766's if that helps.

<another edit>
Can now see another of the wu's has been failed out by a wingman.
----------------------------------------
[Edit 4 times, last edit by adrianxw at Jun 27, 2009 9:39:24 AM]
[Jun 27, 2009 8:50:34 AM]
Ardruin
Cruncher
Germany
Joined: Dec 3, 2007
Post Count: 21
Status: Offline
Re: All wu's crashing at the end.

Hello,

My computer has crashed every task since yesterday. I have lost 33 h of computation time. The error occurs at the end of computation.
This is the log:

27.06.2009 10:38:47 World Community Grid Computation for task E000766_288C_003e07806_0 finished
27.06.2009 10:38:47 World Community Grid Output file E000766_288C_003e07806_0_0 for task E000766_288C_003e07806_0 exceeds size limit.
27.06.2009 10:38:47 World Community Grid File size: 33266086.000000 bytes. Limit: 15000000.000000 bytes


27.06.2009 11:48:40 World Community Grid Computation for task E000766_345C_003e01500_0 finished
27.06.2009 11:48:40 World Community Grid Output file E000766_345C_003e01500_0_0 for task E000766_345C_003e01500_0 exceeds size limit.
27.06.2009 11:48:40 World Community Grid File size: 32931748.000000 bytes. Limit: 15000000.000000 bytes



27.06.2009 11:51:08 World Community Grid Computation for task E000766_365C_00300350y_0 finished
27.06.2009 11:51:08 World Community Grid Output file E000766_365C_00300350y_0_0 for task E000766_365C_00300350y_0 exceeds size limit.
27.06.2009 11:51:08 World Community Grid File size: 17726492.000000 bytes. Limit: 15000000.000000 bytes
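To make the numbers in these messages easier to compare, here is a rough sketch that pulls the file size and limit out of the "File size: ... Limit: ..." lines and works out how far over the limit each result is. The regular expression and the helper name `parse_size_line` are my own and only assume the message format shown in this thread; other client versions may word the message differently.

```python
import re

# Matches the "File size: N bytes. Limit: M bytes" message shown in this thread.
SIZE_LINE = re.compile(
    r"File size: (?P<size>\d+(?:\.\d+)?) bytes\. "
    r"Limit: (?P<limit>\d+(?:\.\d+)?) bytes"
)


def parse_size_line(line):
    """Return (size, limit, overage) in bytes, or None if the line does not match."""
    match = SIZE_LINE.search(line)
    if match is None:
        return None
    size = int(float(match.group("size")))
    limit = int(float(match.group("limit")))
    return size, limit, size - limit


# The three size messages from the log above.
LOG_LINES = [
    "27.06.2009 10:38:47 World Community Grid File size: 33266086.000000 bytes. Limit: 15000000.000000 bytes",
    "27.06.2009 11:48:40 World Community Grid File size: 32931748.000000 bytes. Limit: 15000000.000000 bytes",
    "27.06.2009 11:51:08 World Community Grid File size: 17726492.000000 bytes. Limit: 15000000.000000 bytes",
]

if __name__ == "__main__":
    for line in LOG_LINES:
        parsed = parse_size_line(line)
        if parsed is not None:
            size, limit, over = parsed
            print(f"{size} bytes is {over} bytes over the {limit}-byte limit")
```

Run against a saved copy of the message log, something like this would show at a glance whether the oversized files cluster in particular batches.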




My computer is an Intel Q9450 @ 3.2 GHz, 4 GB DDR2-1066, Win Vista 64-bit. Prime-stable.


Bye
Markus
[Jun 27, 2009 9:59:53 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: All wu's crashing at the end.

Thanks for confirming. I've already placed a note in the Back Room for the techs to take this on. Follow-up is still a few hours away, though, as it's night time where they are.

I'll let the 764 job run and see if it also suffers from this issue. Since it has been sitting in the queue for a good day, I doubt it will; if 764 were affected, I would have expected reports to come in before now.

Thanks for your patience.
[Jun 27, 2009 10:11:16 AM]
mito7
Advanced Cruncher
Slovakia
Joined: Oct 12, 2008
Post Count: 58
Status: Offline
Re: All wu's crashing at the end.

One of my CEP WUs encountered the same error (file too big) as mentioned above. It was my first of batch 765. I have 7 others in the queue. It seems the 765 (and 766) batches have this problem. I will report here how the other WUs end.

WUs:
E000765_815C_002x0050l error
E000765_817C_002x0070l error
E000765_821C_003e0610d computing (stopped until techs say what to do)
E000765_824C_003e0640d computing (stopped until techs say what to do)
E000765_921C_002y03107 waiting (stopped until techs say what to do)
E000765_929C_002y03907 computing (stopped until techs say what to do)
E000765_926C_002y03607 waiting (stopped until techs say what to do)
E000765_934C_002y07405 waiting (stopped until techs say what to do)

Edit: Names of WUs
EDIT: Client 5.10.45, OS WinXP SP3
----------------------------------------
[Edit 5 times, last edit by mito7 at Jun 27, 2009 6:06:13 PM]
[Jun 27, 2009 11:06:53 AM]
Greg Lyke
Advanced Cruncher
Joined: May 30, 2008
Post Count: 50
Status: Offline
Re: All wu's crashing at the end.

As there seems to be a problem with batch 765 (& maybe 766), would it be a good idea to temporarily suspend crunching on CEP & move those resources to a different project? (by changing the profile to something else, suspend all pending 765s & then increasing the cache size)

I have a (dual core) computer dedicated to CEP at the moment that is just starting on its 765s (both under 3% done, with 6 more 765s in the queue). While I don't particularly want to stop working on CEP at the moment (chasing that silly gold badge), I also don't want my computer's work time to be wasted if I could switch to something else while this issue is resolved (or if you need more 765 test batches I could let them go & report back much later today/early tomorrow).

If it's just a matter of the back room people increasing the size of the files that they will accept & then resending the information then my switch to a different project idea isn't needed. However if that time & effort is just going to be thrown out, I would rather devote my resources to a project that is running smoothly at the moment.

Edit: I just noticed that a different machine picked up a couple of batch 767s. If it would be useful I can push them to the front of the queue & see if the same problem exists there as well. They however have an estimated completion time of 21 hours so it will take a while to get any results back.
----------------------------------------
[Edit 1 times, last edit by Greg Lyke at Jun 27, 2009 12:59:41 PM]
[Jun 27, 2009 12:53:41 PM]