Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 26
Posts: 26   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 8461 times and has 25 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
This has got to be some sort of record. [ Resolved ]

Hi.

Just saw this task I've got on one of my rigs this morning, it has erred 7 times now it's my turn to have a go.
Hope it's not a faulty task or validator's not playing up, a lot of wasted hours otherwise. confused
We'll see how it goes.

E215036_ 969_ C.36.C30H16N2S2SeSi.00296881.1.set1d06_ 7
----------------------------------------
[Edit 1 times, last edit by Former Member at Sep 7, 2013 5:18:08 AM]
[Sep 1, 2013 10:58:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Jim_Helfrich
Cruncher
Joined: Oct 26, 2010
Post Count: 1
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

I am having the same issue - running XP on an old machine. I have aborted, reset, got another Clean Energy run, watched it restart a couple of time, and aborted it. When you have this fixed, please let me know and I will start allowing Clean Energy to run on my machines again. Thanks.
[Sep 2, 2013 1:04:23 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

P . P . L ., one of my devices also got into a similarly behaving CEP WU. When I first saw your post I thought it must be the same WU, but it has a different ID. The one my device "contributed" to had the following ID: E214805_ 511_ C.35.C30H17NS2SeSi.00816741.4.set1d06_ 5. I believe the 8th copy is presently running. So far it has soaked up 100.2 CPU hours. biggrin Anyone else have a nomination for most wasteful WU? รพ
[Sep 2, 2013 2:43:03 AM]   Link   Report threatening or abusive post: please login first  Go to top 
LAZA74
Advanced Cruncher
Germany
Joined: Sep 28, 2008
Post Count: 56
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

Hai all,

i got 14 WUs which ended up with errors and no clue of what the problem is.
Some between them finished without problems...

Result log:
<core_client_version>7.2.7</core_client_version>
<![CDATA[
<message>
process exited with code 195 (0xc3, -61)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
[18:32:58] Number of jobs = 16
[18:32:58] Starting job 0,CPU time has been restored to 0.000000.
[18:32:58] Starting new Job
Application exited with RC = 0x100
[ERROR] Failed to open either source or destination files while copying C.37.C33H21NOSi2.01320853.1.noopt.bp86.sto6g.n.sp/53.0 to C.37.C33H21NOSi2.01320853.1.noopt.bp86.sto6g.n.sp.53.0. Error: 2
[18:32:59] Finished Job #0
18:32:59 (13780): called boinc_finish

</stderr_txt>
]]>


The worst thing is:
I'm not able to start virtual machines after one of the WUs crashed, the start normal but instead of the login screen the resolution goes to 800x600 (or something like this) and leaves the screen blank - i have to kill the application.

Help would be appreciated.

The errored WUs are:
E215317_258_C.37.C34H21NSSi.01712283.3.set1d06
E215316_957_C.38.C34H19N3S.01548050.0.set1d06
E215314_279_C.38.C34H19N3S.02011968.3.set1d06
E215309_443_A.37.C32H21N3SSi.31.1.set1d06
E215310_217_A.36.C33H21NSSe.37.2.set1d06
E215312_108_C.37.C34H21NSSi.00283356.3.set1d06
E215312_228_C.38.C33H19N3OS.00686810.3.set1d06
E215311_715_A.37.C34H21NS2.34.2.set1d06
E215306_704_C.36.C32H23NSi3.00265403.4.set1d06
E215307_820_A.37.C34H21NS2.50.1.set1d06
E215307_628_C.37.C34H21NSSi.01173971.2.set1d06 -- (this got 3 Errors from 3 different machines and BOINC versions!)

E215307_672_C.37.C33H21NOSi2.01320853.1.set1d06

still pending:
E215314_179_C.37.C34H21NSSi.02101837.3.set1d06
E215313_480_C.37.C34H21NSSi.00525550.3.set1d06

Thanks for help
LAZA
----------------------------------------
NAS - Eigenbau
Xiaomi Mi 10T
[Sep 2, 2013 5:13:39 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

[ERROR] Failed to open either source or destination files while copying C.37.C33H21NOSi2.01320853.1.noopt.bp86.sto6g.n.sp/53.0 to C.37.C33H21NOSi2.01320853.1.noopt.bp86.sto6g.n.sp.53.0. Error: 2


Disk space issue maybe?
[Sep 2, 2013 5:54:31 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

Hi.
Well after over 11hrs it returned with an error, I did notice that the last file to upload was not as big as usual & was gone quickly, my upload speed is not that quick.

This is the end of the error report for it.
----------
15:16:26] Starting job 12,CPU time has been restored to 25997.090000.
[15:16:26] Starting new Job
[15:16:26] Qink name = fldman
[15:16:31] Qink name = gesman
[15:16:32] Qink name = scfman
Application exited with RC = 0x100
[19:03:20] Finished Job #12
[19:03:20] Starting job 13,CPU time has been restored to 39340.980000.
[19:03:20] Skipping Job #13
[19:03:20] Starting job 14,CPU time has been restored to 39340.980000.
[19:03:20] Skipping Job #14
[19:03:20] Starting job 15,CPU time has been restored to 39340.980000.
[19:03:20] Skipping Job #15
19:03:28 (2625): called boinc_finish

</stderr_txt>
[Sep 2, 2013 10:01:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

another one
E215042_003_C.35.C30H18S2SeSi2.02238628.1.set1d06

this is totally waste of time.. my 3rd WU which was full of errors (I was _10 wingman).. by the time of 6 hours/CEP2 WU I could have been crunching 5-6 FAHV WUs...
[Sep 2, 2013 11:03:41 PM]   Link   Report threatening or abusive post: please login first  Go to top 
AgrFan
Senior Cruncher
USA
Joined: Apr 17, 2008
Post Count: 397
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

Had these WUs error out after reaching the 12 hour cutoff. Possibly a bad batch?

E215039_ 518_ C.35.C30H18S2SeSi2.01810979.3.set1d06_ 6-- - In Progress 9/2/13 14:51:29 9/5/13 14:51:29 0.00 0.0 / 0.0
E215039_ 518_ C.35.C30H18S2SeSi2.01810979.3.set1d06_ 5-- - In Progress 9/2/13 14:42:38 9/5/13 14:42:38 0.00 0.0 / 0.0
E215039_ 518_ C.35.C30H18S2SeSi2.01810979.3.set1d06_ 3-- 640 Error 9/2/13 00:05:47 9/2/13 12:29:47 12.00 290.0 / 0.0 <== me
E215039_ 518_ C.35.C30H18S2SeSi2.01810979.3.set1d06_ 4-- 640 Error 9/2/13 00:01:09 9/2/13 13:18:00 12.00 218.3 / 0.0
E215039_ 518_ C.35.C30H18S2SeSi2.01810979.3.set1d06_ 2-- 640 Error 8/31/13 12:33:35 8/31/13 21:51:50 6.49 417.8 / 0.0
E215039_ 518_ C.35.C30H18S2SeSi2.01810979.3.set1d06_ 1-- 640 Error 8/31/13 12:08:36 9/1/13 23:41:44 6.76 372.0 / 0.0
E215039_ 518_ C.35.C30H18S2SeSi2.01810979.3.set1d06_ 0-- 640 Error 8/30/13 14:34:56 8/31/13 11:53:15 12.00 324.4 / 0.0

E214459_ 521_ C.34.C28H16S3SeSi2.00408829.4.set1d06_ 6-- - In Progress 9/2/13 13:03:20 9/5/13 13:03:20 0.00 0.0 / 0.0
E214459_ 521_ C.34.C28H16S3SeSi2.00408829.4.set1d06_ 5-- - No Reply 8/30/13 12:46:43 9/2/13 12:46:43 0.00 0.0 / 0.0
E214459_ 521_ C.34.C28H16S3SeSi2.00408829.4.set1d06_ 4-- 640 Error 8/29/13 15:23:05 8/30/13 12:28:54 12.00 464.6 / 0.0 <== me
E214459_ 521_ C.34.C28H16S3SeSi2.00408829.4.set1d06_ 3-- 640 Error 8/28/13 18:23:43 8/29/13 14:45:23 11.87 329.4 / 0.0
E214459_ 521_ C.34.C28H16S3SeSi2.00408829.4.set1d06_ 2-- 640 Error 8/26/13 22:19:10 8/28/13 18:00:24 8.68 349.3 / 0.0
E214459_ 521_ C.34.C28H16S3SeSi2.00408829.4.set1d06_ 1-- 640 Error 8/26/13 05:58:45 8/26/13 21:54:18 7.57 365.6 / 0.0
E214459_ 521_ C.34.C28H16S3SeSi2.00408829.4.set1d06_ 0-- 640 Error 8/24/13 04:55:41 8/26/13 05:57:29 12.00 282.0 / 0.0

I have a few others where I'm the first error. Waiting to see what happens with them.
----------------------------------------

  • i5-10400 (Comet Lake, 6C/12T) @ 2.9 GHz
  • i5-7400 (Kaby Lake, 4C/4T) @ 3.0 GHz
  • i5-4590 (Haswell, 4C/4T) @ 3.3 GHz
  • i5-3330 (Ivy Bridge, 4C/4T) @ 3.0 GHz

----------------------------------------
[Edit 3 times, last edit by AgrFan at Sep 2, 2013 11:27:57 PM]
[Sep 2, 2013 11:23:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

Hi.

Yes not a good look for an Energy saving project, all these possibly thousands of compute hours on these tasks = energy/power used for nothing. crying

Hope the techs have a look at this.
[Sep 3, 2013 2:47:15 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: This has got to be some sort of record.

Not to mention the hours spent doing double wu to revalidate each and every host that error out
[Sep 3, 2013 4:08:02 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 26   Pages: 3   [ 1 2 3 | Next Page ]
[ Jump to Last Post ]
Post new Thread