Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 38
Posts: 38   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1579 times and has 37 replies Next Thread
[AF>Le_Pommier] Jerome_C2005
Cruncher
Joined: Aug 17, 2006
Post Count: 29
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
All tasks failing on linux host

Hi

All my ARP tasks are failing on a linux debian host

<core_client_version>7.20.5</core_client_version>
<![CDATA[
<message>
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>1bb5d2c08b5f46568d06a38bb37922e8.</file_name>
<error_code>-200 (wrong size)</error_code>
</file_xfer_error>
</message>
]]>


I can see they are failing for the other crunchers also .

Thanks
----------------------------------------

[Nov 5, 2024 8:38:40 AM]   Link   Report threatening or abusive post: please login first  Go to top 
catchercradle
Advanced Cruncher
Joined: Jan 16, 2009
Post Count: 95
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

Thanks, I am currently running mine on a Windows host using WINE as I also have CPDN work that is Windows only running. I had been planning on going back to a native Linux client afterwards but will now keep an eye on this thread before doing so!
"By the time they finish and get uploaded, the issue may well be resolved!
[Nov 5, 2024 12:46:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
siu77
Cruncher
Russia
Joined: Mar 12, 2012
Post Count: 20
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

Two tasks have been validated on Debian with no problems. This one is validating.

<core_client_version>7.20.5</core_client_version>
<![CDATA[
<stderr_txt>
INFO: Initializing
INFO: No state to restore. Start from the beginning.
Starting WRFMain
[11:26:45] INFO: Checkpoint taken at 2019-03-18_06:00:00
[13:53:54] INFO: Checkpoint taken at 2019-03-18_12:00:00
[16:10:03] INFO: Checkpoint taken at 2019-03-18_18:00:00
[17:58:53] INFO: Checkpoint taken at 2019-03-19_00:00:00
[20:02:45] INFO: Checkpoint taken at 2019-03-19_06:00:00
[22:32:15] INFO: Checkpoint taken at 2019-03-19_12:00:00
[00:53:59] INFO: Checkpoint taken at 2019-03-19_18:00:00
[02:39:47] INFO: Checkpoint taken at 2019-03-20_00:00:00
INFO: Simulation complete compressing output.
02:41:27 (2146034): called boinc_finish(0)

</stderr_txt>


I think the problem is a "download error".
[Nov 5, 2024 5:15:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[AF>Le_Pommier] Jerome_C2005
Cruncher
Joined: Aug 17, 2006
Post Count: 29
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

I have the record on BoincTask of 1 ARP task (the 1st I got) that crunched for 7 hours and ended in success (the only one in success so far), and this task is *not* listed in my WCG account

ARP1_0014278_127_1 07:20:18 96.99% Nov 01, 2024, 12:55:26 PM OK

In my result page for that host I just see the 4 failed tasked (like the one I mention above) + 4 pending tasks that the host didn't start yet...

Not very encouraging.
----------------------------------------

[Nov 5, 2024 9:50:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Boca Raton Community HS
Advanced Cruncher
Joined: Aug 27, 2021
Post Count: 113
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

Most of ours are also failing. I will be able to post more.numbers later tonight along with error messages.
[Nov 5, 2024 10:09:49 PM]   Link   Report threatening or abusive post: please login first  Go to top 
gj82854
Advanced Cruncher
Joined: Sep 26, 2022
Post Count: 57
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

Out of approximately 150 WUs returned from Linux hosts (Ubuntu and Fedora) only 4 had any kind of error indication. Those 4 were flagged as "Too Late" even though all were returned days before the deadline with a BOINC return code of 0. In all 4 of those jobs, 5 wingmen returned errors in each job. I'm concluding that the "too late" indication is a bogus indication. I don't know what the max error count is for work units but it looks like if a WU is returned with a successful completion but the error count is reached before a quorum is reached, it is flagged as too late.
[Nov 6, 2024 1:57:17 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Boca Raton Community HS
Advanced Cruncher
Joined: Aug 27, 2021
Post Count: 113
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

Out of approximately 150 WUs returned from Linux hosts (Ubuntu and Fedora) only 4 had any kind of error indication. Those 4 were flagged as "Too Late" even though all were returned days before the deadline with a BOINC return code of 0. In all 4 of those jobs, 5 wingmen returned errors in each job. I'm concluding that the "too late" indication is a bogus indication. I don't know what the max error count is for work units but it looks like if a WU is returned with a successful completion but the error count is reached before a quorum is reached, it is flagged as too late.


Interesting. And odd.

Also, you returned 150 ARP1 tasks??
[Nov 6, 2024 2:26:33 AM]   Link   Report threatening or abusive post: please login first  Go to top 
gj82854
Advanced Cruncher
Joined: Sep 26, 2022
Post Count: 57
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

After further review, 108 returned. 104 as valid or pending verification/validation and 4 as "too Late" out of 368 total WUs. Some from the weekend may have already disappeared from the results list but I have no evidence of that. Another 15 will upload in the next 3 to 4 hours

EDIT: BoincTasks now show 57 ARP1 WUs pending upload. Can't seem to get anything uploaded or downloaded.
----------------------------------------
[Edit 1 times, last edit by gj82854 at Nov 6, 2024 2:05:23 PM]
[Nov 6, 2024 12:32:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
PMH_UK
Veteran Cruncher
UK
Joined: Apr 26, 2007
Post Count: 759
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

Out of approximately 150 WUs returned from Linux hosts (Ubuntu and Fedora) only 4 had any kind of error indication. Those 4 were flagged as "Too Late" even though all were returned days before the deadline with a BOINC return code of 0. In all 4 of those jobs, 5 wingmen returned errors in each job. I'm concluding that the "too late" indication is a bogus indication. I don't know what the max error count is for work units but it looks like if a WU is returned with a successful completion but the error count is reached before a quorum is reached, it is flagged as too late.

"Too Late" is used when # error tasks exceeds limit for any further returns so yes it is misleading.
----------------------------------------
Paul.
[Nov 6, 2024 5:06:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12120
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: All tasks failing on linux host

As soon as 5 copies have been returned with errors, the unit is terminated so the "Too Late" message means that it was received after the unit was terminated even if before the deadline.

Mike
[Nov 6, 2024 9:41:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 38   Pages: 4   [ 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread