Total posts in this thread: 12
This topic has been viewed 2267 times and has 11 replies
NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
ARP is the Problem?

What is it about the Africa Rainfall Project that "breaks" WCG?

Cheers

[Edit - spelled out ARP]
[Edit 1 times, last edit by NixChix at Dec 11, 2022 9:59:42 PM]
[Dec 11, 2022 9:57:49 PM]
mctom
Cruncher
Poland
Joined: Dec 3, 2022
Post Count: 13
Re: ARP is the Problem?

Ever since I enlarged my HDD space allowance and was granted an ARP task, I have stopped receiving tasks from OPN and MCM - I think the bottleneck is in receiving the task data.
My computers went idle waiting for data - I thought it was a temporary issue on the server side, but it does coincide with me receiving an ARP task for the first time.
[Dec 11, 2022 10:27:58 PM]
Bryn Mawr
Senior Cruncher
Joined: Dec 26, 2018
Post Count: 331
Re: ARP is the Problem?

> Ever since I enlarged my HDD space allowance and was granted an ARP task, I have stopped receiving tasks from OPN and MCM - I think the bottleneck is in receiving the task data.
> My computers went idle waiting for data - I thought it was a temporary issue on the server side, but it does coincide with me receiving an ARP task for the first time.

Probably a coincidence, as they released a batch of ARP work around the same time that the other projects ran short of work.
[Dec 11, 2022 11:45:23 PM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1313
Re: ARP is the Problem?

Since ARP is back in the mix, I get several retries on almost every MCM1 download.
I was not able to get a single ARP task because they all failed on download.

WU download error: couldn't get input files:
<file_xfer_error>
<file_name>9709eb00b79ae8f9b71907412f2a20b4.</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ef196f8ab25fbc86556ca04c5a2b6f89.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>88e73f4267e7762d4b2f0dc10a9b9102.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>be00dc3f146fa7a952459a771e46a9c9.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>4ed1fbb83f8aa47a15502894f316ac8c.</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>dc05d77d3c1eedb2a4c7cd5211b86413.</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>10c9a0e4248c971fe6a691762d5cddf3.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>43675a630e0eaa9ae114e92efa084c65.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>073835cf972871da4fd0d00ade8cef1c.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0005079_134_ARP1_0005079_input_d02</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0005079_134_ARP1_0005079_input_d03</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>291d750acffcfaa7d6e8bb3c1ff8f81e.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>9a5077ac7b573887fb2c02b57d38744e.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>b5b532185fd7f15122a6881685c7e039.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>660cac491166e67232fd9ea4987efbf8.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0014410_134_ARP1_0014410_input_d01</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0014410_134_ARP1_0014410_input_d03</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>1047b0ce62f187bcc3c1f959349ed96e.</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>db8a535d4fe97680547313a24171f044.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>e4f58446834efec649a6b1452fcf0995.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>513807df517cd23ff057a33ec8ce43a6.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0028708_134_ARP1_0028708_input_d01</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0028708_134_ARP1_0028708_input_d02</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0028708_134_ARP1_0028708_input_d03</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>c2bec3d6a8dcacca7a2b88b2dd0bda5c.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>fca01a7427e4a031c063d2a2c919770a.</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>2b898705c36ecfa6fe3db21795db27fb.</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ffacb66cc3ed0df5f977191d8ab2989d.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0027592_134_ARP1_0027592_input_d01</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>arp1_image02_7.32.tga</file_name>
<error_code>-120 (RSA key check failed for file)</error_code>
<error_message>signature verification failed</error_message>
</file_xfer_error>
============================================================
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>arp1_image02_7.32.tga</file_name>
<error_code>-120 (RSA key check failed for file)</error_code>
<error_message>signature verification failed</error_message>
</file_xfer_error>
============================================================
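For anyone who wants to summarise a log like the one above, here is a minimal Python sketch that counts the file_xfer_error entries per error code. The sample text is a shortened copy of three entries from the log. The function name tally_transfer_errors and the regular expression are hypothetical, written only for illustration and not part of BOINC or WCG. The pattern assumes each <file_name> line is followed directly by its <error_code> line, as in the output shown here.

```python
import re
from collections import Counter

# Shortened copy of three entries from the log above.
SAMPLE_LOG = """\
WU download error: couldn't get input files:
<file_xfer_error>
<file_name>ef196f8ab25fbc86556ca04c5a2b6f89.7z</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
<file_xfer_error>
<file_name>ARP1_0005079_134_ARP1_0005079_input_d02</file_name>
<error_code>-119 (md5 checksum failed for file)</error_code>
</file_xfer_error>
============================================================
app_version download error: couldn't get input files:
<file_xfer_error>
<file_name>arp1_image02_7.32.tga</file_name>
<error_code>-120 (RSA key check failed for file)</error_code>
<error_message>signature verification failed</error_message>
</file_xfer_error>
"""

# Hypothetical pattern: a file name followed by "<code> (<reason>)".
ENTRY_RE = re.compile(
    r"<file_name>(?P<name>[^<]*)</file_name>\s*"
    r"<error_code>(?P<code>-?\d+) \((?P<reason>[^)]*)\)</error_code>"
)


def tally_transfer_errors(log_text):
    """Return (Counter of error code -> count, list of (file, code, reason))."""
    entries = [
        (m["name"], int(m["code"]), m["reason"])
        for m in ENTRY_RE.finditer(log_text)
    ]
    counts = Counter(code for _, code, _ in entries)
    return counts, entries


counts, entries = tally_transfer_errors(SAMPLE_LOG)
for code, n in sorted(counts.items()):
    print(code, n)
```

Run against the full log, a tally like this would show at a glance whether the failures are mostly checksum errors (-119) or signature errors (-120), and which file names are affected.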

[Dec 12, 2022 11:22:02 AM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1928
Re: ARP is the Problem?

I don't know to what degree ARP is "responsible" for the download errors. Going by Cyclops' last status post, the cause could be a lack of server space, for which they are apparently waiting on SSDs.
But for me, the download errors seemed to start before I noticed that any ARP WUs had been sent out. The most aggravating fact is that these things always happen right at the start of the weekend, and then there are crickets for several days from Krembil. Certainly not the response time and communication of the "good old WCG"... :(

Ralf
[Dec 12, 2022 4:08:45 PM]
PMH_UK
Veteran Cruncher
UK
Joined: Apr 26, 2007
Post Count: 761
Re: ARP is the Problem?

The SSDs are for speed, not capacity.
Poor response from the file store is the cause of issues with downloads etc.

Paul.
[Dec 12, 2022 4:17:28 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1928
Re: ARP is the Problem?

> The SSDs are for speed, not capacity.
> Poor response from the file store is the cause of issues with downloads etc.
>
> Paul.

Read Cyclops' post in that regard from last month (or the one before). It clearly stated that increasing capacity is one of the steps they are trying to take. Simply replacing existing (spinning rust) storage with SSDs would only mask the underlying problem, not solve anything...

Ralf
[Dec 12, 2022 4:31:35 PM]
PMH_UK
Veteran Cruncher
UK
Joined: Apr 26, 2007
Post Count: 761
Re: ARP is the Problem?

Increasing capacity may be required for future projects.
Speed is required to address issues with current projects.

Paul.
[Dec 12, 2022 4:48:28 PM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2069
Re: ARP is the Problem?

> The most aggravating fact is that these things always happen right at the start of the weekend, and then there are crickets for several days from Krembil. Certainly not the response time and communication of the "good old WCG"... :(

Sorry, but response times in 'the good old days' weren't what you seem to remember.

I searched the forums and immediately found this case from 2017:
Post 538015 from 28 January 2017 at 4:27 AM (my local time), probably late at night in the USA on Friday.
Response from the IBM team was 86½ hours later (that's more than 3½ days), on a Tuesday (in post 538323).
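The interval above can be checked with a few lines of Python. This is only a sketch: it treats the quoted start time as a naive local timestamp and assumes the reply is counted in the same time zone. The exact time of post 538323 is not given here, so it is derived from the 86½-hour figure rather than looked up.

```python
from datetime import datetime, timedelta

# Start time as quoted above (poster's local time, no time zone attached).
first_post = datetime(2017, 1, 28, 4, 27)

# Delay quoted above: 86.5 hours.
delay = timedelta(hours=86.5)

# Derived reply time (an assumption, not taken from the forum itself).
reply = first_post + delay

print(first_post.strftime("%A"))           # weekday of the first post
print(reply.strftime("%A %d %B %Y %H:%M"))  # weekday and time of the reply
print(round(delay / timedelta(days=1), 2))  # delay expressed in days
```

The result lands on a Tuesday, a little over 3.6 days after the first post, which matches the figures quoted above.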
[Dec 12, 2022 5:20:03 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 1928
Re: ARP is the Problem?

> > The most aggravating fact is that these things always happen right at the start of the weekend, and then there are crickets for several days from Krembil. Certainly not the response time and communication of the "good old WCG"... :(
> Sorry, but response times in 'the good old days' weren't what you seem to remember.
>
> I searched the forums and immediately found this case from 2017:
> Post 538015 from 28 January 2017 at 4:27 AM (my local time), probably late at night in the USA on Friday.
> Response from the IBM team was 86½ hours later (that's more than 3½ days), on a Tuesday (in post 538323).

Exceptions confirm the rule. One incident, almost 6 years ago. Shocking.

I recall several times when issues were solved on a weekend (or in general) within a couple of hours, or at least there was an acknowledgement. I even posted one of those cases where the issue was resolved in a bit over one hour.
Sorry, communication since Krembil is as bad as it has ever been. And that should be something that could be resolved with barely any computing resources at all....

Ralf
[Dec 12, 2022 6:19:13 PM]