Total posts in this thread: 46
This topic has been viewed 39326 times and has 45 replies
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7846
Re: 2022-10-27 Update (Workunits & storage update)

Cyclops:
Thanks for the update.
Keep us informed.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Oct 28, 2022 12:19:32 AM]
ThreadRipper
Veteran Cruncher
Sweden
Joined: Apr 26, 2007
Post Count: 1324
Re: 2022-10-27 Update (Workunits & storage update)

Thanks for the update!
If the number of connections might be a problem, would it be possible to group several OPNG files (and other projects' files) into one package (zip or something) that would then be unzipped upon download into several separate WUs? Then one could send 10 OPNG WUs at once in a single transfer, instead of lots of small files that each need a new HTTPS connection.
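
To make the bundling idea concrete, here is a minimal Python sketch of packing several small workunit input files into one zip archive and unpacking them again on the receiving side. It is only an illustration: the file names and contents are made up, and nothing here reflects how the WCG servers or the BOINC client actually move files.

```python
import io
import zipfile


def bundle_workunits(files: dict[str, bytes]) -> bytes:
    """Pack several workunit input files into a single zip archive."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w", compression=zipfile.ZIP_DEFLATED) as zf:
        for name, data in files.items():
            zf.writestr(name, data)
    return buf.getvalue()


def unbundle_workunits(archive: bytes) -> dict[str, bytes]:
    """Unpack the archive back into separate per-workunit files."""
    with zipfile.ZipFile(io.BytesIO(archive)) as zf:
        return {name: zf.read(name) for name in zf.namelist()}


# Hypothetical file names and contents, for illustration only.
workunits = {
    f"opng_wu_{i:02d}.in": f"input for workunit {i}".encode()
    for i in range(10)
}

archive = bundle_workunits(workunits)      # one payload, one transfer
restored = unbundle_workunits(archive)     # ten separate files again
print(len(workunits), "files bundled;", len(restored), "files restored")
```

Whether this would help depends on where the bottleneck really is; the sketch only shows that ten files can travel as one payload.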

Wishing you the best of luck with the issue. My machines are ready and running - waiting for stress-tests and breakthroughs :)
----------------------------------------

Join The International Team: https://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=CK9RP1BKX1

AMD TR2990WX @ PBO, 64GB Quad 3200MHz 14-17-17-17-1T, RX6900XT @ Stock
AMD 3800X @ PBO
AMD 2700X @ 4GHz
[Oct 28, 2022 10:59:34 AM]
ADDIE2014
Cruncher
Joined: Apr 13, 2019
Post Count: 33
Re: 2022-10-27 Update (Workunits & storage update)

Thanks for the update, Cyclops. I have about 1/3 of my capacity (44 cores) running continuously now and will add the rest as needed.
[Oct 28, 2022 12:45:33 PM]
Cyclops
Senior Cruncher
Joined: Jun 13, 2022
Post Count: 295
Re: 2022-10-27 Update (Workunits & storage update)

ThreadRipper wrote:
> Thanks for the update!
> If the number of connections might be a problem, would it be possible to group several OPNG files (and other projects' files) into one package (zip or something) that would then be unzipped upon download into several separate WUs? Then one could send 10 OPNG WUs at once in a single transfer, instead of lots of small files that each need a new HTTPS connection.
>
> Wishing you the best of luck with the issue. My machines are ready and running - waiting for stress-tests and breakthroughs :)

Hi ThreadRipper, thanks for your suggestion, I passed it on to the tech team.
[Oct 28, 2022 3:04:56 PM]
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2494
Re: 2022-10-27 Update (Workunits & storage update)

<Snip>
Another (less optimistic) theory is that other tasks, specifically OPNG, were the cause of our recent storage issues and database-wide system errors. We have no solid evidence yet, only an observation that there is typically a decline in available OPNG work around the same time the download issues are less prevalent. A high load on the storage server and scheduler coincides with the database crashes and with a phenomenon whereby the download/upload server groups intermittently register as down from the perspective of our load balancer.
<Snip>
@Cyclops: How can you possibly determine whether the problem lies with OPNG if, as is the case now, you don't provide any OPNGs for download?
By now you should have been able to notice that, without OPNGs, the website and the DB do not crash. So start sending out OPNGs, and if the same problem comes back, then you will know.
----------------------------------------
[Edit 3 times, last edit by Grumpy Swede at Oct 28, 2022 6:14:59 PM]
[Oct 28, 2022 6:02:21 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2173
Re: 2022-10-27 Update (Workunits & storage update)

ThreadRipper wrote:
> Thanks for the update!
> If the number of connections might be a problem, would it be possible to group several OPNG files (and other projects' files) into one package (zip or something) that would then be unzipped upon download into several separate WUs? Then one could send 10 OPNG WUs at once in a single transfer, instead of lots of small files that each need a new HTTPS connection.
In a rare previous post from one of the techs, it was more or less confirmed that the issue is NOT the number of HTTP(S) connections on the web server (otherwise you wouldn't immediately get those 107-byte responses back!), but the number of connections between the web server and the underlying database server...

Ralf
[Oct 28, 2022 11:10:30 PM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2173
Re: 2022-10-27 Update (Workunits & storage update)

well whaddya know?

Just right in time to ring in the weekend, download errors are back.
(And no OPNG WUs to see anywhere, only OPN1 and a sprinkling of MCM1s)

Ralf
[Oct 29, 2022 3:39:37 AM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2346
Re: 2022-10-27 Update (Workunits & storage update)

TPCBF wrote:
> download errors are back.

No sign of serious HTTP errors on my end.
I had some at 01:31 - 01:34 UTC and at 02:59 - 03:02 UTC, far too brief to call serious.
[Oct 29, 2022 10:04:31 AM]
Link64
Senior Cruncher
Joined: Feb 19, 2021
Post Count: 206
Re: 2022-10-27 Update (Workunits & storage update)

Cyclops wrote:
> ThreadRipper wrote:
>> Thanks for the update!
>> If the number of connections might be a problem, would it be possible to group several OPNG files (and other projects' files) into one package (zip or something) that would then be unzipped upon download into several separate WUs? Then one could send 10 OPNG WUs at once in a single transfer, instead of lots of small files that each need a new HTTPS connection.
>>
>> Wishing you the best of luck with the issue. My machines are ready and running - waiting for stress-tests and breakthroughs :)
>
> Hi ThreadRipper, thanks for your suggestion, I passed it on to the tech team.

Another suggestion: make mcm1.dataset-sarc1.txt sticky. It's deleted every time a computer runs out of MCM work and downloaded again when it gets its next WUs. That's over 100MB of unnecessarily transferred data every time. Milkyway and Einstein use that for reusable files, even much smaller ones; Milkyway uses it even for 4kB files.
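
As a rough back-of-the-envelope sketch of what is at stake, the Python snippet below compares the data transferred for one reusable file with and without a sticky setting. The 100 MB figure is a round lower bound taken from the "over 100MB" above; the number of times a host runs dry and refills is purely an assumed value, and the function name is made up for illustration.

```python
def total_download_mb(file_mb: int, refills: int, sticky: bool) -> int:
    """Data transferred for one reusable file on one host.

    The file is always downloaded once. If it is not sticky, it is
    deleted whenever the host runs out of work and has to be fetched
    again on each of the `refills` occasions that new work arrives.
    """
    downloads = 1 if sticky else 1 + refills
    return file_mb * downloads


FILE_MB = 100   # round lower bound for the "over 100MB" dataset file
REFILLS = 5     # assumed: host ran out of MCM work and refilled 5 times

without_sticky = total_download_mb(FILE_MB, REFILLS, sticky=False)
with_sticky = total_download_mb(FILE_MB, REFILLS, sticky=True)
print("not sticky:", without_sticky, "MB")
print("sticky:    ", with_sticky, "MB")
print("saved:     ", without_sticky - with_sticky, "MB per host")
```

The savings scale with how often hosts run dry, which is exactly the situation when work is intermittent.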
----------------------------------------
[Edit 1 times, last edit by Link64 at Oct 29, 2022 11:01:34 AM]
[Oct 29, 2022 10:59:37 AM]
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2173
Re: 2022-10-27 Update (Workunits & storage update)

Well, for what it's worth (worse?), a minute ago I got 35 OPNG WUs on my programming laptop. Nothing, though, on my two other GPU-capable hosts.

Let's see if this indeed increases the likelihood of download errors (the ones from last night seemed to be gone in the morning)...

Ralf
[Oct 30, 2022 12:02:11 AM]