Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 352
Posts: 352   Pages: 36   [ Previous Page | 14 15 16 17 18 19 20 21 22 23 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 2179367 times and has 351 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Server Errors.

going fine on all fronts at the moment
[Jul 25, 2012 5:26:05 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Eurwin
Cruncher
Joined: Apr 28, 2007
Post Count: 17
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Server Errors.

Hello,
This evening everything ran smoothly for the first time in day's. Uploading, downloading AND reporting without problems.

Let's hope the problem is almost counterd smile
[Jul 25, 2012 7:54:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Server Errors.

Unfortunately, I can confirm that the issue is not resolved. Here is what is going on:

1) We are seeing the network adapters on the two servers that handle file upload/downloads filling up their cache and then starting to drop packets. These particular adapters have some known issues with the driver under load, but the question we are looking at is why they are backing up.

2) We are doing some problem determination work on our switches that handle communications between servers. Unfortunately the data collection crashed one of the switches and caused a 10 minute outage earlier today.

3) We have connected a network sniffer to monitor congestion and other issues during an 'episode'.

The challenging thing about this issue, is that once the issue starts, the only way to clear it is if we stop all access to the filesystem and let things settle down. Once everything is calm we are able to start everything back up and it performs fantastic for many hours before occurring again. We have not been able to correlate the episodes with any scheduled task (backups, application scripts, etc). In the short term, we are implementing scripts to do this automatically, but that is a workaround until the fundamental issue is identified and fixed.
[Jul 25, 2012 10:02:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Server Errors.

ty for the updated information
[Jul 25, 2012 10:11:39 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nanoprobe
Master Cruncher
Classified
Joined: Aug 29, 2008
Post Count: 2998
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Server Errors.





Maybe Mario can help. biggrin
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.


[Jul 25, 2012 11:11:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
pramo
Veteran Cruncher
USA
Joined: Dec 14, 2005
Post Count: 716
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Server Errors.

FWIW I did go about half dry with a 0.1 day cache so went to 0.2 days and have had enough to keep busy. That said, might go all crazy today and bump to 0.5 just in case. chicken



I got burned running at 0.1.

Close to getting burned with a 0.25.

. . . call out the white shirts with the service cases.

Nod to a minimum cache cruncher!
sitting at .5, until its sorted.
also a big nod to the folks working on this, we know you're busting B's
----------------------------------------

[Jul 26, 2012 2:08:08 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Server Errors.

I've been running with a 0.1 day cache on my two core laptop with no problem. I have been running CFSW exclusively for which the WUs are short (under one hour). Might longer running WUs be a cause of the problem with a smaller cache size?
[Jul 26, 2012 2:16:45 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Bearcat
Master Cruncher
USA
Joined: Jan 6, 2007
Post Count: 2803
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Server Errors.

Sounds like the switches need replacing or a different vendor if these aren't up to the task.
----------------------------------------
Crunching for humanity since 2007!

[Jul 26, 2012 2:45:34 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Bearcat
Master Cruncher
USA
Joined: Jan 6, 2007
Post Count: 2803
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Server Errors.





Maybe Mario can help. biggrin


BFH is the tool of choice when computers don't want to cooperate. wink
----------------------------------------
Crunching for humanity since 2007!

[Jul 26, 2012 2:48:17 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Server Errors.

only had a couple this AM that I had to hit the retry to upload
----------------------------------------
[Edit 1 times, last edit by Former Member at Jul 26, 2012 10:38:37 AM]
[Jul 26, 2012 10:37:28 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 352   Pages: 36   [ Previous Page | 14 15 16 17 18 19 20 21 22 23 | Next Page ]
[ Jump to Last Post ]
Post new Thread