Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 16
Posts: 16   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 19514 times and has 15 replies Next Thread
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Filesystem Error [Resolved]

We just experienced an error with the BOINC filesystem. As a result, the operating system re-mounted it in read-only mounted. We are now in the process of taking it offline and running an fsck on the filesystem. While that runs BOINC will be offline and not able to send or receive files.
----------------------------------------
[Edit 1 times, last edit by knreed at Oct 14, 2010 9:27:03 PM]
[Oct 13, 2010 8:22:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

The fsck is likely to take many hours to run.
[Oct 13, 2010 10:21:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

We have modified the server to return a 503 error with a one hour retry for all attempts to download files.
[Oct 13, 2010 11:47:12 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

The fsck is likely to take a long while (many hours). We are still in phase 1 of fsck (out of 5).
[Oct 14, 2010 12:21:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

We have brought the filesystem online but we aren't sure what condition it is in. Instead of making it available, we have made it read-only and we are copying the data to our new filesystem that we were in the process of creating. The copy is now occurring but it will still be a number of hours to be available.
[Oct 14, 2010 11:21:45 AM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

Copy is 14% complete. I will update this count periodically.
[Oct 14, 2010 11:32:35 AM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

21.8% copied
[Oct 14, 2010 12:20:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

32.3% copied.

We have two sets of copies running. One for the workunit input files (download files) and one for the result files (upload). The upload is running faster than the download. As a result, we might be able to bring the uploads online before the downloads are available.
[Oct 14, 2010 1:02:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

Overall 50.5% copied
Upload 81.5% copied
[Oct 14, 2010 2:13:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Filesystem Error

Uploads are back online and handling the surge easily.

Overall copying is at 61.2%. We are watching to see how fast this progresses now that the upload copies are done.
[Oct 14, 2010 3:00:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 16   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread