Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 62
Posts: 62   Pages: 7   [ Previous Page | 1 2 3 4 5 6 7 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 5557 times and has 61 replies Next Thread
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

There was as predicted (guess who :-), a serious upload crunch going to the Harvard CEP2 Result server, many surely having seen the initial EOS messages in the client logs including me, shortly after things came back online. Upon that I took the quad offline and back on line around midnight and _4 files (6) flew up at max bandwidth limit. I guess that after the storm laid off, there was no one left to report CEP2 tasks, all working on fresh downloads, so it felt like having the pipe to myself...

Just that you know... there's a 10 day reporting deadline, still :D
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Oct 15, 2010 9:50:17 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

First big post-outage sign of noon stats life:

10/15/10 0:006:14:34:29 15,835 29

and counting. 12 left in PV, double the pre-outage number, but falling.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Oct 15, 2010 12:21:21 PM]   Link   Report threatening or abusive post: please login first  Go to top 
sk..
Master Cruncher
http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

I expect some back offs went all the way up to 24h, so more PV than usual. A few manual updates might still help some.
[Oct 15, 2010 1:40:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

Seen Ingleside multiple times refer to client auto-offs up to 4 hours ** and knreed posting yesterday to have set the server to boot the clients 1 hour.

Would hope it does not go to 24 hours unless a client has used it's quota due error and asking again same day being pointless. Would be a potential for many clients idling along when work is available.

edit: spelling

edit2: ** The reference discussion http://www.worldcommunitygrid.org/forums/wcg/...ead,30042_offset,0#298081
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 2 times, last edit by Sekerob at Oct 15, 2010 2:21:27 PM]
[Oct 15, 2010 2:13:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
bieberj
Senior Cruncher
United States
Joined: Dec 2, 2004
Post Count: 406
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

Thank you knreed and everyone else who worked hard to get WCG back up and running and for keeping us informed with the issue. It is good to be able to see why I couldn't upload my results and get new tasks and seeing the techs' best guess when WCG would be back up. Good job!
[Oct 15, 2010 2:18:17 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

Probably different here but at GPUGrid you can download tasks before uploads finish. It asks for new tasks because existing tasks are complete. You might need to start uploading, or not, but they do not finish uploading before you can start downloading.

This is a BOINC client-feature, not server-side, so WCG and GPUGrid works the same way...

If a BOINC projects #uploads (or really, if #tasks with upload-files) is more than 2* #cpus, or 2* #gpu's (whatever is larger), the BOINC client refuses to make any work-request to the affected project.

It doesn't matter if the uploads is currently transferring, or is currently backed-off, it only goes on how many total there is.

Many projects has a single upload per task, in this case it's just to count #files in transfer-tab. So, if example you've got a quad-core, if you've got example 2 uploads in progress and 6 waiting, you've got a total of 8, and can ask for work. If you've got 2 + 7 on the other hand it's 9 total, > 8, and can't ask for work.

For WCG, you'll need to count tasks instead, if example a quad-core, you can have max 8 tasks marked as "Uploading". If these tasks has 8 uploads or 80 doesn't matter, you'll still allowed to ask for more work. With 9 tasks on the other hand, work-request is blocked.


This feature was added to guard against computers that manages crunching tasks faster than they're managing to upload them. If these computers is allowed to continue downloading more work, they'll "soon" return all work after the deadline, and for most projects this would mean the result is useless, and won't give any credit.

As an added bonus it also eases-off the load on project-servers, in cases like now there is a server-problem, but not everyone understands this is an advantage...
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Oct 15, 2010 4:47:07 PM]   Link   Report threatening or abusive post: please login first  Go to top 
sk..
Master Cruncher
http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

Thanks,
[Oct 15, 2010 7:54:25 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

I'm out of work. I keep getting "Backing off X hours".

Yesterday I got "Project down for maintanance".

Is this a client-side problem?

PS. the problem is happening NOW, as we speak.

PSS. My quad-core laptop keeps running out of work. It completes most tasks in 2-3 hours then I get the "back-off" message.

All my "slower" computers are crunching away.
----------------------------------------
[Edit 1 times, last edit by Former Member at Oct 15, 2010 8:29:59 PM]
[Oct 15, 2010 8:27:24 PM]   Link   Report threatening or abusive post: please login first  Go to top 
sk..
Master Cruncher
http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

Boinc Manager, Projects, WCG, Update ?

I'm just after downloading CEP2 tasks. Try again.
[Oct 15, 2010 8:47:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: File System Error Update thread

Still having the same problem as 3 days ago. I'm using BOINC manager to do WGC tasks...

I tried to "reset" project. It downloaded some new tasks and completed them, but after 3-4 hours I got the same problem. It hasnt even reported the task it completed. They just sit there saying "Ready to report". One of the tasks says "downloading" and it's been like that for 24 hours now.

Here's some of the abnormal messages from my log:
"Temporary failed download of XXX"
"Backing of X hours of task XXX"
"[error] reported by file upload server: Can't open file."

These messages are all over my log.
----------------------------------------
[Edit 3 times, last edit by Former Member at Oct 16, 2010 3:54:12 PM]
[Oct 16, 2010 3:46:22 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 62   Pages: 7   [ Previous Page | 1 2 3 4 5 6 7 | Next Page ]
[ Jump to Last Post ]
Post new Thread