Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 6
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 858 times and has 5 replies Next Thread
Papa3
Senior Cruncher
Joined: Apr 23, 2006
Post Count: 360
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
WCG/FaaH server problems?

On the BOINC developer's list, there has recently been some discussion of difficulties with getting FaaH jobs reported back to the WCG servers, which apparently is due to WCG server problems (see the quoted material below).

Has WCG/FaaH had server problems that would account for this behavior in the BOINC client? If so, has WCG solved the problem, or are we still at risk of having it recur?

---- From the BOINC Developer's list:
I have had this problem with WCG myself recently. In all cases I
observed that it was a problem with the WCG server with large deferrals
on the client side after repeated failed connection attempts.

john

On Mon, 2008-02-04 at 18:32 -0500, David Anderson wrote:
> If the client doesn't report a completed and uploaded job
> within a day of its report deadline, that's a bug.
> If you see that with the current client, please let me know.
> -- David
>
> William wrote:
>
> > * Having BOINC automatically report ALL completed
> > tasks in a TIMELY manner. When I tested a prior
> > version of BOINC last summer by running it unattended
> > for a month or so on an otherwise unused Windows box,
> > at the end of the month it had failed to report some
> > of its 100% completed tasks - and the reporting
> > deadline for these tasks had already passed! (World
> > Community Grid, Fight Aids @ Home, under 24/7 network
> > availability conditions).
[Feb 6, 2008 9:46:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: WCG/FaaH server problems?

As far as I know, there is no problem. Properly configured clients report tasks at the next scheduler request, which is usually a couple of hours after the task finishes.

If anybody has a specific problem, then report it here and we will be happy to help solve the configuration problem or determine whether there is a larger problem.

I saw the message on the BOINC list, and I didn't understand what John was getting at. As David Anderson said:
If the client doesn't report a completed and uploaded job
within a day of its report deadline, that's a bug.
If you see that with the current client, please let me know.

Please let us know, as well.
[Feb 6, 2008 9:53:37 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: WCG/FaaH server problems?

I am not aware of any outages at this time. I am probably dooming myself at the moment but we have actually had a nice spell of two months with a very stable environment.

The only issues we have been working with uploads and downloads are in regards to people behind proxies but those folks have constant issues - not intermittent and we are making progress on resolving their issues (using the 5.10.40 client)

If someone is having issues (with a 5.10 client), then they should post their messages log here and we will take a look.
----------------------------------------
[Edit 1 times, last edit by knreed at Feb 6, 2008 10:07:41 PM]
[Feb 6, 2008 10:06:56 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Papa3
Senior Cruncher
Joined: Apr 23, 2006
Post Count: 360
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: WCG/FaaH server problems?

BOINC 5.10.30

2/20/2008 3:24:06 AM|World Community Grid|Sending scheduler request: To report completed tasks. Requesting 74649191713 seconds of work, reporting 2 completed tasks
2/20/2008 3:24:33 AM|World Community Grid|[work_fetch_debug] compute_work_requests(): work req 74649191735.295700, shortfall 74649191735.295700, urgency Need
2/20/2008 3:25:02 AM|World Community Grid|Scheduler request succeeded: got 0 new tasks
2/20/2008 3:25:02 AM|World Community Grid|Message from server: No work could be sent.
2/20/2008 3:25:02 AM|World Community Grid|Message from server: No work is available for FightAIDS@Home
[Feb 20, 2008 8:34:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: WCG/FaaH server problems?

Hi Papa3,

yes, my clients have been receiving this message since about 4 UTC this morning for all projects on windows. There have been work volume problems causing for the servers to choke. Sit back and hopefully it will self recover. FAAH jobs with longer run times are in the pipeline to reduce the server loads but wont reach production until probably tonight.

sorry
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Feb 20, 2008 8:49:21 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: WCG/FaaH server problems?

Just a bit from the post:

---- From the BOINC Developer's list:
I have had this problem with WCG myself recently. In all cases I
observed that it was a problem with the WCG server with large deferrals
on the client side after repeated failed connection attempts.

john

> William wrote:
>
> > When I tested a prior version of BOINC last summer
> > by running it unattended for a month or so on an
> > otherwise unused Windows box, at the end of the month
> > it had failed to report some of its 100% completed tasks

In addition to possible reason being the deferrals, (very) past versions of Boinc clients were also occasionally getting stuck forever in any scheduler request (mostly to an unresponsive server). The last request apparently never got a response and the client continued tu crunch and upload finished results, until the cache eventually emptied.

I (and not only me) have been observing this on long running Linux clients and am still now on 5.10.28, but forgot whether/when it happened on Windows.
[Feb 20, 2008 4:10:16 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread