Thread Status: Active
Total posts in this thread: 131
Posts: 131   Pages: 14   [ Previous Page | 5 6 7 8 9 10 11 12 13 14 | Next Page ]
This topic has been viewed 603347 times and has 130 replies
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7580
Status: Recently Active
Re: Project Status (First Post Updated)

Still dry here at 21:27 UTC

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Nov 28, 2023 9:27:52 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Re: Project Status (First Post Updated)

I got an MCM resend. It looks like it is ramping up: more is being sent out over time, but it will take a while to fill everyone's caches.
[Nov 29, 2023 1:16:47 AM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Re: Project Status (First Post Updated)

MCM and OPNG are flowing, but not enough. I get a WU here and there, but rarely. Please bring back ARP soon.
[Dec 2, 2023 3:40:58 AM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 873
Status: Offline
Re: Project Status (First Post Updated)

Scheduler requests started to report something odd this afternoon, and I happened to catch the transition between "working, but no work available" and "something is wrong" states on one of my systems...

Sat 02 Dec 2023 15:17:20 GMT | World Community Grid | Requesting new tasks for CPU and NVIDIA GPU
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | Scheduler request completed: got 0 new tasks
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks sent
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks are available for OpenPandemics - COVID 19
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks are available for OpenPandemics - COVID-19 - GPU
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks are available for Africa Rainfall Project
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks are available for Help Stop TB
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks are available for Mapping Cancer Markers
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks are available for Smash Childhood Cancer
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | Tasks for AMD/ATI GPU are available, but your preferences are set to not accept them
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | Tasks for Intel GPU are available, but your preferences are set to not accept them
Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | Project requested delay of 121 seconds
Sat 02 Dec 2023 15:19:28 GMT | World Community Grid | Sending scheduler request: To fetch work.
Sat 02 Dec 2023 15:19:28 GMT | World Community Grid | Requesting new tasks for CPU and NVIDIA GPU
Sat 02 Dec 2023 15:19:31 GMT | World Community Grid | Scheduler request completed: got 0 new tasks
Sat 02 Dec 2023 15:19:31 GMT | World Community Grid | Another scheduler instance is running for this host
Sat 02 Dec 2023 15:19:31 GMT | World Community Grid | Project requested delay of 121 seconds

Ever since then, requests have seen the same response (apart from the occasional HTTP error). Uploads still seem to work, but with scheduler access broken they can't be reported and no new work can be requested :-(
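
For anyone wanting to spot the same transition in their own event log, here is a rough Python sketch. It assumes lines in the "timestamp | project | message" layout shown above; the state names ("no_work", "lock_problem", "http_error") and the function names are my own labels for illustration, not anything BOINC defines.

```python
def parse_line(line):
    """Split 'timestamp | project | message' into three stripped fields."""
    parts = [p.strip() for p in line.split("|", 2)]
    if len(parts) != 3:
        return None
    return tuple(parts)


def classify(msg):
    """Map a scheduler message to a rough state label (labels are hypothetical)."""
    if msg.startswith("Another scheduler instance is running"):
        return "lock_problem"
    if msg == "No tasks sent" or msg.startswith("No tasks are available"):
        return "no_work"
    if "HTTP service unavailable" in msg or "Failure when receiving data" in msg:
        return "http_error"
    return "other"


def first_transition(lines, to_state="lock_problem"):
    """Return the timestamp of the first line classified as to_state, or None."""
    for line in lines:
        parsed = parse_line(line)
        if parsed and classify(parsed[2]) == to_state:
            return parsed[0]
    return None


log = [
    "Sat 02 Dec 2023 15:17:22 GMT | World Community Grid | No tasks sent",
    "Sat 02 Dec 2023 15:19:31 GMT | World Community Grid | Another scheduler instance is running for this host",
]
print(first_transition(log))
```

Fed the two sample lines, it reports the 15:19:31 entry as the first "something is wrong" response.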

Other users have already noted this in a thread in the Mapping Cancer Markers forum.

[Edit] A bit of research on this shows that it's a lock-file issue: scheduler requests use a per-host lock file to ensure that there aren't two concurrent requests from one host. The file is created at the start of the request, holds the PID of the scheduler instance, and is deleted at the end of the request.

There are two possible error conditions, one of which is that the lock file can't be acquired in the first place, the other that there is an existing lock. Unfortunately, although the message written to the server log distinguishes the two cases, the message sent to the client does not.

In this case, I suspect the issue is an inability to create the lock file in the first place :-(
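
To make the two failure modes concrete, here is a minimal Python sketch of a per-host lock file. It is not the BOINC server's code: the file naming, the use of exclusive file creation, the exception classes and the server-log wording are all assumptions for illustration. Only the client-facing message text is taken from the log above.

```python
import os


class LockUnavailable(Exception):
    """The lock file could not be created at all (missing directory, full disk...)."""


class LockHeld(Exception):
    """A lock file for this host already exists."""


def acquire_host_lock(lock_dir, host_id):
    """Create the per-host lock file and write our PID into it."""
    path = os.path.join(lock_dir, f"host_{host_id}.lock")  # hypothetical naming
    try:
        # O_EXCL makes creation fail if the file is already there.
        fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY, 0o644)
    except FileExistsError:
        raise LockHeld(path) from None
    except OSError as exc:
        raise LockUnavailable(f"{path}: {exc.strerror}") from None
    with os.fdopen(fd, "w") as f:
        f.write(str(os.getpid()))
    return path


def release_host_lock(path):
    """Delete the lock file at the end of the request."""
    os.remove(path)


def server_log_message(exc):
    """The server log can tell the two cases apart (wording is made up)."""
    if isinstance(exc, LockHeld):
        return f"lock already held: {exc}"
    return f"can't create lock file: {exc}"


def client_message(exc):
    """The client sees the same text whichever case occurred."""
    return "Another scheduler instance is running for this host"
```

In this model, a lock directory that is missing or unwritable would make every request fail with LockUnavailable, yet each client would still be told that another scheduler instance is running.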

Cheers - Al.
----------------------------------------
[Edit 1 times, last edit by alanb1951 at Dec 2, 2023 10:02:17 PM]
[Dec 2, 2023 9:38:43 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Re: Project Status (First Post Updated)

TigerLily! HELP!

No WUs are flowing, and no completed WUs can be returned. Al's post above would be useful info for the techs.
[Dec 2, 2023 11:59:00 PM]
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12146
Status: Offline
Re: Project Status (First Post Updated)

Two messages received recently.

03/12/2023 02:34:02 | World Community Grid | Another scheduler instance is running for this host

03/12/2023 01:54:34 | World Community Grid | Not requesting tasks: don't need (CPU: not highest priority project; Intel GPU: )
03/12/2023 01:54:39 | World Community Grid | Scheduler request to https://scheduler.worldcommunitygrid.org/boinc/wcg_cgi/fcgi failed: HTTP service unavailable

Mike
[Dec 3, 2023 2:38:04 AM]
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7580
Status: Recently Active
Re: Project Status (First Post Updated)

I wonder if this particular hiccup is related to the validated-but-not-purged problem. Since my oldest work unit is from Oct. 24, I looked, and there have been 49,999,047 units returned. Makes me wonder if some file ran out of space or somehow caused the system to lock up. It might just be a coincidence, or it may have nothing to do with the current problem. However, inquiring minds want to know, even if we are blessed with the current and ongoing lack of communication.

Cheers

Edit: Makes me wonder if the powers that be prohibit communication on the weekends.
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
----------------------------------------
[Edit 1 times, last edit by Sgt.Joe at Dec 3, 2023 3:20:18 AM]
[Dec 3, 2023 3:19:12 AM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Re: Project Status (First Post Updated)

I had a similar thought, Sgt. Joe. But wouldn't we have gotten a disk-full message? I just wondered whether it would show up as a different error message.

I don't think they prohibit weekend communication; they just don't require it on the weekends. The server room techs don't work weekends, as we have learned in past downtimes.

They have stronger personal work/life boundaries than most American tech workers I know. It's refreshing to see.
[Dec 3, 2023 4:28:26 AM]
The_Mole
Cruncher
Joined: Nov 10, 2007
Post Count: 17
Status: Offline
Re: Project Status (First Post Updated)

Tasks keep trickling in, but nowhere near enough, and it seems like all the hosts currently requesting work are bottlenecking the server:

Yesterday evening my PC had 82 tasks ready to report, but...
12/2/2023 8:50:52 PM | World Community Grid | Sending scheduler request: To report completed tasks.
12/2/2023 8:50:52 PM | World Community Grid | Reporting 82 completed tasks
12/2/2023 8:50:52 PM | World Community Grid | Requesting new tasks for CPU and NVIDIA GPU
12/2/2023 8:50:59 PM | | Project communication failed: attempting access to reference site
12/2/2023 8:50:59 PM | World Community Grid | Scheduler request to https://scheduler.worldcommunitygrid.org/boinc/wcg_cgi/fcgi failed: Failure when receiving data from the peer
12/2/2023 8:51:00 PM | | Internet access OK - project servers may be temporarily down.

Overnight 70 were uploaded; 12 are still left:
12/3/2023 10:53:57 AM | World Community Grid | Scheduler request to https://scheduler.worldcommunitygrid.org/boinc/wcg_cgi/fcgi failed: HTTP service unavailable

(Timestamps are GMT +1)

A couple of days ago, I attached Einstein@home with a resource share of 0. That way it only fills up idle threads without creating a waiting queue, and any WCG tasks will displace its tasks once they come in.
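
As a toy model of that behaviour (not the BOINC client's actual work-fetch logic; the function and parameter names are made up for illustration), the number of backup tasks worth fetching is just the number of threads the main project leaves idle:

```python
def backup_tasks_to_fetch(cpu_threads, primary_tasks_on_hand, backup_tasks_on_hand):
    """Toy model: a zero-share project only tops up idle threads, never a queue."""
    idle_threads = max(cpu_threads - primary_tasks_on_hand, 0)
    return max(idle_threads - backup_tasks_on_hand, 0)


# 8 threads, 3 WCG tasks on hand, no backup tasks yet: ask for 5 backup tasks.
print(backup_tasks_to_fetch(8, 3, 0))
# WCG work is back and covers every thread: ask for nothing.
print(backup_tasks_to_fetch(8, 10, 0))
```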
[Dec 3, 2023 10:18:46 AM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 859
Status: Offline
Re: Project Status (First Post Updated)

I'm also on Einstein at resource share 0, looking for pulsars until WCG gets fixed. I love that BOINC has a mechanism for a backup project or two.

You can look at https://wuprop.boinc-af.org/active_projects.py to find an active project for a backup. I also find it weird that this site found an ARP and an OPN ghost WU.
[Dec 3, 2023 4:34:42 PM]