Total posts in this thread: 484
This topic has been viewed 957199 times and has 483 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

Are these betas being sent only to reliable hosts, or to any hosts signed up for beta testing? If it's only reliable hosts, then another nice thing is that the errors aren't knocking the hosts out of reliable status. I'm continuing to get new betas on the machines that suffered -161 errors. :)

Betas go to all machines, regardless of reliability state... to also test the not-so-reliable machines and see whether they're up to the [new project] jobs. All it takes is opting in. :D

I think production reliability is not affected by Beta failures; at least, I've had true Beta errors during this run and am still getting production short-deadline tasks.

--//--

Edit: BOINCTasks history now shows three -161 errors and 2 server-aborted Betas.
----------------------------------------
[Edit 2 times, last edit by Former Member at Jul 2, 2011 4:48:56 AM]
[Jul 2, 2011 4:43:04 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

I think the 6.10.59 scheduler is a little lost, suffering extensive dichotomy... 15 tasks have now run partially, 7 of which are CEP2, with 4 of those progressing in High Priority. Linux is holding up, but I've never seen BOINC using 2.9GB of virtual memory, with total disk use including temp files at 5.13GB on top of 2.88GB of RAM (info per BOINCTasks and a remotely connected 6.12.33 BOINC Manager). Hope the seams hold.

--//--
[Jul 2, 2011 9:58:42 AM]
sk..
Master Cruncher
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

I would stay away from that system for a while; poking the KB never helps when your drive is being used for memory. Alternatively, suspending non-started tasks would prevent the situation from deteriorating further, and things will get easier on the system as tasks finish. Pity BOINC can't see a disaster coming.
At least there is no chance of your 15 partial runs being server aborted ;)

Are we to take it that this Beta run is at an end?
[Jul 2, 2011 10:44:35 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

I've seen only 4 of the 6 active sciences. That said, Clean Water is not really of "loading" interest, because the only thing that comes down is an instruction to compute an x/y/z seed from the table already on the hosts. HCMD2 is also not that hefty, but the exercise would have to be repeated in full come Tuesday 'cause "the load could not be simulated" ;P

As for suspending, I did that this morning to force ahead those CEP2 tasks that already had 5-6-7 hours down, to get the quickest stress relief, but from there I'll just let it run... I think the machine will survive without too much trouble. The setup uses 2 physical disks to spread the load.

--//--
[Jul 2, 2011 10:57:30 AM]
johncmacalister2010@gmail.com
Veteran Cruncher
Canada
Joined: Nov 16, 2010
Post Count: 799
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

Both my Linux PCs run mostly unattended. I did see this morning one task 'aborted by project' and one which appeared to take some time downloading. Unfortunately, I cannot remember the project name/s - too sleepy to record.
----------------------------------------


crunching, crunching, crunching.

AMD Ryzen 5 2600 6-core Processor with Windows 11 64 Pro.

AMD Ryzen 7 3700X 8-Core Processor with Windows 11 64 Pro (part time)


[Jul 2, 2011 11:19:22 AM]
anhhai
Veteran Cruncher
Joined: Mar 22, 2005
Post Count: 839
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

I've seen only 4 of the 6 active sciences. That said, Clean Water is not really of "loading" interest, because the only thing that comes down is an instruction to compute an x/y/z seed from the table already on the hosts. HCMD2 is also not that hefty, but the exercise would have to be repeated in full come Tuesday 'cause "the load could not be simulated" ;P

As for suspending, I did that this morning to force ahead those CEP2 tasks that already had 5-6-7 hours down, to get the quickest stress relief, but from there I'll just let it run... I think the machine will survive without too much trouble. The setup uses 2 physical disks to spread the load.

--//--



Restarting this beta from scratch? Sekerob, did you do the rain dance for me? Because I am 99.99999% guaranteed to get sapphire come Tuesday's batch of betas.
----------------------------------------

[Jul 2, 2011 12:32:45 PM]
sk..
Master Cruncher
Joined: Mar 22, 2007
Post Count: 2324
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

About 100min ago I had a temporary upload error for task BETA_op957_00027_17_0, but it resolved itself quickly enough.

02/07/2011 15:11:19 | World Community Grid | Computation for task BETA_op957_00027_17 finished
02/07/2011 15:11:22 | World Community Grid | Started upload of BETA_op957_00027_17_0
02/07/2011 15:11:45 | World Community Grid | Temporarily failed upload of BETA_op957_00027_17_0: connect() failed
02/07/2011 15:11:45 | World Community Grid | Backing off 10 min 17 sec on upload of BETA_op957_00027_17_0
02/07/2011 15:12:00 | | Project communication failed: attempting access to reference site
02/07/2011 15:12:02 | | Internet access OK - project servers may be temporarily down.
02/07/2011 15:20:39 | World Community Grid | update requested by user
02/07/2011 15:20:42 | World Community Grid | Sending scheduler request: Requested by user.
02/07/2011 15:20:42 | World Community Grid | Requesting new tasks for CPU and NVIDIA GPU
02/07/2011 15:20:46 | World Community Grid | Scheduler request completed: got 0 new tasks
02/07/2011 15:20:46 | World Community Grid | No work sent
02/07/2011 15:20:46 | World Community Grid | No work is available for Discovering Dengue Drugs - Together - Phase 2
02/07/2011 15:20:46 | World Community Grid | No work available for the applications you have selected. Please check your settings on the website.
02/07/2011 15:22:04 | World Community Grid | Started upload of BETA_op957_00027_17_0
02/07/2011 15:22:10 | World Community Grid | Finished upload of BETA_op957_00027_17_0
02/07/2011 15:22:13 | World Community Grid | Sending scheduler request: To report completed tasks.
02/07/2011 15:22:13 | World Community Grid | Reporting 1 completed tasks, requesting new tasks for CPU and NVIDIA GPU

Slightly odd 1 min 18 sec delay between the update request completing and the upload starting. This seems to happen more often with recent BOINC versions.
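As a rough check on that gap, the timestamps in the log above can be differenced directly. This is only a sketch: the variable names are made up for illustration, and reading the result as "the upload waited for the back-off timer rather than for the manual update" is an assumption about BOINC's behaviour, not something the log confirms.

```python
from datetime import datetime, timedelta

# Timestamp layout used by the client log above (day/month/year).
FMT = "%d/%m/%Y %H:%M:%S"

def ts(text):
    """Parse one log timestamp into a datetime."""
    return datetime.strptime(text, FMT)

# Values copied from the log lines; names are illustrative only.
backoff_start = ts("02/07/2011 15:11:45")     # "Backing off 10 min 17 sec"
backoff = timedelta(minutes=10, seconds=17)
update_requested = ts("02/07/2011 15:20:39")  # "update requested by user"
scheduler_done = ts("02/07/2011 15:20:46")    # "Scheduler request completed"
upload_restarted = ts("02/07/2011 15:22:04")  # second "Started upload"

backoff_expiry = backoff_start + backoff
gap_from_request = upload_restarted - update_requested
gap_from_reply = upload_restarted - scheduler_done
gap_after_backoff = upload_restarted - backoff_expiry

print("back-off expires at", backoff_expiry.time())
print("seconds from update request to upload:", gap_from_request.total_seconds())
print("seconds from scheduler reply to upload:", gap_from_reply.total_seconds())
print("seconds from back-off expiry to upload:", gap_after_backoff.total_seconds())
```

The 1 min 18 sec figure matches the interval measured from the scheduler reply; measured from the back-off expiry, the upload restarts within a couple of seconds, which would fit the back-off timer being what actually released it.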

I presently have 144 listed as Valid, though I think some may have been removed by this stage. I need another 15 days to take me to red. With 77 IP (including a few 12h tasks) and 30 in PV (about 7 days) I should get there, but it depends on which systems run which tasks - the HCC tasks don't take long, and there are not too many FAAH in the field. Guess I could soon be looking at a Red Beta badge, and for the next 3 years!

Anyway, I'm going to normalize settings and will try to remember to re-adjust accordingly on Tuesday for any would-be re-runs, if we get informed in advance, that is.

PS. It was odd watching the PV page change (<5min between each):
Page: < First 1 2 3 Last >
Page: < First 1 2 3 2 Last >
Page: < First 1 2

1, 2, 3, 2 :P
[Jul 2, 2011 4:10:21 PM]
littlepeaks
Veteran Cruncher
USA
Joined: Apr 28, 2007
Post Count: 748
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

I have a question for the WCG gurus -- since these betas seem to be real science WUs, but downloaded from a different server, are the WUs that are successfully completed considered successfully completed WUs for the science projects, or will they have to be crunched again as non-beta WUs?
[Jul 3, 2011 6:08:27 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

It would be good if the repair jobs didn't have such short deadlines. (19 hours and 12 mins)

For some sciences it's fine, but for some it's impossibly short. e.g. HPF2. Most systems seem to do them in 10-16 hrs. Even that makes 19 hrs a very tight deadline. My P4/2.6HT has done several of them, all taking slightly over 22 hours. Now it is doing a repair job with the 19:12 deadline. It'll be at least 3 hours late, so it'll just trigger another WU being sent out unnecessarily.
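The arithmetic behind that estimate can be sketched as follows. It assumes the task starts the moment it arrives and runs without interruption, which is the best case; the function name is hypothetical and the 22-hour runtime is the rounded figure from the post.

```python
from datetime import timedelta

def lateness(runtime, deadline):
    """Return how far past the deadline a task finishes (zero if on time)."""
    return max(runtime - deadline, timedelta(0))

repair_deadline = timedelta(hours=19, minutes=12)  # repair-job deadline quoted above
p4_runtime = timedelta(hours=22)                   # "slightly over 22 hours", rounded down
typical_slow = timedelta(hours=16)                 # upper end of the 10-16 hr range

print("P4 overshoot:", lateness(p4_runtime, repair_deadline))
print("16-hour host overshoot:", lateness(typical_slow, repair_deadline))
print("16-hour host slack:", repair_deadline - typical_slow)
```

Even in this best case the P4 overshoots by about 2 hours 48 minutes, so any queueing delay pushes it to the 3 hours or more mentioned above, while a 16-hour host is left with only a little over 3 hours of slack.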
[Jul 3, 2011 6:11:17 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Set of Beta WU's - General System Test, All Sciences

In production the scheduler would award extra grace time to the slower devices on top of the standard fraction. I don't know, though, how predictable the HPF2 times were, but the last HPF2 Beta on my quad is at 51% after 7.5 hours and due in 10... tight. Someone else will get a copy if it doesn't make it, and that copy will be canceled if it has not started before mine finishes and reports. Tasks that exceed their deadline are reported immediately, AFAICS.

I see your concern, but I'd not be too worried about the last efficiency squeeze; consider it part of the full robustness test of the added download servers and their integration with the overall system. After the initial hiccups, all BETA tasks that started here computed properly to the end [and I had about half a dozen that were server aborted for reasons I've not delved into]. Total worth: about 12 CPU days.
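A back-of-the-envelope way to judge how tight that is: extrapolate the remaining time linearly from the progress so far. This is only a sketch; it assumes progress is linear in time, which may well not hold for HPF2, and the function name is hypothetical.

```python
def remaining_hours(elapsed_hours, fraction_done):
    """Estimate remaining runtime, assuming progress is linear in time."""
    if not 0 < fraction_done <= 1:
        raise ValueError("fraction_done must be in (0, 1]")
    return elapsed_hours * (1 - fraction_done) / fraction_done

elapsed = 7.5      # hours run so far, from the post
progress = 0.51    # 51% done
due_in = 10.0      # hours until the deadline

remaining = remaining_hours(elapsed, progress)
margin = due_in - remaining

print(f"estimated remaining: {remaining:.1f} h")
print(f"estimated margin:    {margin:.1f} h")
```

Under that assumption the task needs roughly another 7.2 hours and finishes with a bit under 3 hours to spare, provided nothing else pre-empts it.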

--//--

Afterthought: Maybe in these types of Betas the distribution could be controlled so that tasks only go to those that have opted in for a particular science, i.e. if your P4-HT were not opted in to receive HPF2 in production, it would be best not to send it an HPF2 beta. Would that make sense? [Can see some cons to that suggestion too ;>]

edit: bolded for emphasis, general, all science betas :D
----------------------------------------
[Edit 1 times, last edit by Former Member at Jul 3, 2011 7:22:45 PM]
[Jul 3, 2011 6:38:40 PM]