Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 57
Posts: 57   Pages: 6   [ Previous Page | 1 2 3 4 5 6 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 11563 times and has 56 replies Next Thread
DSL Freak
Advanced Cruncher
USA
Joined: Feb 12, 2013
Post Count: 62
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: errors

I got a batch of 20378 WUs on 3 different machines that are all running great. Previously all 3 Windows 10 machines were erroring out with the last batches.
----------------------------------------
Crunchin' for a cure!
[Jan 20, 2016 12:53:40 AM]   Link   Report threatening or abusive post: please login first  Go to top 
yangbomb
Cruncher
Joined: Aug 6, 2015
Post Count: 16
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: errors

Going to keep crunching on wcg after I finished running some of the seti tasks.
Thanks Uplinger smile
----------------------------------------

[Jan 20, 2016 1:38:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Reply to this Post  Reply with Quote 
Re: errors

Afternoon stats for the past 4 days
Day Runtime,            Points,         Results
Sun 205:268:17:44:10 323,759,743 408,886 1:297:18:49:17
Mon 223:110:02:20:11 350,997,390 480,999 1:333:15:32:24
Tue 223:118:13:49:28 349,141,627 443,008 1:322:12:05:04
Wen 157:023:01:02:48 244,557,793 685,715 1:287:06:00:17
The price of quality control problems... drop in production of 66 runtime years, and this is only for a single 12 hour crunching session.

(Whatever happened to the test-ahead of batch groups to prevent these occurances?)

cool

Edit: Oh yeah, and the FAH2 validator(s) appears to be off since about midnight UTC
----------------------------------------
[Edit 1 times, last edit by SekeRob* at Jan 20, 2016 3:49:14 PM]
[Jan 20, 2016 3:39:19 PM]   Link   Report threatening or abusive post: please login first  Go to top 
tombell12
Advanced Cruncher
Australia
Joined: Oct 8, 2009
Post Count: 87
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: errors

Found out that most of my errored workunits had a tiny amount of credit granted to them. A few were still 0.0 but I thought that was interesting smile
[Jan 21, 2016 2:10:37 AM]   Link   Report threatening or abusive post: please login first  Go to top 
yoro42
Ace Cruncher
United States
Joined: Feb 19, 2011
Post Count: 8979
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: errors

Members should start seeing results going out for batch 20378 now. rbm and anyone having issues requesting more work, please let me know the message you are getting in your messages log. I may be able to manually boost you from my end.

Thanks,
-Uplinger



Uplinger,
Thanks for the update. I'll let mine continue to run.
Now I'm off to the Dentist which should be about as much fun as your having.

Haven't had WU error since the my last one reported in on 1/19/16 at 22:23 with credit granted.

Back to smooth sailing and the Dentist was not to bad either.
Thanks Uplinger
----------------------------------------

[Jan 21, 2016 11:11:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: errors

(Whatever happened to the test-ahead of batch groups to prevent these occurances?)


Properly written code should check parameters and never crash with segmentation violations too. It's that kind of sloppy coding that leads to buffer overrun exploits, etc.
[Jan 22, 2016 7:09:25 PM]   Link   Report threatening or abusive post: please login first  Go to top 
RichSavarie
Cruncher
Canada
Joined: Aug 9, 2005
Post Count: 49
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: errors

Well, here's a new one, at least to me. First time I've seen this:
16-Feb-2016 11:33:42 | World Community Grid | Task MCM1_0020911_4562_0 exited with zero status but no 'finished' file
16-Feb-2016 11:33:42 | World Community Grid | If this happens repeatedly you may need to reset the project.
16-Feb-2016 11:38:46 | World Community Grid | Computation for task MCM1_0020911_4562_0 finished

What's that all about, 'eh?
[Feb 16, 2016 5:01:07 PM]   Link   Report threatening or abusive post: please login first  Go to top 
SekeRob
Master Cruncher
Joined: Jan 7, 2013
Post Count: 2741
Status: Offline
Reply to this Post  Reply with Quote 
Re: errors

Let's be adult about this one... it's been discussed a zillion times on these forums, so can easily be found [Boot, or you could follow the instruction... if this happens frequently, reset project ;O]. Also, there's an FAQ on this in the Community maintained frequently asked questions.
[Feb 16, 2016 6:30:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
RichSavarie
Cruncher
Canada
Joined: Aug 9, 2005
Post Count: 49
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: errors

Apologies. I did attempt a search but was unsuccessful. My Google-Fu was weak on this one.

In any case, it hasn't happened again since but I am keeping an eye on it just out of curiosity.
[Feb 18, 2016 8:51:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
cornel
Cruncher
Joined: Jan 29, 2009
Post Count: 4
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: errors

Hello,

All my machines run MCM fine except one with Ubuntu 14.04.3 with the latest low latency kernel on top of XenServer.
On this machine almost all workunits error out and only a few are viewed as invalid.
The machine has an AMD Athlon 64 x2 6000+ CPU (yes, old) and 2GB of RAM available to the VM.
With other project there seemed to be no problem with this machine; today I tried a reset of the project, but still, WUs error out with a SIGSEGV message.
One example of such a workunit: https://secure.worldcommunitygrid.org/ms/devi....do?workunitId=1676444175
[May 1, 2016 8:30:50 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 57   Pages: 6   [ Previous Page | 1 2 3 4 5 6 | Next Page ]
[ Jump to Last Post ]
Post new Thread