Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 68
Posts: 68   Pages: 7   [ Previous Page | 1 2 3 4 5 6 7 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 6897 times and has 67 replies Next Thread
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

Mike,

Once a quorum 3 / valid status has been achieved and all open 'in progress' work units have returned, WU's will still be listed for an addititional 4 days and then drop off the Result Status page.

cheers
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Nov 27, 2006 10:55:49 PM]
[Nov 27, 2006 10:54:57 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

I noticed I had several invalids in a row when attempting to test my new laptop a few days ago. It seemed like it was running fine, and even managed to complete and return several valid units.

Just a guess, but I seemed like it started when I started hitting the suspend button to pause the unit mid-processing.

Here's my data in case it might be useful...I've stopped running it on my laptop as I don't want to overheat it, but it would be interesting to see if there was something actually causing this...



Core 2 Duo Mobile, 2 gigs of ram, Windows MCE, BOINC 5.4.11
Computer ID: 85462


B05560_ 0168_ CTMA4A-37-23-2-c2 Invalid 11/22/2006 10:21:38 11/24/2006 18:41:59 3.41 33 / 9
B05560_ 0049_ CTMA4A-37-14-10-c2 Invalid 11/22/2006 10:16:26 11/24/2006 18:41:59 3.85 37 / 6
B05560_ 0126_ CTMA4A-37-2-18-c2 Invalid 11/22/2006 08:31:02 11/23/2006 18:00:56 3.10 30 / 11
B05559_ 0195_ CTMA4A-37-25-21-c1 Invalid 11/22/2006 07:18:15 11/23/2006 18:24:07 5.18 50 / 12


Interesting to note, the units that were invalid all took over 3 hours, while the units that were valid were about 2 hours or less. Also, I noticed with the last unit I attempted to complete on this machine, it was over 50% complete, and then I rebooted, and the unit started over. When it started over, it seemed to take even longer to reach the 50% mark, and so I simply aborted the unit, figuring it would probably end up invalid like the others anyhow.
[Nov 28, 2006 4:58:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

Mousie,

it is really important to look at the quorum detail in the Result Status page by clicking on the work unit name. If a 4th or subsequent copy was send out after the return of your WU, u can be assured that the yours was broke. Just look at the return timestamps versus any send timestamps.

Windows MCE is new to the forum reports, guess its the Media Centre version.

None of your claims seem outlandish for a HDC

The duration varies wildly on HDC... from just over 1 hour to 5 hours and more on my 2.53gz P4 HT. If u see WU's with SEWELL1A in it, add a further hour to the expectation.

On temp problem, find posts with ThreadMaster.... does a bang up job in throttling BOiNC.....really smooth. It just uses an X percent of the available CPU time, which reduces temperature.

The past 50%, then return to 0% has been observed by a few more... see other thread. U should also see that there was no loss of CPU time. The loss of progress is just the nature of the beast. Using the hibernation function, not using electricity when machine is off, allows restart at the exact same progress point.

Is there any real CPU throttle function on the laptop as in, reducing the clock speed?

Let us know.

Cheers
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Nov 28, 2006 7:23:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

Mousie, since it is a laptop then there is a reasonable chance it was caused by overheating. I can't offer a good explanation for why it would affect some WUs but not others except to say your notebook may be just on the borderline, sometimes it overheats while other times it does not. Did you notice the CPU temperature while crunching?

Many motherboard manufacturers have free temperature monitoring utilities you can download from their website. There is also the most exellent Speedfan, 3rd party software, which monitors temperature of almost all motherboard brands. In addition to monitoring temperature, such utilities usually allow you to speed up the cooling fan(s) sooner. In fact your notebook may already have such a utility installed by the manufacturer or available from their website.

You might also try throttling the CPU with Threadmaster. If you search these fora for Threadmaster you'll find links to advice on setting it up. From what I've heard it works well but can be a bit tricky to setup. Also, the 5.6.x alpha versions of BOINC have a built in throttle but it's reported to not work very well. They claim 5.8 will see the throttle fixed but perhaps the current alpha, 5.7.x already has a proper throttle.

Yet another alternative is to use UD which has an easy to use throttle that is known to work well.

---
[Nov 28, 2006 7:30:24 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

t is really important to look at the quorum detail in the Result Status page by clicking on the work unit name. If a 4th or subsequent copy was send out after the return of your WU, u can be assured that the yours was broke. Just look at the return timestamps versus any send timestamps.

Yes, there was a 4th copy sent out, and in the case of the unit I aborted, a fifth copy.

You are correct, MCE is Media Center Edition - it's basically Win XP with more media focus.


Is there any real CPU throttle function on the laptop as in, reducing the clock speed?

When it is idle, it does appear to run at half speed, however when in use it seems to run at full speed, as far as I can tell.

Mousie, since it is a laptop then there is a reasonable chance it was caused by overheating. I can't offer a good explanation for why it would affect some WUs but not others except to say your notebook may be just on the borderline, sometimes it overheats while other times it does not. Did you notice the CPU temperature while crunching?

The temperatures I saw being reported weren't too bad, but I don't know how much I trust the numbers it was reporting. 51-57C usually.
[Nov 28, 2006 12:20:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

Mousie,

experiment with switching off power saving features when u r hanging on the 110 / 220 juice. Best place is likely in the BIOS, but there is often software like Speedstep that controls it....depends on the brand.

Just reading some lappies running at 80 / 100c continuously...57c can't fry an egg. For a solid temp check, try out Speedfan.... www.amico.com. The latest is now also able to deal with the Core 2 Duo's.

cheers
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Nov 28, 2006 12:30:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

First post edited with computer IDs and new examples. Thanks.
[Nov 28, 2006 2:26:18 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

I, too, have been getting a large amount of "Invalids". All clients are the standard client.

Right now I have 298 Invalids.

Filter by:
Device Name: All Result Status: Invalid
Page: 1 2 3 4 5 6 ... 12 Last >


I thought this was being worked on.
I can't afford to pour $$ into electricity and equipment to have the results "tossed aside" and denigrated.
[Nov 28, 2006 2:30:25 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

DCManiak,

I just looked at some of your computers. Most are doing just fine and returning lots of valid results. However, you have at least one that is returning almost entirely invalid resuts such as: 'dfiado'.

Use the filter by device name to check each computer.

Kevin
[Nov 28, 2006 2:53:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Problem: Invalid Working Units in large numbers. Please help.

I noticed I had several invalids in a row when attempting to test my new laptop a few days ago...

Just a guess, but I seemed like it started when I started hitting the suspend button to pause the unit mid-processing.


Mousie,

I've had this problem too on HDC WU's, and Sekerob has commented on it in the HDC thread.

The HDC checkpointing doesn't work too well, so if you suspend when you are well into a task, it can start over. Unfortunately, your CPU time does not start over, so when you put in your (valid) claim, you get penalised for being a statistical outlier.

I think.

Jonathan.
[Nov 28, 2006 6:21:15 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 68   Pages: 7   [ Previous Page | 1 2 3 4 5 6 7 | Next Page ]
[ Jump to Last Post ]
Post new Thread