| World Community Grid Forums
Thread Status: Active Total posts in this thread: 68
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Mike,
Once a quorum of 3 / valid status has been achieved and all open 'in progress' work units have returned, WUs will still be listed for an additional 4 days and then drop off the Result Status page. Cheers
WCG
----------------------------------------
Please help to make the Forums an enjoyable experience for All! [Edit 1 times, last edit by Sekerob at Nov 27, 2006 10:55:49 PM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I noticed I had several invalids in a row when attempting to test my new laptop a few days ago. It seemed like it was running fine, and even managed to complete and return several valid units.
Just a guess, but it seemed like it started when I began hitting the suspend button to pause the unit mid-processing. Here's my data in case it might be useful. I've stopped running it on my laptop as I don't want to overheat it, but it would be interesting to see if something was actually causing this.

Core 2 Duo Mobile, 2 gigs of RAM, Windows MCE, BOINC 5.4.11
Computer ID: 85462

B05560_ 0168_ CTMA4A-37-23-2-c2 Invalid 11/22/2006 10:21:38 11/24/2006 18:41:59 3.41 33 / 9
B05560_ 0049_ CTMA4A-37-14-10-c2 Invalid 11/22/2006 10:16:26 11/24/2006 18:41:59 3.85 37 / 6
B05560_ 0126_ CTMA4A-37-2-18-c2 Invalid 11/22/2006 08:31:02 11/23/2006 18:00:56 3.10 30 / 11
B05559_ 0195_ CTMA4A-37-25-21-c1 Invalid 11/22/2006 07:18:15 11/23/2006 18:24:07 5.18 50 / 12

Interesting to note: the units that were invalid all took over 3 hours, while the units that were valid took about 2 hours or less.

Also, the last unit I attempted to complete on this machine was over 50% complete when I rebooted, and the unit started over. After it started over, it seemed to take even longer to reach the 50% mark, so I simply aborted the unit, figuring it would probably end up invalid like the others anyhow. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Mousie,
It is really important to look at the quorum detail in the Result Status page by clicking on the work unit name. If a 4th or subsequent copy was sent out after the return of your WU, you can be assured that yours was broken. Just compare the return timestamps against any send timestamps.

Windows MCE is new to the forum reports; I guess it's the Media Centre version. None of your claims seem outlandish for HDC. The duration varies wildly on HDC, from just over 1 hour to 5 hours and more on my 2.53 GHz P4 HT. If you see WUs with SEWELL1A in the name, add a further hour to the expectation.

On the temperature problem, find the posts about ThreadMaster. It does a bang-up job of throttling BOINC, really smooth. It just uses X percent of the available CPU time, which reduces temperature.

The "past 50%, then back to 0%" behaviour has been observed by a few others; see the other thread. You should also see that there was no loss of CPU time. The loss of progress is just the nature of the beast. Using the hibernation function (which uses no electricity while the machine is off) allows a restart at the exact same progress point.

Is there any real CPU throttle function on the laptop, as in reducing the clock speed? Let us know. Cheers
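The timestamp comparison described above can be sketched in Python. This is only an illustration: the `Copy` record, its field names, the device names and the other crunchers' timestamps are hypothetical, not something the Result Status page exports. Only the first sent/returned pair is taken from the results posted earlier in the thread; you would type in whatever values you read off the quorum detail page.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional

QUORUM = 3  # quorum size mentioned in the thread; adjust if your project differs


@dataclass
class Copy:
    """One copy of a work unit, as read off the quorum detail page (hypothetical record)."""
    device: str
    sent: datetime
    returned: Optional[datetime] = None


def extra_copies_sent_after(mine: Copy, copies: List[Copy], quorum: int = QUORUM) -> List[Copy]:
    """Return the 4th-and-later copies (by send time) that went out after `mine` came back."""
    if mine.returned is None:
        return []  # nothing to compare until the result has been returned
    ordered = sorted(copies, key=lambda c: c.sent)
    extras = ordered[quorum:]  # copies beyond the initial quorum
    return [c for c in extras if c.sent > mine.returned]


def ts(text: str) -> datetime:
    """Parse a timestamp in the MM/DD/YYYY HH:MM:SS layout used in the posts above."""
    return datetime.strptime(text, "%m/%d/%Y %H:%M:%S")


# First pair is from the thread; the three other copies are made-up sample values.
mine = Copy("my-laptop", ts("11/22/2006 10:21:38"), ts("11/24/2006 18:41:59"))
copies = [
    mine,
    Copy("other-1", ts("11/22/2006 10:20:00"), ts("11/23/2006 09:00:00")),
    Copy("other-2", ts("11/22/2006 10:22:00"), ts("11/23/2006 12:00:00")),
    Copy("other-3", ts("11/24/2006 19:00:00")),
]
print([c.device for c in extra_copies_sent_after(mine, copies)])  # ['other-3']
```

An empty list does not prove a result was good; it only means no extra copy went out after yours came back.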
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Mousie, since it is a laptop then there is a reasonable chance it was caused by overheating. I can't offer a good explanation for why it would affect some WUs but not others except to say your notebook may be just on the borderline, sometimes it overheats while other times it does not. Did you notice the CPU temperature while crunching?
Many motherboard manufacturers have free temperature monitoring utilities you can download from their website. There is also the most excellent Speedfan, 3rd party software, which monitors the temperature of almost all motherboard brands. In addition to monitoring temperature, such utilities usually allow you to speed up the cooling fan(s) sooner. In fact, your notebook may already have such a utility installed by the manufacturer or available from their website.

You might also try throttling the CPU with Threadmaster. If you search these fora for Threadmaster you'll find links to advice on setting it up. From what I've heard it works well but can be a bit tricky to set up.

Also, the 5.6.x alpha versions of BOINC have a built-in throttle, but it's reported to not work very well. They claim 5.8 will see the throttle fixed, but perhaps the current alpha, 5.7.x, already has a proper throttle.

Yet another alternative is to use UD, which has an easy-to-use throttle that is known to work well. |
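The general idea behind a throttle that "uses X percent of the available CPU time" can be sketched as a duty cycle: work for a fixed share of each short period, then sleep for the rest. This is not how ThreadMaster, BOINC or UD actually implement it; the names `duty_cycle` and `run_throttled` and the 1-second period are assumptions made purely for illustration.

```python
import time
from typing import Callable, Tuple


def duty_cycle(percent: float, period_s: float = 1.0) -> Tuple[float, float]:
    """Split one period into (busy seconds, idle seconds) for a given CPU share."""
    if not 0 < percent <= 100:
        raise ValueError("percent must be in (0, 100]")
    busy = period_s * percent / 100.0
    return busy, period_s - busy


def run_throttled(step: Callable[[], bool], percent: float, period_s: float = 1.0) -> None:
    """Call step() repeatedly, working for roughly `percent` of each period.

    step() should do a small chunk of work and return False once it is finished.
    """
    busy, idle = duty_cycle(percent, period_s)
    while True:
        deadline = time.monotonic() + busy
        while time.monotonic() < deadline:
            if not step():
                return
        if idle > 0:
            time.sleep(idle)  # the CPU rests here, which is what lowers the temperature


# Example: a toy job of 5 small chunks, limited to about 60% of each 0.1 s period.
remaining = [5]

def chunk() -> bool:
    remaining[0] -= 1
    return remaining[0] > 0

run_throttled(chunk, percent=60, period_s=0.1)
```

A shorter period gives smoother behaviour at the cost of more frequent wake-ups; the right value depends on the machine.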
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"It is really important to look at the quorum detail in the Result Status page by clicking on the work unit name. If a 4th or subsequent copy was sent out after the return of your WU, you can be assured that yours was broken. Just compare the return timestamps against any send timestamps."

Yes, there was a 4th copy sent out, and in the case of the unit I aborted, a fifth copy. You are correct, MCE is Media Center Edition - it's basically Win XP with more media focus.

"Is there any real CPU throttle function on the laptop, as in reducing the clock speed?"

When it is idle, it does appear to run at half speed; however, when in use it seems to run at full speed, as far as I can tell.

"Mousie, since it is a laptop then there is a reasonable chance it was caused by overheating. I can't offer a good explanation for why it would affect some WUs but not others except to say your notebook may be just on the borderline, sometimes it overheats while other times it does not. Did you notice the CPU temperature while crunching?"

The temperatures I saw being reported weren't too bad, but I don't know how much I trust the numbers it was reporting. 51-57C usually. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Mousie,
Experiment with switching off the power-saving features when you are running on the 110 / 220 juice. The best place is likely the BIOS, but there is often software such as SpeedStep that controls it; it depends on the brand. I've just been reading of some lappies running at 80 / 100C continuously... 57C can't fry an egg. For a solid temp check, try out Speedfan: www.almico.com. The latest version is now also able to deal with the Core 2 Duos. Cheers
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
First post edited with computer IDs and new examples. Thanks.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I, too, have been getting a large number of "Invalids". All clients are the standard client.
Right now I have 298 Invalids.
Filter by: Device Name: All Result Status: Invalid Page: 1 2 3 4 5 6 ... 12 Last >
I thought this was being worked on. I can't afford to pour $$ into electricity and equipment to have the results "tossed aside" and denigrated. |
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline
|
DCManiak,
I just looked at some of your computers. Most are doing just fine and returning lots of valid results. However, you have at least one, 'dfiado', that is returning almost entirely invalid results. Use the filter by device name to check each computer.
Kevin |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
"I noticed I had several invalids in a row when attempting to test my new laptop a few days ago... Just a guess, but it seemed like it started when I started hitting the suspend button to pause the unit mid-processing."

Mousie, I've had this problem too on HDC WUs, and Sekerob has commented on it in the HDC thread. The HDC checkpointing doesn't work too well, so if you suspend when you are well into a task, it can start over. Unfortunately, your CPU time does not start over, so when you put in your (valid) claim, you get penalised for being a statistical outlier. I think.
Jonathan. |