| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 32
|
|
| Author |
|
|
rilian
Veteran Cruncher Ukraine - we rule! Joined: Jun 17, 2007 Post Count: 1460 Status: Offline Project Badges:
|
CandymanWCG , errors happen on HPF2, i've reported several myself in past...
----------------------------------------HPF2 works on different sets of input data and maybe this influences quite "high" error rate |
||
|
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges:
|
Roger that. I only got one error so far. I guess we'll see...
----------------------------------------Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() |
||
|
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges:
|
Darn! Just had another one...
---------------------------------------- Oh, well, I still have 3 other to crunch. Per aspera ad astra, right? But I hope that someone's working really hard to fix this, otherwise people will just give up on the project altogether and that's not going to help anyone. Cheers! Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() [Edit 2 times, last edit by CandymanWCG at Jan 16, 2011 10:16:35 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Candyman,
Fear not, the last version update reduced the error rate for the science by 90% to well below the alert threshold. As outlined in my previous post there are gotchas with this science which are at this time impossible to reproduce on the lab machines. Some members have this error, and only in Windows, leaving an opt-out for HPF2 or running a mix, so that incidental fails do not affect the device reliability. cheers |
||
|
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges:
|
Oooooh, snap!
----------------------------------------OK, so this project had a lot of errors. Point taken. It still has errors on Windows machines. Got it. Most of the errors occur without any real processing time being lost because it's almost instantaneous. Check. But what in the world am I suppose to think when I see that a task which ran for 5 hours gets an error result that looks like this: Result Name: ob337_ 00040_ 5-- <core_client_version>6.10.58</core_client_version> <![CDATA[ <message> Incorrect function. (0x1) - exit code 1 (0x1) </message> <stderr_txt> ERROR:: Exit at: .\nblist.cc line:711 </stderr_txt> ]]> And when I looked in the Messages tab, I could read this: 1/16/2011 12:06:49 PM World Community Grid Restarting task ob337_00040_5 using hpf2 version 617 1/16/2011 12:06:49 PM World Community Grid Restarting task ob342_00026_11 using hpf2 version 617 1/16/2011 12:08:01 PM World Community Grid Computation for task ob337_00040_5 finished 1/16/2011 12:08:01 PM World Community Grid Output file ob337_00040_5_0 for task ob337_00040_5 absent 1/16/2011 12:08:01 PM World Community Grid Starting ob392_00045_2 1/16/2011 12:08:01 PM World Community Grid Starting task ob392_00045_2 using hpf2 version 617 A few more details: I think this job was completed last night. I scheduled my PC to shutdown at a specified time. When I started the BOINC manager this morning, I think it tried to communicate with the server and then I got the error about it not being able to send the result. There were no other projects waiting in the queue just 3 other HPF2. The thing is, I really don't appreciate having spent 5 hours and now getting that error. Any advice? Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() [Edit 3 times, last edit by CandymanWCG at Jan 20, 2011 11:03:15 PM] |
||
|
|
Hypernova
Master Cruncher Audaces Fortuna Juvat ! Vaud - Switzerland Joined: Dec 16, 2008 Post Count: 1908 Status: Offline Project Badges:
|
There are other projects at WCG.I have Window platforms and after having made a lot of experiments I can advise HCC, HFCC, FA@H, HCMD2. These are rock solid, reliable and never in short supply of WU. With HPF2, C4CW, CEP2 I always had a lot of trouble. I did my best not to discriminate them and contributed to them an acceptable amount of runtime, but not more.
----------------------------------------![]() ![]() |
||
|
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges:
|
Hi Hypernova,
----------------------------------------I understand that there are other projects "worthy of the cause", but to me, they're not just a way to compete against other people and their machines, but what I like to think is that I'm actually contributing to these projects and I will eventually see the results. Therefore, I want to be able to choose the most critical projects (to me) and stick with them. I won't give up on the HPF2 project nor the other 3 projects where I have enlisted just like that. But it is definitely very frustrating to see that my computing time ends up in the trash can, that's why I'm thinking about workarounds or quick fixes, if there are any and if there's anyone out there that can provide them. To those people, I thank in advance! Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() [Edit 1 times, last edit by CandymanWCG at Jan 17, 2011 8:23:35 AM] |
||
|
|
sk..
Master Cruncher http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif Joined: Mar 22, 2007 Post Count: 2324 Status: Offline Project Badges:
|
Have you restarted yet?
Apart from running a mix of projects there is little else I can offer in terms of quick fix advice. It might be interesting to run these tasks with higher priority, to see if it helps. I guess somone already tried that. Are your Bios and chipset drivers up to date? If not back up your existing ones and do some reading before you try to update it - they might not offer anything and always a bit risky, but if they suggest improvements give it a go - if you mess up you can always get a replacement Bios chip for about £10. It's the randomness of these errors that make them difficult to work round. I suspected for a long time that the Windows update service was to blame, but even with it off they errors randomly popped up. ![]() |
||
|
|
CandymanWCG
Senior Cruncher Romania Joined: Dec 20, 2010 Post Count: 421 Status: Offline Project Badges:
|
Hi skgiven,
----------------------------------------Restart what? The machine or the task (is there such a thing?)? I haven't restarted the PC when I got the error. It was scheduled to shutdown during the night and I got the error next morning. BIOS and chipset are up to date and all other drivers. I begin to see now that there are no hot fixes, quick fixes or workarounds. Let's just hope they'll get sorted by the techies sooner, rather than later. I think I'll just leave it be and try to ignore this type of things from now on as I'm sure this wasn't the last time. A very disappointed "cheers to you all"... Knowledge is limited. Imagination encircles the world! - Albert Einstein ![]() |
||
|
|
sk..
Master Cruncher http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif Joined: Mar 22, 2007 Post Count: 2324 Status: Offline Project Badges:
|
I just meant restart the system.
I suppose you could look in your systems event logs for any tasks/events that started up around the time the task failed, but if the techs and scientists cant work this one out it might not be worth bothering. Good luck, |
||
|
|
|