Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Retired Forums Forum: UD Windows Agent Support [Read Only] Thread: Aborted WUs |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 16
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I have just got an aborted WU after 11 hours of crunching. Sure enough, I haven't got any points fot it. My concern is that lately I have been getting a lot of aborted WUs (close to 40% of all WUs that I have got). My PC doesn't have any overclocking and haven't had such problems on the grid.org. So, am I alone on this issue or does anybody else have the same? Is there a solution? I don't want to spend my PC's time on worthless crunching if that's an issue and will be back to grid.org.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello Gor,
Are you using BOINC Windows or UD Windows? If UD, are the work units FAAH or HPF? mycrofth |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I'm using UD Agent for Windows. I have tried FAAH units, but all 3 of them aborted.
|
||
|
David Autumns
Ace Cruncher UK Joined: Nov 16, 2004 Post Count: 11062 Status: Offline Project Badges: |
must admit I saw one yesterday
----------------------------------------recent one's have gone through to completion. maybe some of the FAAH Workunit's don't converge like the occasional HPF WU. I'm sure if there are any issues IBM will be on it like a shot. Keep a lookout for any more Dave |
||
|
Viktors
Former World Community Grid Tech Joined: Sep 20, 2004 Post Count: 653 Status: Offline Project Badges: |
I have just got an aborted WU after 11 hours of crunching. Sure enough, I haven't got any points fot it. My concern is that lately I have been getting a lot of aborted WUs (close to 40% of all WUs that I have got). My PC doesn't have any overclocking and haven't had such problems on the grid.org. So, am I alone on this issue or does anybody else have the same? Is there a solution? I don't want to spend my PC's time on worthless crunching if that's an issue and will be back to grid.org. If the work unit was for HPF, then sometimes, the software intentially finishes off the work early when it happens to be a "non-convergent" work unit. What exactly were the unual indications besides not getting points? Was this for HPF or FightAIDS@home? If your computer's memory or processor got hit by a gamma ray or some other such event, it could have crashed the program. If this happens often on your computer, try one of the CPU and Memory testers for an overnight run on each just to make sure it is not your hardware. |
||
|
Online Gamer
Cruncher Joined: Sep 18, 2005 Post Count: 14 Status: Offline |
I have just got an aborted WU after 11 hours of crunching. Sure enough, I haven't got any points fot it. My concern is that lately I have been getting a lot of aborted WUs (close to 40% of all WUs that I have got). My PC doesn't have any overclocking and haven't had such problems on the grid.org. So, am I alone on this issue or does anybody else have the same? Is there a solution? I don't want to spend my PC's time on worthless crunching if that's an issue and will be back to grid.org. If the work unit was for HPF, then sometimes, the software intentially finishes off the work early when it happens to be a "non-convergent" work unit. What exactly were the unual indications besides not getting points? Was this for HPF or FightAIDS@home? If your computer's memory or processor got hit by a gamma ray or some other such event, it could have crashed the program. If this happens often on your computer, try one of the CPU and Memory testers for an overnight run on each just to make sure it is not your hardware. Hi Viktors, Please excuse the first use of the forum, but everything has been running fine till I started with the FAAH wu's. I seem to have the same problem. I regularly crunch for between 40% and 90% then the agent goes to 0% and comms with grid server then downloads a new wu. It doesn't appear to send anything to the server just gets a new wu. Also the run time, points etc doesn't update. I'm not too bothered about that but its the wasted time that seems to be a problem. I see you were online so if you want a quick reply I'll hang around Cheers Steve |
||
|
retsof
Former Community Advisor USA Joined: Jul 31, 2005 Post Count: 6824 Status: Offline Project Badges: |
Check your computer memory with memtest86. This one massages it very well. Rosetta only uses a smaller amount of memory. It could be that the larger aids runs are hitting a portion of memory that wasn't used for Rosetta. Aborted workunits can indicate memory problems or excessive overclocking. Yes, I know that there are a few Rosettas that don't converge, but that shouldn't happen that often. Repeated failures indicate something else.
----------------------------------------I have run about 30 aids workunits so far and have had no problem with them.
SUPPORT ADVISOR
----------------------------------------Work+GPU i7 8700 12threads School i7 4770 8threads Default+GPU Ryzen 7 3700X 16threads Ryzen 7 3800X 16 threads Ryzen 9 3900X 24threads Home i7 3540M 4threads50% [Edit 1 times, last edit by retsof at Nov 25, 2005 9:34:43 PM] |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
If the work unit was for HPF, then sometimes, the software intentially finishes off the work early when it happens to be a "non-convergent" work unit. What exactly were the unual indications besides not getting points? Was this for HPF or FightAIDS@home? I usually crunch only HPF units but have tried FAAH units. As I said before, all 3 of them got aborted. Then I switched back to HPF units only. When a WU gets aborted, usually this happens at least 2 hours into crunching and as someone in this thread pointed out, between 40% and 90%. There is no stats updated, the Agent just gets a new WU. If your computer's memory or processor got hit by a gamma ray or some other such event, it could have crashed the program. If this happens often on your computer, try one of the CPU and Memory testers for an overnight run on each just to make sure it is not your hardware. Look, I have been in distributed computing since 2002. Different projects, different PCs. Mostly, I have been with grid.org which is using the same agent as WCG. I have crunched HPF WUs there with no problems using this very PC that we are talking about. Moreover, this PC has been slightly overclocked before recently. After assembling it and overclocking, I checked it with several testers running them for hours. And again, I have had no problems with HPF WUs on grid.org with the same agent even on the overclocked PC. After I had got several aborted HPF WUs on WCG, I got rid of overclocking completely. It solved the problem for less than a week (I had 15 completed WUs in a row). Now I'm experiencing the same problem. I'll take your advice and run tests overnight again, but something tells me that they will come up clean. |
||
|
Online Gamer
Cruncher Joined: Sep 18, 2005 Post Count: 14 Status: Offline |
Hi Gor, Retsof
----------------------------------------Well I've just sat and watched the damn thing for 7 hours after memtest86+ gave me a clean bill of health (p.c. memory at least, don't know about my own now!!), and guess what. Yep after 5 aborted wu's between 40% and 90% its just completed, sent results and updated stats, downloaded new wu and is crunching away again on a faah wu. Will keep everyone updated via this thread as to how the next few units progress but having crunched everything until the last lot of wu's without a glitch and having a 1.5 gig swap file and 1 gig memory (checked) a p4 3 gig cpu it may point to the fact that several of these aborted wu's doesnt always point to something on the p.c. It may just be our bad luck, or exceptional luck in downloading so many on the run. Cheers all, will keep you updated Steve |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi, Steve.
----------------------------------------That's very strange, but I have the same config on my PC: P4 3.0 GHz and 1 Gb of memory. And we have the same problems with aborted WUs. Are your CPU and memory Northwood core and Crucial by any chance? [Edit 1 times, last edit by Former Member at Nov 27, 2005 5:41:17 AM] |
||
|
|