Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Retired Forums Forum: Member-to-Member Support [Read Only] Thread: Do duplicate device names cause problems? |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 12
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
On a new computer, I keep getting the message that starts like this: "Something in your machine is making the grid project program repeately crash..."
I have tried all the workarounds posted online that I can find, including making sure I am running as UD.exe and shutting off Data Execution Prevention (DEP) for this program. Nothing has helped so far. I did notice that this device had the same name as an old one of mine, since this is a rebuilt computer and the prior incarnation also ran the grid. Specifically, this device is named OYARSA and the old one was named Oyarsa, so they were the same except for capitalization. I renamed the old device to be called "Oyarsa - OLD", and the program has been running succesfully now for ... 10 minutes? But sometimes it does run for a bit before crashing. Can anybody confirm or deny that it was a good idea to rename the old device to prevent naming conflicts? Thanks, Chris Leonard Cedar Rapids, Iowa |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Can anybody confirm or deny that it was a good idea to rename the old device to prevent naming conflicts? Chris -- I am afraid that a duplicate name is not the cause of your problems. I think I have three devices with the same name among the seven or so I have defined since the beginning of the project (I really only have two systems but have reinstalled a few times for one reason or another). If it continues to crash, try to get the exact content of the message and report any other observations that might be strange at that time and we will try to work with you on resolving the problem. Best regards, |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks for this information! Oddly, since changing the name of the old device, the grid software has stopped having problems ... at least temporarily. It is currently up to 1 hour, 5 minutes (19 percent) of its task. Previously, it would not get to 10 minutes typically, and frequently bombed in the first minute or two.
I will keep you posted if the program starts crashing again. Thanks for your advice - I really appreciate it! Cheers, Chris |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I think that I will chime in with a link to the Useful Utilities thread at http://www.worldcommunitygrid.org/forums/wcg/viewthread?thread=2490 which includes:
2) CPU Tester: http://7byte.com/index.php?page=download General diagnostic program 3) Memory Tester: http://www.memtest86.com/ Memory diagnostic program |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Thanks. I use several of those utilities already, and some of them are really good. I really like the MBM and SIS utilities, for example.
----------------------------------------Oddly, though, the computer itself is rock-solid - it is only the grid software that has been having problems. I work as a database administrator, and have some expertise with Windows also, and I'm pretty comfortable making the statement that this doesn't look like a hardware issue (unless you pursuade me otherwise). Too many other things - some of them quite demanding, such as virtual computers and server components - are functioning perfectly. And that all stopped this morning, when I renamed my old device. So, either (a) there is some condition that can cause problems with duplicate device names, although it obviously doesn't surface for everybody who has duplicate device names, or (b) it is going to start failing again for some reason. Either way, I'm scratching my head... Thanks again. I appreciate the brainstorming help! -Chris [Edit 1 times, last edit by Former Member at Jul 20, 2005 11:43:08 PM] |
||
|
Viktors
Former World Community Grid Tech Joined: Sep 20, 2004 Post Count: 653 Status: Offline Project Badges: |
If you get one of those special messages, it is best to work with us through the "contact us" link. We will have to work with you to figure out what is wrong in your particular situation because there are many things which could cause the problem.
BTW, some machines which work just fine normally, have had serious problems in their floating point hardware. Since the operating system and normal applications often make little use of floating point, such problems don't show up until you run something like Rosetta, which uses floating point extensively. |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hi Viktors,
Thanks for the info. It seems like you all really want to work on solving these problems, which is refreshing. I did, in fact, use the "Contact Us" link, and got an email back from wcgrid@us.ibm.com. However, it wasn't clear how to continue that correspondence. Do I just reply to the email that was sent from wcgrid@us.ibm.com? Regarding your comments (and those from mycrofth) about hardware: fair enough. I'll run hardware tests on this PC, since that hasn't been done on this workstation (it's new). I'm still optimistic that that won't be the problem, since there are a few graphical applications installed (think "Google Earth" with tilting graphics, etc., and you'll be in the right ballpark) that I think would use the floating point. But you're right, I don't really know how the floating-point is behaving, and until we check that out it's not worth your time to worry about other possible causes. So, I'll check it out Hot CPU Tester Pro 4 Lite is running right now) and get back to you with results. By the way, here's a new interesting tidbit: the application is working today, but it wasn't last night. I am sitting, physically, at the computer today, but last night was using a console RDP session. My other devices run Rosetta fine, even when in a NON-console RDP session. So my current status is: DEP is ignoring UD.exe, and the app seems to work well when I am at the computer (which is an improvement). However, it has problems when I'm remote. This made me think two words: screen saver, but my screen saver is the very bare-bones builtin "Blank" screensaver (not to be confused with "None"). Thanks again, Chris |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
OK, I'm stopping the CPU test, because every time it tests MMX it stops the streaming broadcast of the Cubs game. :o)
Basically, all the tests (default configuration - just "install it and run it") ran 4 times, and some of them ran 5 times. All completed without errors. Is this an adequate check for the processor? I will not be able to run the bios-based memory test right away, since I need to keep the computer running so I can work. I will try to run it later today. Thanks again, Chris |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
If it only crashes when you are running the machine remotely?? Do write privileges change for remote execution? Every few minutes each new conformation gets written into a temporary work file in the WorldCommunityGrid directory. Do you have a complex user structure on your new computer?
Just trying to think what remote execution might change. mycrofth |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Have you tried giving it a workout with prime 95?
That will give all the important parts of your system a good workout. http://www.mersenne.org/freesoft.htm Memtest needs a good few passes....the longer you run it the better. These may not find the root cause of this issue but at least it will confirm/deny overall stability issues. Hope this helps. |
||
|
|