Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 42
Posts: 42   Pages: 5   [ Previous Page | 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3724 times and has 41 replies Next Thread
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Continual error 193

For all I remember, not interested in any extended debate, HCMD was launched when the UD agent still had the fastest share of WCG CPU time. Considering the anticipated limited CPU years needed, the decision not to port it to the BOINC platform was made then. History.

P3/Athlon not supported... refer to post by I think uplinger early in this thread. If this is changed, I don't know. Any written thought on later change is speculation. It's a matter for WCG to decide to revert to the very lowest common denominator. If it impacts performance, I would not. WCG techs will know from the error reports how many devices are causing error in that class. A similar decision to ending for W98 was made, else WCG/We'd be forever stuck with legacy. So, let's see what WCG decides.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[May 18, 2009 7:14:32 AM]   Link   Report threatening or abusive post: please login first  Go to top 
gordoma
Veteran Cruncher
Windsor, UK
Joined: Jul 21, 2005
Post Count: 729
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Continual error 193

P3/Athlon not supported... refer to post by I think uplinger early in this thread.

Can I just ask for a quick bit of clarification? Uplinger mentioned that this problem was not seen in beta, but would now be given a higher priority for attention. My P3-1000 running Ubuntu has been crunching through other applications with no problem, but just errors out on HCMD2. Does this suggest that this problem may be resolved with an update to the application?

Should I therefore just select it to run other projects (for the time being), or are we looking at P3s not being to run any applications in the future? I'm considering retiring this machine as despite it's ability to sit there crunching 24/7, it's noisy and power-hungry and I'm not prepared to spend any money to fix it. However I want to make sure I have a good idea of its usefulness, both present and future, before I make a final decision.

----------------------------------------
[May 18, 2009 7:30:27 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Continual error 193

It's a matter for WCG to decide to revert to the very lowest common denominator. If it impacts performance, I would not.

You've got to be kidding. As things stand, HCMD2 WUs are being given to P3 machines, which may error immediately, fetch another, and repeat. If specifically set to HCMD2, they'll hit their daily quota, then sit idle for the rest of the day, then do the same the next day (with the daily quota halved). Result: Lots of idle machines, huge numbers of error WUs, massive amounts of useless work for for the scheduler, lots of pointless downloading (some of us pay for downloads, you know). Plus, we are very limited in number of boinc profiles. I'm sure I'm not the only one running the same profile for some pre-P4 and some P4+ machines. Having to divide up all profiles by CPU type, so as to avoid P3 machines fetching P4+ WUs becomes really messy.

Getting some tiny performance increase on some machines while creating this huge mess can't possibly be worth it. At the very least, the scheduler needs to recognise pre-P4 machines and not send them P4+ WUs. If it were to do that, then the obvious next step is not to ignore pre-P4 machines, but to have the best of both worlds: Simply have 2 copies of the WU executable, one with the P4 optimisations and one without, and send them to the appropriate machines.

This is pretty obviously just a screw-up, as P3/Athlon machines are apparently working fine under Windows. What's amazing is that there's still not even a mention of this in the Known Issues forum.
[May 18, 2009 8:00:42 AM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Continual error 193

P3/Athlon not supported... refer to post by I think uplinger early in this thread.

Can I just ask for a quick bit of clarification? Uplinger mentioned that this problem was not seen in beta, but would now be given a higher priority for attention. My P3-1000 running Ubuntu has been crunching through other applications with no problem, but just errors out on HCMD2. Does this suggest that this problem may be resolved with an update to the application?

Should I therefore just select it to run other projects (for the time being), or are we looking at P3s not being to run any applications in the future? I'm considering retiring this machine as despite it's ability to sit there crunching 24/7, it's noisy and power-hungry and I'm not prepared to spend any money to fix it. However I want to make sure I have a good idea of its usefulness, both present and future, before I make a final decision.


Greetings Matt,

P3s are going to be supported on most projects. HCMD2 on linux is the only version having issues with P3's and older. We had to use a different compiler for this project on Linux so the work units would run at a rate of speed similar to windows. If we did not then work units were taking 100x longer than the windows version on similar computers. We are looking into fixing the compiler options so that it'll work consistently across all processor x86 versions.

I would suggest changing your p3s and older to run something other than HCMD2 at the moment. I have done this on my older boxes until the issue is solved. I did this by changing a device profile to have all but HCMD2 selected and then told the older machines to use this profile.

Kremmen, I am going to look into your suggestion on having it omit sending work units to p3s and older. But this suggestion will also force older machines to be idle (without impact to their quotas though) for members who have only selected HCMD2 in their projects. But as I mentioned earlier, the performance issue was not a slight difference as it was a lot worse near 100x run difference.

-Uplinger
[May 18, 2009 2:19:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
gordoma
Veteran Cruncher
Windsor, UK
Joined: Jul 21, 2005
Post Count: 729
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Continual error 193

Thanks Uplinger... very much appreciate your time to respond and clarify for me. I have now used another profile for these machines and set them to not run HCMD2
----------------------------------------
[May 18, 2009 2:56:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TXR13
Cruncher
Canada
Joined: Dec 5, 2005
Post Count: 36
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Continual error 193

P3s are going to be supported on most projects. HCMD2 on linux is the only version having issues with P3's and older. We had to use a different compiler for this project on Linux so the work units would run at a rate of speed similar to windows. If we did not then work units were taking 100x longer than the windows version on similar computers. We are looking into fixing the compiler options so that it'll work consistently across all processor x86 versions.

...

Kremmen, I am going to look into your suggestion on having it omit sending work units to p3s and older. But this suggestion will also force older machines to be idle (without impact to their quotas though) for members who have only selected HCMD2 in their projects. But as I mentioned earlier, the performance issue was not a slight difference as it was a lot worse near 100x run difference.

-Uplinger



Thank you for the update, Uplinger! It is greatly appreciated. smile I didn't realize that the issue was restricted to HCMD2 on P3 on Linux only. That seems to be a very specific subset, and while I'm sure it's not a trivial number of machines involved, it may not be worth the sort of performance degradation you indicate.

If it's possible to change the compiler options to work across all platforms and CPUs, that would obviously be best. But if that's not possible, the next best thing would truly be to block work for HCMD2 from being assigned to P3s or Athlons running under Linux. Yes, if the member only has HCMD2 selected, the machines will sit idle. But better they sit idle than to have significant numbers of errors and resends generated, I think.

Again, just my $0.02. And whichever way this works, I greatly appreciate the explanation. Thanks again! peace
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by TXR13 at May 18, 2009 3:55:25 PM]
[May 18, 2009 3:52:30 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Continual error 193

P3s are going to be supported on most projects. HCMD2 on linux is the only version having issues with P3's and older. We had to use a different compiler for this project on Linux so the work units would run at a rate of speed similar to windows. If we did not then work units were taking 100x longer than the windows version on similar computers. We are looking into fixing the compiler options so that it'll work consistently across all processor x86 versions.

What is so different about HCMD2 that it requires using a different compiler?

Also, making it work across all x86 processors might be unnecessary. There probably aren't many P2 and older machines left. How is performance if you do P3 optimisations, but not P4?
[May 19, 2009 2:54:31 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Dadinck
Cruncher
Joined: May 14, 2009
Post Count: 6
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Continual error 193

Excuse me for asking such a NOOB question, and then making a simplistic observation:

Question: Since HCMD2 started recently, wouldn't it use the latest release of BOINC to generate its work units?

Since I am a noob, I am running 6.2.28. I don't have a problem with HCMD2. 1 unit takes 1:57:07 hours.
----------------------------------------

[May 19, 2009 4:39:53 AM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Continual error 193

Hello again! Let me try to respond to all the posts since my last.

Matt, no problem on explaining the issue.

TXR13, The reason it is limited to HCMD2 linux for older processors is that we use a different compiler for this application. Most of our science applications are programmed in C/C++ but some are in Fortran. Which caused the different compiler. Since without optimizations the work units were taking forever to run we had to add some optimizations causing this issue. We are going to try our best to make it work on all processors in linux. This is our goal.

Dadinck, Each science application uses the BOINC api to have it communicate with the client/manager. But each work unit is specifically run on the science application which is downloaded through BOINC onto your machine. The problem people are seeing is that on older machines (I believe it is processors with out SSE2) that are running linux they error out immediately due to an invalid operation being passed through the processor.
The best example I can give on this is basically someone gave you a book to read English, and in the middle of the book a quote was written in Spanish. Since I am not fluent in Spanish, I would stop reading and close the book as I can not continue the story properly. But other members who know Spanish would be able to continue the story properly to then end.

Hope this helps,
-Uplinger
[May 19, 2009 3:09:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Continual error 193

Most of our science applications are programmed in C/C++ but some are in Fortran. Which caused the different compiler. Since without optimizations the work units were taking forever to run we had to add some optimizations causing this issue.


The really odd thing about that is that your windows compiler is obviously producing sufficiently efficient, fast code which works for older machines, so whatever linux compiler you're using appears to simply produce poor code unless explicitly told to compile for P4s.

We are going to try our best to make it work on all processors in linux. This is our goal.


That'd be good ... and should theoretically be easy. Any code a windows compiler can generate can be generated by a linux compiler too.

Alternatively, there are plenty of Fortran-to-C converters, so why not convert the Fortran to C and then use the standard C compiler?
[May 20, 2009 3:24:23 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 42   Pages: 5   [ Previous Page | 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread