Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 12
Posts: 12   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3657 times and has 11 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Something off with the projected runtime on MCM1

The strangeness is, they sit in the Ready to Start with about 4 hours, but then when they start, the projected runtime almost immediately drops to under 1 hour and then in the next 5-10 minutes climbs back up to the 4 hours range, usually finishing in the 4:15-4:30 range. fraction_done_exact is set to on in app_config for all apps, but only MCM shows this behavior.
[Dec 23, 2019 9:18:28 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

When I load a batch of MCM to process, I see intermittently, the run time jump from the normal 2-3 hours per task to over 1 & sometimes over 2 days. They actually do run for that length of time. Machine is nowhere near maxed out. Any thoughts?
[May 19, 2023 4:08:08 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7844
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

When I load a batch of MCM to process, I see intermittently, the run time jump from the normal 2-3 hours per task to over 1 & sometimes over 2 days. They actually do run for that length of time. Machine is nowhere near maxed out. Any thoughts?

I am going to hazard a guess you are using a laptop and it might be throttling due to heat issues. If not heat, there may be some process which is monopolizing your CPU time. You might check to see if your machine has been hacked and is busy mining crypto.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[May 19, 2023 6:37:36 PM]   Link   Report threatening or abusive post: please login first  Go to top 
bfmorse
Senior Cruncher
US
Joined: Jul 26, 2009
Post Count: 442
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

One thing to check: earlier this year, I had a similar problem with a newly purchased REFURBISHED MFF[1] unit and ALL WU’s I was attempting to process.

It seems that the provided AC adapter was not DESIGNED to be used with that system and because of that never was able to communicate with CMOS. So, the system ALWAYS was fully throttled back (until the proper AC Adapter was provided).

A quick check would be to pull up the TASK MANAGER, select the performance tab, then select CPU - and see if the indicated CPU speed is as expected -or- significantly lower. Then proceed from there.

Happy crunching.

1. MFF - Micro Form Factor
[May 19, 2023 7:13:05 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

Not sure this will help, but thanks for your suggestions. When I load a batch (say 4-6 tasks), after they start running, just some of them will jump the remaining time to days, versus hours. The others run normally. My machine is a fairly high end desktop.
[May 19, 2023 9:39:55 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7844
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

The other thing you may want to look at is the power settings in the operating system.
The other thought is if your system has some performance cores and some efficiency cores. The performance cores will show greater throughput than the efficiency cores.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
----------------------------------------
[Edit 1 times, last edit by Sgt.Joe at May 19, 2023 10:21:14 PM]
[May 19, 2023 10:19:57 PM]   Link   Report threatening or abusive post: please login first  Go to top 
TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2173
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

The other thing you may want to look at is the power settings in the operating system.
The other thought is if your system has some performance cores and some efficiency cores. The performance cores will show greater throughput than the efficiency cores.
Cheers
I am pretty sure that is all the same basic problem with some WUs that has been mentioned since well before the move away from IBM.

It is less an issue with the performance of the cores of the host CPU, but seems to be something in the algorithm that is causing what in software development terms is called a "race condition". In some cases, it just takes longer than usual to finish, and on quite a number of my hosts, this has in recent weeks resulted in a larger than usual number of WUs that NEVER finish (unless I manually abort them or they exceed their deadline).

This issue was avoided back in IBM days, and I seriously doubt that anyone at UHN/Jurisica Lab has ever bothered to look into since the move either...

Ralf
[May 19, 2023 10:55:52 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

Thank you. Very insightful.
[May 21, 2023 3:28:10 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7844
Status: Recently Active
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

The other thing you may want to look at is the power settings in the operating system.
The other thought is if your system has some performance cores and some efficiency cores. The performance cores will show greater throughput than the efficiency cores.
Cheers
I am pretty sure that is all the same basic problem with some WUs that has been mentioned since well before the move away from IBM.
It is less an issue with the performance of the cores of the host CPU, but seems to be something in the algorithm that is causing what in software development terms is called a "race condition". In some cases, it just takes longer than usual to finish, and on quite a number of my hosts, this has in recent weeks resulted in a larger than usual number of WUs that NEVER finish (unless I manually abort them or they exceed their deadline).
This issue was avoided back in IBM days, and I seriously doubt that anyone at UHN/Jurisica Lab has ever bothered to look into since the move either...
Ralf

I will concede a race condition is possible, but such a condition has not occurred in any of the MCM units which have run through my machines. It would be helpful to know how often this occurs in relation to the number of MCM units processed, if it occurs on particular machine architectures, and under what operating systems it occurs. It would also be helpful to know what the error condition may indicate and if it occurs more often than once for a particular work unit. If it only occurs once for a particular work unit and the other identical work units come back valid, that would point to some condition or set of conditions in a particular system.
If this is a rare enough condition it may not warrant the time needed to solve the problem. Overall, it would be beneficial to know an answer as it may shed some light on avoiding future problems.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
----------------------------------------
[Edit 1 times, last edit by Sgt.Joe at May 21, 2023 10:39:55 AM]
[May 21, 2023 10:39:27 AM]   Link   Report threatening or abusive post: please login first  Go to top 
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1317
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Something off with the projected runtime on MCM1

Thank you, Sgt. Joe - good diagnostic suggestions!... Instead of posting my [overlong] attempt on the same topic, I'd just like to identify some "fine details" it might be interesting to know:

  • How is the user's BOINC client set up -- % of cores and % of CPU in particular[1]
  • what is the user's work mix when problems occur -- all MCM1?
  • Is the MCM1 task VMethod=LOO or VMethod=NFCV?[2] - (in the result report on the web site);
  • Is the user using other software on the system when problems occur.
Unfortunately, without access to the MCM1 source code and build environment all we can do is try to gather information and see where that takes us -- given that MCM is in its home territory here, that might well help them do something about the problems as and when they eventually have time. And I'd happily take part in a proper Beta test for a revised version of MCM1 :-)

Cheers - Al.

[1] The "% of CPU time" value seems to have played a part in issues with MCM1 tasks looping during initialization. If that's a timer problem, it might also affect this other issue.

[2] I mention the VMethod thing because [as you may recall] discussions about why there seem to be two distinct sorts of run-time for MCM1 seemed to suggest that VMethod=NFCV tasks spent a lot longer calling various O/S features.
[May 21, 2023 4:25:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 12   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread