| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 38
|
|
| Author |
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Among the largest CPUtime consumed and evaluated as INVALID:
CMD2_ 1885-2AR9_ D.clustersOccur-2DME_ A.clustersOccur_ 1_ 31056_ 33656_ 0-- 9.34hrs CPUtime CMD2_ 1886-2AMY_ A.clustersOccur-2IAE_ B.clustersOccur_ 7_ 56427_ 57121_ 0-- 6.41hrs CPUtime CMD2_ 1885-2AR9_ D.clustersOccur-2C63_ C.clustersOccur_ 2_ 44274_ 48049_ 1-- 6.01hrs CPUtime CMD2_ 1885-2ARQ_ A.clustersOccur-2FFU_ A.clustersOccur_ 13_ 1-- 4.94hrs CPUtime CMD2_ 1885-2ARQ_ A.clustersOccur-2DW4_ A.clustersOccur_ 8_ 1-- 3.75hrs CPUtime Ouch, it hurts! How do I stop this? (BOINC_v6.10.59) |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Don't know. What have you got to offer in terms of message logs [right when the task finishes] and Result Log, those linked where it says ''Invalid"?
Is this continues for just HCMD2, other sciences inbetween validation normally? --/-- |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Is this continues for just HCMD2, other sciences inbetween validation normally? Lately, I'm crunching under one scienceProject at a time. I just got 2yrs runTime for all-C4CW-crunching with very minimal, if any problems at all. I also did some HFCC, HCC, HPF2, and FightAIDS, and CMD2 -- and of these projects, CMD2 proved to be the most problematic for me...Not very different from the CMD2 I used to know (~3years ago): problematic. Then, I was mixing projects but now I'm doing exclusive CMD2 and the same CMD2 problems! I continue to be lost as to why one project (C4CW) behaves smoothly as a whistle while other projects (CMD2) is studded with irregularities, randomness, unpredictability, and at last, errors. I would like to think that the CMD2 code could use some improvement to make it come inline with the current model (in my book) project: C4CW. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Well, can only refer to what was asked in my previous post... Message and Result Logs.
--//-- |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Well, can only refer to what was asked in my previous post... Message and Result Logs. There is nothing on my end's message and result logs that would spell out the difference between what would eventually be evaluated as Valid, Inconclusive, or Invalid; it all comes out as 'normal' WUs and the cruncher expecting that it is a validly crunched WU. It is WCG, and not the cruncher that spells the difference in the valuation. So I ask WCG: what do you see in your logs there that made WCG deem an otherwise Valid WU as Invalid? |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Of course same Device, OS, AV/FW etc? What are specs?
I've asked the admin to move to the HCMD2 forum. Maybe others see it there and say ''ah'' an other victim, for some correllation. Rather odd with a 3 year interval, and a new science compile since then. --//-- |
||
|
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3315 Status: Offline Project Badges:
|
My computers used to get a considerable number of invalid Wu's for HCMD2.
----------------------------------------Result logs only said "maximum runtime exceeded" (something like this) even when CPU hours were lower than 6 hours. I decided to get away from HCMD2 All other projects crunch fine. ![]() - AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W - AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W - AMD Ryzen 7 7730U 8C/16T 3.0 GHz [Edit 1 times, last edit by Falconet at Jun 12, 2011 10:28:46 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Right on the money, Falconet, and every word of it ! I'm optimistic that we'll get to the source of the matter and next get to fully understand the underlying mechanisms involved.
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
My computers used to get a considerable number of invalid Wu's for HCMD2. Result logs only said "maximum runtime exceeded" (something like this) even when CPU hours were lower than 6 hours. I decided to get away from HCMD2 All other projects crunch fine. It's a tough question, but what are the commonalities between you 2? Only from Falconet's post we make out a premature MRE for part of his tasks and andzgrid confirming it being right on the money Tasks running longer than 6 hours is not clear... do they too get the MRE? How about you compare OS/Software notes. Seems not the CPU from what BOINCStats reveals, but see an XP and a Vista device for Falconet. Did both do this? It would help to have the info here to make any tracking by the techs easier. --//-- |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Hello WCG
Attention / Reference: Sekerob-WCG CA / Sekerob [Jun 12, 2011 7:20:22 AM] post Greetings: Herewith are the particulars of various items as you requested. A] Machine_Specifics (stdoutdae.txt): ---------------------------------------------- BOINC client version 6.10.59 for x86_64-pc-linux-gnu Data directory: /var/lib/boinc-client Processor: 6 AuthenticAMD AMD Phenom(tm) II X6 1090T Processor [Family 16 Model 10 Stepping 0] Processor: 512.00 KB cache Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm 3dnowext 3dnow constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni monitor cx16 popcnt OS: Linux: 2.6.38-8-generic Memory: 7.81 GB physical, 18.88 GB virtual Disk: 439.86 GB total, 413.67 GB free [World Community Grid] Host location: none [World Community Grid] General prefs: using your defaults Preferences: max memory usage when active: 7200.60MB max memory usage when idle: 7200.60MB max disk usage: 32.00GB don't use GPU while active Not using a proxy Initialization completed B] WUname: CMD2_1885-2AR7_B.clustersOccur-2JC2_C.clustersOccur_0_12833_19204_0 ------------------------------------------------------------------------------------------------------------------- C] WU_specifics at machine (stdoutdae.txt): ------------------------------------------------------- 10-Jun-2011 07:18:05 [World Community Grid] Starting CMD2_1885-2AR7_B.clustersOccur-2JC2_C.clustersOccur_0_12833_19204_0 10-Jun-2011 07:18:05 [World Community Grid] Starting task CMD2_1885-2AR7_B.clustersOccur-2JC2_C.clustersOccur_0_12833_19204_0 using hcmd2 version 640 10-Jun-2011 19:21:41 [World Community Grid] Computation for task CMD2_1885-2AR7_B.clustersOccur-2JC2_C.clustersOccur_0_12833_19204_0 finished 10-Jun-2011 20:04:11 [World Community Grid] Started upload of CMD2_1885-2AR7_B.clustersOccur-2JC2_C.clustersOccur_0_12833_19204_0_0 10-Jun-2011 20:04:16 [World Community Grid] Finished upload of CMD2_1885-2AR7_B.clustersOccur-2JC2_C.clustersOccur_0_12833_19204_0_0 D] WU_specifics at WCG (Main) Results WebPage: --------------------------------------------------------------- ResultName: CMD2_ 1885-2AR7_ B.clustersOccur-2JC2_ C.clustersOccur_ 0_ 12833_ 19204_ 0-- Status: Invalid SentTime/ReturnTime: 6/9/11 06:25:34 / 6/10/11 20:04:51 CPUtime(hours): 12.00 Claimed/Granted BOINCcredit: 455.3 / 23.1 E:] WU_specifics at WCG "Results Log" WebPage: -------------------------------------------------------------- Result Name: CMD2_ 1885-2AR7_ B.clustersOccur-2JC2_ C.clustersOccur_ 0_ 12833_ 19204_ 0-- <core_client_version>6.10.59</core_client_version> <![CDATA[ <stderr_txt> INFO: Initializing Platform. INFO: No state to restore. Start from the beginning. Finishing early because max runtime has been exceeded.43202.390000 called boinc_finish </stderr_txt> As far as I can tell, there is nothing in the data for the Device, OS, AV/FW, or any specs that will differentiate when or whether a submitted WU is evaluated as being Valid, Inconclusive, or Invalid. Please note that the wordings for Valid, Inconclusive, or Invalid WUs (as shown in the "Results Log" webPage) are similar in that there is always the "Initializing Platform", "No state to restore", and "Finishing early because ...." and that this is true whether the WU ends up being evaluated as Valid, Inconclusive or Invalid. The only differentiation that I can discern is the number following the "...has been exceeded" phrase. Now, how do we crunchers ascertain that a submitted-WU was correctly evaluated as Valid, Inconclusive, or Invalid? Good day ; |
||
|
|
|