Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 13
Posts: 13   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 5191 times and has 12 replies Next Thread
ThreadRipper
Veteran Cruncher
Sweden
Joined: Apr 26, 2007
Post Count: 1324
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Only invalids with 7850k APU

Hi,

I am getting only invalids so far on both the MCM WUs verified so far on my new A10-7850K @ stock. PC is stable otherwise, with no indications of instability.

The invalid WUs:
MCM1_ 0001867_ 1140_ 0--
MCM1_ 0001867_ 7855_ 0--

Perhaps techs have more insight, if this could be a more general architechtural incompatibility of some kind?
----------------------------------------

Join The International Team: https://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=CK9RP1BKX1

AMD TR2990WX @ PBO, 64GB Quad 3200MHz 14-17-17-17-1T, RX6900XT @ Stock
AMD 3800X @ PBO
AMD 2700X @ 4GHz
[Feb 3, 2014 12:00:56 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Randzo
Senior Cruncher
Slovakia
Joined: Jan 10, 2008
Post Count: 339
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

Hello,

please post here the error log.
[Feb 3, 2014 4:53:41 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7849
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

I am seeing a rash of invalids in several different batches, 1848,1851,1853,1856,1864,1869. I do not see anything valid in those batches on any of my machines, but saw the wingmen did get valids on them. I have not had any invalids since batch 1869 and am currently seeing batches up to 1898. Must have been something transient.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Feb 3, 2014 9:19:19 PM]   Link   Report threatening or abusive post: please login first  Go to top 
ThreadRipper
Veteran Cruncher
Sweden
Joined: Apr 26, 2007
Post Count: 1324
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

Thanks for the replies!
Yes there seems to be some sort of glitch perhaps in some batches.

Here are my error logs anyway:
Result Name: MCM1_ 0001867_ 1140_ 0--



<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1867_1140
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 14
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 23726
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 23726
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379641140


[17:02:06] Initializing
[17:02:13] Running
[17:02:13] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[17:55:21] Initializing
[17:55:26] Running
[17:55:27] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_1140.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[16:58:21] Initializing
[16:58:26] Running
[16:58:26] EvaluateFitnessOfStartingGeneSignatures 23726
[17:21:31] Writing final output
[17:21:32] Closing Output Stream
[17:21:32] Cleaning up
Result.out = 4818791.000000
Run complete, CPU time: 29989.035625
17:21:32 (4180): called boinc_finish

</stderr_txt>
]]>


and the second WU:
Result Name: MCM1_ 0001867_ 2428_ 1--



<core_client_version>7.2.33</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_2428.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1867_2428
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 14
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 23726
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 23726
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379642428


[17:02:07] Initializing
[17:02:14] Running
[17:02:14] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_2428.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[17:55:21] Initializing
[17:55:26] Running
[17:55:27] EvaluateFitnessOfStartingGeneSignatures 23726
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_windows_x86_64 -SettingsFile MCM1_0001867_2428.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[16:58:21] Initializing
[16:58:26] Running
[16:58:26] EvaluateFitnessOfStartingGeneSignatures 23726
[17:24:11] Writing final output
[17:24:12] Closing Output Stream
[17:24:12] Cleaning up
Result.out = 4818973.000000
Run complete, CPU time: 30148.203125
17:24:12 (4204): called boinc_finish

</stderr_txt>
]]>

Seems also now I did get some valid ones too, but in different batches...
----------------------------------------

Join The International Team: https://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=CK9RP1BKX1

AMD TR2990WX @ PBO, 64GB Quad 3200MHz 14-17-17-17-1T, RX6900XT @ Stock
AMD 3800X @ PBO
AMD 2700X @ 4GHz
[Feb 3, 2014 11:37:09 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7849
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

Update:
I have gotten some more invalids in subsequent batches, but as of yet I can see no pattern. After a period of very good stability, it seems some issues are resurfacing.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Feb 5, 2014 12:05:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
armstrdj
Former World Community Grid Tech
Joined: Oct 21, 2004
Post Count: 695
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

We are looking into the issues and will report back once we know more. Based on reports in the forums it does look like an issue with restoring from a checkpoint for a certain type of workunits we are running now. A temporary workaround until we have a permanent solution in place would be to turn on the setting to leave the applicaiton in memory when suspended to reduce the likelihood of a restart.

Thanks,
armstrdj
[Feb 5, 2014 3:03:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
David_L6
Senior Cruncher
USA
Joined: Aug 24, 2006
Post Count: 296
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

A temporary workaround until we have a permanent solution in place would be to turn on the setting to leave the applicaiton in memory when suspended to reduce the likelihood of a restart.

Thanks,
armstrdj



I always have mine set like that on all computers.

The last week or so I've been getting almost all invalids on MCM on one computer. It has a Q6700 and Windows XP (32 bit). Yesterday it ran 6 or 8 FAAH work units and all of those were valid. I have two other computers running MCM and they aren't getting any invalids. They also have Q6700 CPUs but they are running Vista (one 32 bit and the other 64 bit).
----------------------------------------

[Feb 5, 2014 10:39:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
ThreadRipper
Veteran Cruncher
Sweden
Joined: Apr 26, 2007
Post Count: 1324
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

We are looking into the issues and will report back once we know more. Based on reports in the forums it does look like an issue with restoring from a checkpoint for a certain type of workunits we are running now. A temporary workaround until we have a permanent solution in place would be to turn on the setting to leave the applicaiton in memory when suspended to reduce the likelihood of a restart.

Thanks,
armstrdj



Thanks for the info!
And for all replies from everyone!

I also run with "leave application in memory while suspended", but I know that there was a computer power cycle in-between start and finish of those invalid WUs.

Now I have returned mostly valid results as well with that 7850K so that's good. Yes, it seems it does have something to do with checkpoints, since I have not a single invalid returned on from my main 3930K cruncher which operates 24/7 so there should be no (or very few) checkpoint restarts needed there.

Alright crunching on...
----------------------------------------

Join The International Team: https://www.worldcommunitygrid.org/team/viewTeamInfo.do?teamId=CK9RP1BKX1

AMD TR2990WX @ PBO, 64GB Quad 3200MHz 14-17-17-17-1T, RX6900XT @ Stock
AMD 3800X @ PBO
AMD 2700X @ 4GHz
[Feb 5, 2014 11:18:35 PM]   Link   Report threatening or abusive post: please login first  Go to top 
l_mckeon
Senior Cruncher
Joined: Oct 20, 2007
Post Count: 439
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

I haven't checked for invalids, but I notice MCM seems to lose all work since the last checkpoint if you suspend and then re-enable a task (according to BoincTasks).

I do leave suspended tasks in memory.
[Feb 6, 2014 1:20:00 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Only invalids with 7850k APU

I just checked my rigs and I've got a fair few invalid as well, here's my list.

MCM1_ 0001896_ 1623_ 1-- Intel-i7-2700K Invalid 2/2/14 22:21:44 2/5/14 00:00:21 5.26 / 5.31 136.9 / 59.3

MCM1_ 0001896_ 1086_ 0-- Intel-i7-2700K Invalid 2/2/14 22:21:44 2/4/14 23:11:37 5.28 / 5.34 136.6 / 75.6

MCM1_ 0001894_ 5145_ 0-- Black-AMD-X6 Invalid 2/2/14 20:33:13 2/4/14 02:32:45 7.12 / 7.17 125.1 / 67.4

MCM1_ 0001894_ 4990_ 0-- Black-AMD-X6 Invalid 2/2/14 20:33:13 2/4/14 03:01:32 7.42 / 7.48 133.4 / 73.5

MCM1_ 0001877_ 5417_ 1-- Intel-i7-2700K Invalid 2/2/14 01:28:23 2/4/14 01:34:01 6.34 / 6.40 170.6 / 107.4

MCM1_ 0001877_ 4972_ 0-- Intel-i7-2700K Invalid 2/2/14 01:28:23 2/4/14 01:34:01 6.34 / 6.40 170.7 / 91.8

MCM1_ 0001875_ 5353_ 1-- Black-AMD-X6 Invalid 2/1/14 23:06:15 2/3/14 03:36:31 8.83 / 8.89 208.2 / 111.8

MCM1_ 0001872_ 3272_ 0-- Intel-i7-2700K Invalid 2/1/14 20:32:06 2/3/14 20:39:13 6.51 / 6.76 198.6 / 102.6

MCM1_ 0001872_ 2918_ 0-- Intel-i7-2700K Invalid 2/1/14 20:32:06 2/3/14 20:39:13 6.54 / 6.77 198.7 / 99.5

MCM1_ 0001872_ 2777_ 1-- Intel-i7-2700K Invalid 2/1/14 20:32:06 2/3/14 02:12:21 6.51 / 6.57 216.2 / 103.9

The only time these might get restarted, is when I restart in the mornings.
[Feb 6, 2014 2:21:18 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 13   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread