Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Workunit with a replication of 5, three of them invalid, two valid

Hello,
since we are asked to post our problems here, here is mine.

Could someone from WCG please have a look at the workunit
Project Name: Mapping Cancer Markers
Created: 01/31/2014 21:59:15
Name: MCM1_0001875_3958
Minimum Quorum: 2
Replication: 5

and explain why there have been three invalid results, one of them mine?
What is the exact reason, and can I do anything about it?

Here are the three invalid results

My invalid result:


Result Log

Result Name: MCM1_ 0001875_ 3958_ 1--
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1875_3958
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 16
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 14650
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 14650
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379723958


[00:19:02] Initializing
[00:19:05] Running
[00:19:05] EvaluateFitnessOfStartingGeneSignatures 14650
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[10:49:23] Initializing
[10:49:26] Running
[10:49:26] EvaluateFitnessOfStartingGeneSignatures 14650
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[15:39:08] Initializing
[15:39:11] Running
[15:39:11] EvaluateFitnessOfStartingGeneSignatures 14650
[17:26:59] Writing final output
[17:26:59] Closing Output Stream
[17:26:59] Cleaning up
Result.out = 3092103.000000
Run complete, CPU time: 25216.560227
17:26:59 (1567): called boinc_finish

</stderr_txt>
]]>

------------------------------------------------------------------------------------------------
The second invalid result:


Result Log

Result Name: MCM1_ 0001875_ 3958_ 2--
<core_client_version>6.10.45</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1875_3958
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 16
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 14650
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 14650
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379723958


[01:15:28] Initializing
[01:15:30] Running
[01:15:30] EvaluateFitnessOfStartingGeneSignatures 14650
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[02:52:34] Initializing
[02:52:36] Running
[02:52:36] EvaluateFitnessOfStartingGeneSignatures 14650
[06:12:16] Writing final output
[06:12:16] Closing Output Stream
[06:12:16] Cleaning up
Result.out = 3092044.000000
Run complete, CPU time: 17362.644779
06:12:17 (12483): called boinc_finish

</stderr_txt>
]]>


------------------------------------------------------------------------------------------------


The third invalid result:


Result Log

Result Name: MCM1_ 0001875_ 3958_ 3--
<core_client_version>6.10.45</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1875_3958
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 16
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 14650
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 14650
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379723958


[06:23:13] Initializing
[06:23:17] Running
[06:23:17] EvaluateFitnessOfStartingGeneSignatures 14650
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[01:20:04] Initializing
[01:20:11] Running
[01:20:11] EvaluateFitnessOfStartingGeneSignatures 14650
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
[01:20:05] Initializing
[01:20:10] Running
[01:20:10] EvaluateFitnessOfStartingGeneSignatures 14650
[07:15:46] Writing final output
[07:15:46] Closing Output Stream
[07:15:46] Cleaning up
Result.out = 3092095.000000
Run complete, CPU time: 42634.742859
07:15:46 (16214): called boinc_finish

</stderr_txt>
]]>

-------------------------------------------------------------------------------------------------

Do all three invalid results have the same technical cause?

And here are the two valid results:

Result Log

Result Name: MCM1_ 0001875_ 3958_ 0--
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1875_3958
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 16
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 14650
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 14650
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379723958


[03:53:54] Initializing
[03:54:03] Running
[03:54:03] EvaluateFitnessOfStartingGeneSignatures 14650
[20:03:40] Writing final output
[20:03:41] Closing Output Stream
[20:03:41] Cleaning up
Result.out = 3092358.000000
Run complete, CPU time: 57980.100000
20:03:41 (29658): called boinc_finish

</stderr_txt>
]]>

------------------------------------------------------------------------------------


Second valid result:


Result Log

Result Name: MCM1_ 0001875_ 3958_ 4--
<core_client_version>6.10.45</core_client_version>
<![CDATA[
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Settings File
DateOfDesign = 11/08/2013
Designer = PMCC_OCI
WorkOrderID = 1875_3958
DatasetID = 17_72_SDG_v1
NumberOfGenesInStartingSignature = 16
NumberOfGenesInSignatureMin = 10
NumberOfGenesInSignatureMax = 20
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 14650
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 14650
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
NReps = 9
TrainFrac = 1.0
NFolds = 10
VMethod = OOB
ModelType = SVM
FitnessFn = 0
MinFitness = -1
SvmArgs = "-v 0 -c 0.01 -t 1 -d 3 -r 0"
SvmLearnLimit = 500000
RSeed = 379723958


[19:41:57] Initializing
[19:42:00] Running
[19:42:00] EvaluateFitnessOfStartingGeneSignatures 14650
[06:04:45] Writing final output
[06:04:45] Closing Output Stream
[06:04:45] Cleaning up
Result.out = 3092358.000000
Run complete, CPU time: 37312.414650
06:04:46 (20690): called boinc_finish

</stderr_txt>
]]>

---------------------------------------------------------------------------
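
The Result.out lines in the five logs above can be compared directly. The following is a minimal Python sketch, not the WCG validator (whose logic is not shown in this thread): it assumes that agreement on the Result.out value alone is a reasonable proxy for a match, and the function name agreeing_results is made up for illustration.

```python
from collections import Counter

# Result.out values copied from the five stderr logs in this thread,
# keyed by the result-name suffix (_0 .. _4).
result_out = {
    "_0": 3092358.0,
    "_1": 3092103.0,
    "_2": 3092044.0,
    "_3": 3092095.0,
    "_4": 3092358.0,
}

MIN_QUORUM = 2  # "Minimum Quorum: 2" from the workunit summary


def agreeing_results(values, quorum):
    """Return the keys whose value is shared by at least `quorum` results."""
    counts = Counter(values.values())
    return {key for key, value in values.items() if counts[value] >= quorum}


agreeing = agreeing_results(result_out, MIN_QUORUM)
for key in sorted(result_out):
    label = "agrees" if key in agreeing else "differs"
    print(f"{key}: {result_out[key]:.6f} {label}")
```

With these values, only _0 and _4 share a value (3092358), which matches the two results reported as valid; the three invalid results differ from that value and from each other.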

I know that an invalid result is not a tragedy, but I would still like to know the reason and change something on my machine to avoid invalid results in the future.

Thank you very much for your help,
MS
[Feb 8, 2014 7:19:28 AM]
asdavid
Veteran Cruncher
FRANCE
Joined: Nov 18, 2004
Post Count: 521
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

I can see that all of the invalid results were restarted (the logs show several "Initializing" and "Running" entries).
At the beginning of the project there was a problem with MCM tasks turning invalid when they were stopped. This was solved, but there have been some new occurrences of the problem in the last few days.
So, until we have more feedback from the techs, try to avoid stopping and restarting MCM tasks, and choose to leave them in memory when suspended.
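
To check a log for restarts without reading it line by line, one can count the "Initializing" entries. This is only a rough sketch in Python: it assumes the stderr text has the same shape as the logs quoted in this thread, and the helper name count_restarts is hypothetical.

```python
import re

# Matches lines such as "[00:19:02] Initializing" in an MCM stderr log.
INIT_LINE = re.compile(r"^\[\d{2}:\d{2}:\d{2}\] Initializing[ \t]*$", re.MULTILINE)


def count_restarts(stderr_text: str) -> int:
    """Number of times the task started again after its first start."""
    starts = len(INIT_LINE.findall(stderr_text))
    return max(starts - 1, 0)


# Shortened excerpt of the log for result _1 quoted earlier in this thread.
sample_log = """\
[00:19:02] Initializing
[00:19:05] Running
[10:49:23] Initializing
[10:49:26] Running
[15:39:08] Initializing
[15:39:11] Running
[17:26:59] Writing final output
"""

print(count_restarts(sample_log))  # prints 2
```

A log with a single "Initializing" entry, like the two valid results in this thread, gives 0.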
----------------------------------------
Anne-Sophie

[Feb 8, 2014 3:34:17 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

Greetings, Anne-Sophie,
thank you very much for your answer.
I checked my BOINC settings and found that I had already chosen to leave the
program in memory when suspended at the time the error occurred.

So there is nothing I can change on this point.

Nevertheless, thank you for the hint.

At least the techs will now know that the error occurs even if BOINC is left in memory when suspended.

There must be a different reason for the error.
May the techs find it.
Once again, thank you for your help, Anne-Sophie, and best wishes
from Germany to France
MS
[Feb 8, 2014 5:33:30 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

Hmmm,

I forced six WUs to restart, and all of them went straight into PendingVerification, with two of them already turning invalid.
I looked at some of my repair units, and nearly all of the invalid results had been restarted according to their logs.
Are the bad old times returning?

Matthias
[Feb 10, 2014 8:26:02 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

The techs acknowledged a problem on 5 Feb and are working towards a solution.
[Feb 10, 2014 8:34:13 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

About half of all workunits in this project have invalid results. It sucks enormously. This project only wastes our time and electricity. Today there were two new invalid results. About 24 hours of computing went to ****. I am considering quitting this project.

http://www74.zippyshare.com/v/58645869/file.html


**Edited for inappropriate forum language** TKH
----------------------------------------
[Edit 1 times, last edit by TKH at Mar 17, 2014 12:47:00 PM]
[Mar 15, 2014 7:26:45 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

"About half of all workunits in this project have invalid results"
Those workunits sent to you in February would have been MCM version 7.28, which are known to have that problem if the machine is restarted. Since 13 March, all new MCM units sent out have been version 7.29 which solved the restart problem. So please hang in there and keep contributing!
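
The application version is visible in the Commandline line of each result log, so it can be checked whether a given task ran on 7.28 or on a later build. A small Python sketch, assuming the executable name always follows the wcgrid_mcm1_<major>.<minor>_<platform> pattern seen in this thread; the function names are made up for illustration.

```python
import re

# Executable names in the logs look like "wcgrid_mcm1_7.28_i686-pc-linux-gnu".
APP_VERSION = re.compile(r"wcgrid_mcm1_(\d+)\.(\d+)_")

# Per the post above, 7.29 is the version that solved the restart problem.
FIXED_VERSION = (7, 29)


def mcm_version(commandline: str):
    """Return (major, minor) parsed from a Commandline line, or None if absent."""
    match = APP_VERSION.search(commandline)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def has_restart_fix(commandline: str) -> bool:
    """True if the parsed version is at least FIXED_VERSION."""
    version = mcm_version(commandline)
    return version is not None and version >= FIXED_VERSION


# Commandline copied from the result logs earlier in this thread.
line = (
    "Commandline = ../../projects/www.worldcommunitygrid.org/"
    "wcgrid_mcm1_7.28_i686-pc-linux-gnu -SettingsFile MCM1_0001875_3958.txt "
    "-DatabaseFile dataset-17_72_SDG_v1.txt"
)
print(mcm_version(line), has_restart_fix(line))
```

For the logs quoted in this thread the version parses as (7, 28), so the check reports that the fix is not present.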
[Mar 15, 2014 8:50:06 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

Maybe. My patience was already exhausted... The last WUs with invalid results were sent to me two days ago, but they were still version 7.28.
Thank you for pointing me to the new version!

Jirka234
[Mar 15, 2014 10:40:41 AM]
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7851
Status: Offline
Re: Workunit with a replication of 5, three of them invalid, two valid

About half of all workunits in this project have invalid results. It sucks enormously. This project only wastes our time and electricity. Today there were two new invalid results. About 24 hours of computing went to Hell. I am considering quitting this project.

http://www74.zippyshare.com/v/58645869/file.html

You were not the only one with a rash of these. Hang in there, I am sure they will get better, although you will probably still get a few now and then.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Mar 15, 2014 6:12:37 PM]