Thread Status: Active
Total posts in this thread: 141
Posts: 141   Pages: 15
This topic has been viewed 23825 times and has 140 replies
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1317
Status: Recently Active
Re: Project Status (First Post Updated)

Regarding MCM1 initial tasks waiting to be sent:

It appears that the tasks Adri identified have now been sent out, as has that odd one I mentioned a while back where I got [and returned] one of the initial tasks but the other didn't get sent at that time.

However, Sgt. Joe's strange WU (where only one initial task went out, failed, and a retry was issued and returned) still seems to be stuck, so the fix didn't (or couldn't) pick that one up :-(

Cheers - Al.

P.S. Retries for MCM1 late returners still seem to be getting stuck...
[Jun 28, 2024 5:27:50 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1293
Status: Offline
Re: Project Status (First Post Updated)

It's Monday !

We still have an MCM "Waiting to be sent" issue, but from reading the posts it looks like some forgotten WUs got cleaned up.

We are waiting on ARPs to be sent out soon. I'm trying to be patient as they sort out a new process and test it. There will be bumps in the process; please be kind to the tech team.
[Jul 1, 2024 2:08:22 PM]
hchc
Veteran Cruncher
USA
Joined: Aug 15, 2006
Post Count: 865
Status: Offline
Re: Project Status (First Post Updated)

Wonder how big the tech team is and if they're part-time for WCG and divided among other duties/projects/organizations.
----------------------------------------
  • i5-7500 (Kaby Lake, 4C/4T) @ 3.4 GHz
  • i5-4590 (Haswell, 4C/4T) @ 3.3 GHz
  • i5-3570 (Ivy Bridge, 4C/4T) @ 3.4 GHz

[Jul 1, 2024 7:52:58 PM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1317
Status: Recently Active
Re: Project Status (First Post Updated)

Some of the older Waiting to be sent tasks now appear to be getting sent out -- I've had quite a few retries since around 08:30 (UTC) after over a week with hardly any sign of retries! Also, a very small number of my tasks that were waiting for a retry to free up now have "In Progress" instead of Waiting :-) -- progress at last!

I was also getting a supply of new work at the same time as those retries, rather than not getting new work at all when retries are in the pipeline; that makes a refreshing change from past experience, and I hope the good behaviour continues :-)

So far it only seems to have got as far as retries that have been waiting for over a week; without knowing how large the backlog was, who knows how long it'll take to clear...

If this is the result of Tech Team efforts after Canada Day, my thanks :-)

Cheers - Al.
[Jul 3, 2024 2:00:59 PM]
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7844
Status: Offline
Re: Project Status (First Post Updated)

I have also noticed a new rash of retries; some are _2's and a few are _3's. At least some of the logjam seems to have broken.

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Jul 3, 2024 2:54:04 PM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2346
Status: Offline
Re: Project Status (First Post Updated)

On the 23rd of May, alanb1951 (Al) added his post 696777.
Since others may not have seen what Al was really talking about, and I failed to show the Error Log and the OS version when I listed the designated workunit, I'm finally adding the details below for future reference. :-D
$ wcgstats -fve= 510764828 -S
workunit 510764828
MCM1_0216199_5325_0  Linux Sparky  Error                 2024-04-28T08:34:36  2024-04-28T09:35:05    0.00/0.00
MCM1_0216199_5325_1                Waiting to be sent                                                0.00/0.00
MCM1_0216199_5325_2  Linuxmint     Pending Validation    2024-04-28T09:35:11  2024-04-28T17:48:45    3.12/3.12
Details: ---------------------------------------------------------------------------------------------------------------------------------------
MCM1_0216199_5325_0  Linux Sparky  Error                 2024-04-28T08:34:36  2024-04-28T09:35:05    0.00/0.00
OS-Version: SparkyLinux 8 (Seven-Sisters) [6.8.0-1001-oracle|libc 2.39]
Logfile:
<core_client_version>7.24.1</core_client_version>
<message>
process exited with code 2 (0x2, -254)</message>
<stderr_txt>
Process creation (../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_map_7.61_aarch64-android-linux-gnu) failed: Error -1, errno=2
execv: No such file or directory

</stderr_txt>
MCM1_0216199_5325_1                Waiting to be sent                                                0.00/0.00
MCM1_0216199_5325_2  Linuxmint     Pending Validation    2024-04-28T09:35:11  2024-04-28T17:48:45    3.12/3.12
OS-Version: LMDE 4 (debbie) [4.19.0-8-amd64|libc 2.28 (Debian GLIBC 2.28-10)]
Logfile:
<core_client_version>7.14.2</core_client_version>
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_map_7.61_x86_64-pc-linux-gnu -SettingsFile MCM1_0216199_5325.txt -DatabaseFile dataset-curatedOvarian_EarlyLate_v1.0.txt
Settings File
DateOfDesign = 08/05/2014
Designer = Krembil-cubes-2023-09-22
WorkOrderID = 0216199_5325
DatasetID = curatedOvarian_EarlyLate_v1.0
NumberOfGenesInStartingSignature = 60
NumberOfGenesInSignatureMin = 60
NumberOfGenesInSignatureMax = 60
GroupVectorValues = {A}{B}{C}{D}{E}{F}
ExplicitStartingGeneSignatures = A B D F
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
SearchAlgorithmNumberToCreate = 3576
SearchAlgorithmSequentialStartPosition = 5
RunPermutationAlgorithm = 0
PermutationGroups = A
PermutationGroupsForReplacement = G
PermutationAlgorithm = replaceFromRandomlyToRandomlyGreedy
PermutationsNumIterations = 0
OptimizationAlgorithmFrequency = 0 0 1
FBeta = 1.5
SimAnnealIMax = 20000
SimAnnealAlpha = 0.9996
FitnessFn = 0
MinFitness = 0.274771
NReps = 10
TrainFrac = 0.7
NFolds = 10
VMethod = LOO
ModelType = SVM
SvmArgs = "-v 0 -t 1 -d 3 -s 0.03 -r 10"

SvmLearnLimit = 500000
RSeed = 111115425


[06:20:01] Initializing
[06:20:05] Running
[06:20:05] EvaluateFitnessOfStartingGeneSignatures 3576
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_map_7.61_x86_64-pc-linux-gnu -SettingsFile MCM1_0216199_5325.txt -DatabaseFile dataset-curatedOvarian_EarlyLate_v1.0.txt
[09:56:53] Initializing
[09:57:02] Running
[09:57:02] EvaluateFitnessOfStartingGeneSignatures 3576
[12:46:53] Writing final output
[12:46:53] Closing Output Stream
[12:46:53] Cleaning up
Result.out = 1490.000000
Run complete, CPU time: 11218.272765
12:46:53 (1505): called boinc_finish(0)

</stderr_txt>
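
For anyone who wants to pick result names like the ones in the listing above apart in a script, here is a minimal Python sketch. It is not part of wcgstats; the names `ResultName`, `parse_result_name` and `is_retry` are made up for illustration. It assumes MCM1 names have the shape project, work order ID (as shown in the `WorkOrderID` line of the log), then a copy number, and that the first two copies (_0 and _1) are the initial tasks, so _2 and above are retries. That last point is an assumption based on this thread, not on any documented rule.

```python
import re
from typing import NamedTuple


class ResultName(NamedTuple):
    """Pieces of a result name such as MCM1_0216199_5325_2 (hypothetical helper type)."""
    project: str
    work_order: str
    copy_number: int


# Assumed layout: <project>_<digits>_<digits>_<copy number>
_NAME_RE = re.compile(
    r"^(?P<project>[A-Za-z0-9]+)_(?P<work_order>\d+_\d+)_(?P<copy>\d+)$"
)


def parse_result_name(name: str) -> ResultName:
    """Split a result name into project, work order ID and copy number."""
    match = _NAME_RE.match(name.strip())
    if match is None:
        raise ValueError(f"unrecognised result name: {name!r}")
    return ResultName(match["project"], match["work_order"], int(match["copy"]))


def is_retry(result: ResultName, initial_copies: int = 2) -> bool:
    """True if the copy number is beyond the assumed number of initial copies."""
    return result.copy_number >= initial_copies


if __name__ == "__main__":
    for raw in ("MCM1_0216199_5325_0", "MCM1_0216199_5325_1", "MCM1_0216199_5325_2"):
        parsed = parse_result_name(raw)
        kind = "retry" if is_retry(parsed) else "initial"
        print(f"{raw}: work order {parsed.work_order}, copy {parsed.copy_number} ({kind})")
```

With the three names from workunit 510764828, this reports _0 and _1 as initial tasks and _2 as a retry, which matches what the listing shows (the _1 copy is the one still waiting to be sent).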

Adri
[Jul 3, 2024 10:01:31 PM]
Hans Sveen
Veteran Cruncher
Norge
Joined: Feb 18, 2008
Post Count: 982
Status: Offline
Re: Project Status (First Post Updated)

Welcome back!
Everything seems to function as expected 👌😊
----------------------------------------
[Edit 1 times, last edit by Hans Sveen at Jul 15, 2024 5:00:30 PM]
[Jul 11, 2024 8:00:59 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1293
Status: Offline
Re: Project Status (First Post Updated)

The latest ARP update says they are internally testing this weekend, and IF all goes well we will get ARP next week. How exciting. I'm hoping all goes well with the internal test.
[Jul 12, 2024 9:15:27 PM]
Unixchick
Veteran Cruncher
Joined: Apr 16, 2020
Post Count: 1293
Status: Offline
Re: Project Status (First Post Updated)

I'm hoping for an ARP update today. I hope the testing went well this weekend, and we get some WUs this week. Whatever the outcome, I hope we get some news.
[Jul 15, 2024 2:08:29 PM]
alanb1951
Veteran Cruncher
Joined: Jan 20, 2006
Post Count: 1317
Status: Recently Active
Re: Project Status (First Post Updated)

Regarding ARP restart:

I've been monitoring the three ARP1 "stats" files throughout the hiatus, and yesterday saw a change to state.txt -- signs that something is happening :-)

On 2024-07-15 the file was the same as it had been since the last pre-hiatus activity; on 2024-07-16 the values had changed, losing 441 units as a result. Given that there is probably a lot of background activity regarding the unit data, this may well be a temporary loss that resolves once we re-enter production mode.

If it turns out that some units have been deemed unprocessable (remember those units that needed special time-slice treatment to continue?) I'd expect WCG to let us know that as part of the restart announcement. If, however, the full complement of 35607 units is still in play and the discrepancy remains once work has flowed for a few days, speculation as to causes may begin :-)

Hopefully, new work is truly imminent now, and I await the next stats files updates [and any upcoming announcements - hint, hint :-)] with interest!

Cheers - Al.
[Jul 17, 2024 5:02:01 AM]