Thread Status: Active
Total posts in this thread: 43
This topic has been viewed 5996 times and has 42 replies
Thyme Lawn
Cruncher
Joined: Dec 9, 2008
Post Count: 46
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

3 on an i5-3230M @ 2.6GHz. 2 pending and 1 running at 30% after 1:45:00. Estimate is 5 hours for the pending task, but if the current rate of progress is maintained it looks like the running one will take around 6 hours.
Completed with 5:54:18 CPU time and 6:06:22 elapsed time.
Second task completed on my i5 with 5:47:35 CPU time and 5:56:30 elapsed time, waiting on the wingman.

Both of my PV tasks were successfully validated.
----------------------------------------
"The ultimate test of a moral society is the kind of world that it leaves to its children." - Dietrich Bonhoeffer
----------------------------------------
[Edit 1 times, last edit by Thyme Lawn at Sep 24, 2014 5:06:05 PM]
[Sep 24, 2014 5:04:57 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

I just started the validator for this beta. Members should start seeing results validating.


Thanks, Uplinger. All mine went Valid.
[Sep 24, 2014 5:06:10 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

New Beta WUs seem to process OK. Got 9 of them on my 3930K rig: 7 WUs have validated so far, 1 is pending and the other one is in progress.

Good work guys...
[Sep 24, 2014 7:38:14 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

Aside from the errors already mentioned and explained, all mine have turned Valid except 3 in PVal. Looking good for the new project, I hope.
[Sep 24, 2014 7:57:06 PM]
Crystal Pellet
Veteran Cruncher
Joined: May 21, 2008
Post Count: 1324
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

I got a resend for verification

BETA_ ugm1_ ugm1_ 00014_ 0365_ 3-- - In Progress 9/24/14 19:58:13 9/28/14 19:58:13 0.00 0.0 / 0.0 <-- mine
BETA_ ugm1_ ugm1_ 00014_ 0365_ 2-- 719 Pending Verification 9/24/14 05:34:58 9/24/14 19:58:10 5.51 128.9 / 0.0 <-- several restarts from checkpoints
BETA_ ugm1_ ugm1_ 00014_ 0365_ 0-- 719 Pending Verification 9/23/14 22:21:42 9/24/14 07:01:46 2.55 70.8 / 0.0 <-- from start to end without restarts
BETA_ ugm1_ ugm1_ 00014_ 0365_ 1-- 719 Error 9/23/14 22:21:40 9/24/14 05:34:57 4.32 110.1 / 0.0 <-- Maximum elapsed time exceeded after 8000 query sequences compared.

I restarted mine by shutting down BOINC after the first checkpoint and then starting it again.

Edit: The wingman *_2-- went Invalid. What is strange is that it apparently restored to checkpoints earlier than the most recent one:

Result Name: BETA_ ugm1_ ugm1_ 00014_ 0365_ 2--


<core_client_version>5.10.45</core_client_version>
<![CDATA[
<stderr_txt>
Unable to open checkpoint file starting from 0
Checkpoint restored: 55
Checkpoint restored: 280
Checkpoint restored: 336
500 query sequences compared.
Checkpoint restored: 834
1000 query sequences compared.
Checkpoint restored: 1162
Checkpoint restored: 1108 <-- this one
Checkpoint restored: 1108 <-- this one
Checkpoint restored: 1216
Checkpoint restored: 1275
Checkpoint restored: 1162 <-- this one
1500 query sequences compared.
Checkpoint restored: 1493
1500 query sequences compared.
Checkpoint restored: 1767
2000 query sequences compared.
Checkpoint restored: 2212
Checkpoint restored: 2265
2500 query sequences compared.
Checkpoint restored: 2705
Checkpoint restored: 2943
3000 query sequences compared.
Checkpoint restored: 3211
Checkpoint restored: 3256
Checkpoint restored: 3361
Checkpoint restored: 3309 <-- this one
3500 query sequences compared.
Checkpoint restored: 3527
4000 query sequences compared.
4500 query sequences compared.
5000 query sequences compared.
5500 query sequences compared.
6000 query sequences compared.
6500 query sequences compared.
7000 query sequences compared.
7500 query sequences compared.
8000 query sequences compared.
8500 query sequences compared.
9000 query sequences compared.
9500 query sequences compared.
10000 query sequences compared.
10500 query sequences compared.
11000 query sequences compared.
11500 query sequences compared.
12000 query sequences compared.
12500 query sequences compared.
13000 query sequences compared.
13500 query sequences compared.
14000 query sequences compared.
14500 query sequences compared.
15000 query sequences compared.
15500 query sequences compared.
16000 query sequences compared.
16500 query sequences compared.
17000 query sequences compared.
17500 query sequences compared.
Run complete, CPU time: 19835.077125
22:23:11 (2308): called boinc_finish

</stderr_txt>
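The lines marked "<-- this one" can also be found mechanically. Below is a minimal sketch, assuming the stderr text uses exactly the "Checkpoint restored: N" wording shown above; the function name `find_checkpoint_regressions` is made up for illustration and is not part of BOINC or the WCG application. It flags every restore whose number is lower than one restored earlier in the same log.

```python
import re

# Matches the "Checkpoint restored: N" lines as they appear in the log above.
CHECKPOINT_RE = re.compile(r"Checkpoint restored:\s*(\d+)")


def find_checkpoint_regressions(stderr_text):
    """Return (line_number, restored, highest_so_far) for every restore
    that is lower than a checkpoint restored earlier in the same log."""
    regressions = []
    highest = -1
    for line_number, line in enumerate(stderr_text.splitlines(), start=1):
        match = CHECKPOINT_RE.search(line)
        if not match:
            continue
        restored = int(match.group(1))
        if restored < highest:
            regressions.append((line_number, restored, highest))
        highest = max(highest, restored)
    return regressions


# Excerpt copied from the log above (annotations removed).
sample = """\
Checkpoint restored: 1162
Checkpoint restored: 1108
Checkpoint restored: 1108
Checkpoint restored: 1216
Checkpoint restored: 1275
Checkpoint restored: 1162
"""

for line_number, restored, highest in find_checkpoint_regressions(sample):
    print(f"line {line_number}: restored {restored}, but {highest} was restored earlier")
```

Run over the full log, the same check should also pick up the 3309 restore that follows 3361.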
----------------------------------------
[Edit 1 times, last edit by Crystal Pellet at Sep 25, 2014 5:45:10 AM]
[Sep 24, 2014 8:20:38 PM]
anhhai
Veteran Cruncher
Joined: Mar 22, 2005
Post Count: 839
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

I checked the betas of mine that errored out and noticed that most of them have at least one other wingman that errored out too, while other wingmen were able to complete them successfully. I'm not sure whether this issue is widespread. Does anyone else have it?
----------------------------------------

[Sep 25, 2014 4:18:19 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

I have 3 14 tasks running very well. Did anyone else notice the large file download size on Windows? The task text files were between 4 and 7 meg, if I remember correctly. Will this be normal when the project hits prime time? It will be interesting to see what upload sizes will be like.

Sure did. My largest downloaded file was beta19.ugm1_ugm1_00014_a_0002.txt (8.26MB, zipped down with a compression ratio of 43% to 4.68MB). The input files used for the 2 pending tasks on my i5 are that file plus one other (*_b_0047.txt is 7.48MB and *_b_0048.txt is 7.81MB with similar zip compression ratios of 41% and 43%).

7zip gives even better compression:

  • a_0002 - 59% (3.38MB)
  • b_0047 - 47% (3.99MB)
  • b_0048 - 52% (3.71MB)
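As a quick check of the figures above: the quoted percentages appear to be space savings (how much smaller the archive is than the original), not compressed size divided by original size. A minimal sketch using only the sizes given in this post; the helper name `space_saving_percent` is made up for illustration.

```python
def space_saving_percent(original_mb, compressed_mb):
    """Percentage by which the compressed file is smaller than the original."""
    return (1 - compressed_mb / original_mb) * 100


# (original size, compressed size) in MB, as quoted in the post above.
quoted_sizes = {
    "a_0002 zip": (8.26, 4.68),
    "a_0002 7zip": (8.26, 3.38),
    "b_0047 7zip": (7.48, 3.99),
    "b_0048 7zip": (7.81, 3.71),
}

for label, (original_mb, compressed_mb) in quoted_sizes.items():
    saving = space_saving_percent(original_mb, compressed_mb)
    print(f"{label}: {saving:.0f}% smaller")
```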
Input files are deleted when the last task using them is reported.

It's the berserk cleanup mania of the latest clients once the last task of an app completes. Maybe some countermeasure has been taken since, though: I have not had cep2 for quite a while, but I still see the 2 big zips and the app in the wcg project folder.

Seems someone was reading my mind before I typed the comment above. It is something for a future client, but it also needs a change on the server side.


David Anderson [Tue, 23 Sep 2014 19:39:09 +0000]
client: add notion of sticky file lifetime

If a <file_info> in scheduler RPC reply specifies <sticky_lifetime>,
the client will calculate a "sticky_expire_time",
store it in the client state file,
and make the file unsticky when that time is reached.

Note: if a later RPC reply includes the same file,
the client will update sticky_expire_time.
So if a file is used repeatedly it won't get expired.
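The behaviour described in that commit message can be sketched roughly as follows. This is only an illustration of the logic as worded above, not BOINC's actual client code: the class names, method names, and the use of plain seconds for times are all assumptions.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class FileInfo:
    # Hypothetical stand-in for the client's per-file record.
    name: str
    sticky: bool = False
    sticky_expire_time: Optional[float] = None


class ClientState:
    # Hypothetical stand-in for the client state file.
    def __init__(self) -> None:
        self.files: Dict[str, FileInfo] = {}

    def handle_rpc_file(self, name: str, sticky_lifetime: float, now: float) -> FileInfo:
        """A scheduler reply names a sticky file with a <sticky_lifetime>.

        The expiry is (re)computed on every reply that includes the file,
        so a file that keeps being used keeps getting its expiry pushed back."""
        info = self.files.setdefault(name, FileInfo(name))
        info.sticky = True
        info.sticky_expire_time = now + sticky_lifetime
        return info

    def expire_sticky_files(self, now: float) -> None:
        """Make files unsticky once their sticky_expire_time is reached."""
        for info in self.files.values():
            if (
                info.sticky
                and info.sticky_expire_time is not None
                and now >= info.sticky_expire_time
            ):
                info.sticky = False


state = ClientState()
state.handle_rpc_file("input_a.txt", sticky_lifetime=100, now=0)
state.handle_rpc_file("input_a.txt", sticky_lifetime=100, now=90)  # reused: expiry moves to 190
state.expire_sticky_files(now=150)
print(state.files["input_a.txt"].sticky)
```

Once a file is unsticky, the usual cleanup mentioned earlier (deleting input files when the last task using them is reported) would presumably apply again.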

[Sep 25, 2014 4:32:00 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

anhhai, you don't give any details of the errors, but if they're the "max elapsed time exceeded" variety, then, yes, others are seeing the situation you mention and Keith has commented (implying the time-limit estimates will sort themselves out in production) - see earlier posts in this thread.
[Sep 25, 2014 6:04:40 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

Anhhai, please read Tony's post, he explains it well there.

As for this beta test, things are going well. We are planning on adding more work units into the mix that will test the sizing functionality of the build script, so that we don't see some work units that run for 15 minutes and others that run for 4 hours.

Again, thank you for your participation in the BETA!

-Uplinger
[Sep 25, 2014 9:04:46 PM]
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7699
Status: Offline
Re: New BETA test - Sept 23, 2014 [ Issues Thread ]

Managed to break one anyway.
BETA_ ugm1_ ugm1_ 00015_ 0003_ 1--
Invalid 9/24/14 00:48:01 9/25/14 08:57:44 4.84 / 4.84 71.3 / 71.3

Result Log

Result Name: BETA_ ugm1_ ugm1_ 00015_ 0003_ 1--
<core_client_version>7.0.27</core_client_version>
<![CDATA[
<stderr_txt>
Unable to open checkpoint file starting from 0
11:18:53 (7453): No heartbeat from client for 30 sec - exiting
11:18:53 (7453): timer handler: client dead, exiting
Unable to open checkpoint file starting from 0
500 query sequences compared.
1000 query sequences compared.
1500 query sequences compared.
2000 query sequences compared.
2500 query sequences compared.
3000 query sequences compared.
3500 query sequences compared.
4000 query sequences compared.
12:30:16 (7492): No heartbeat from client for 30 sec - exiting
12:30:16 (7492): timer handler: client dead, exiting
Checkpoint restored: 4297
4500 query sequences compared.
5000 query sequences compared.
12:45:00 (7630): No heartbeat from client for 30 sec - exiting
12:45:00 (7630): timer handler: client dead, exiting
Checkpoint restored: 3201
12:49:29 (7675): No heartbeat from client for 30 sec - exiting
12:49:29 (7675): timer handler: client dead, exiting
Checkpoint restored: 3201
3500 query sequences compared.
.
.
.
17000 query sequences compared.
17500 query sequences compared.
Run complete, CPU time: 17415.672135
16:33:34 (7695): called boinc_finish

I have a few more in pending verification as this one was, so I suspect they may become invalid also.

Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Sep 26, 2014 2:16:14 AM]