Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 148
Posts: 148   Pages: 15   [ Previous Page | 6 7 8 9 10 11 12 13 14 15 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 17505 times and has 147 replies Next Thread
simjoe
Cruncher
Joined: Dec 4, 2013
Post Count: 35
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Got several jobs out of this beta, all run fine. I only saw a minor issue with the checkpoints. It looks like that they get mixed up if the job get interrupted frequently in a short time ( reboot of the VM here ( win7 VM )) and then result invalid.
BETA_ ugm1_ ugm1_ 00031_ 1846_ 1--
BETA_ ugm1_ ugm1_ 00031_ 1877_ 1--
[Oct 13, 2014 7:18:57 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

All the work units that had two copies still out in progress have been marked for server abort. This ended up being 471 work units.

Thanks,
-Uplinger
[Oct 13, 2014 7:56:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Hype
Cruncher
Germany
Joined: Nov 18, 2011
Post Count: 43
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

I've got WU BETA_betaugm1_ugm1_00010_0863, which is stuck at 30,889% :o
----------------------------------------

[Oct 13, 2014 8:34:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

RoundFour,

Please feel free to abort that work unit.

Thanks,
-Uplinger
[Oct 13, 2014 8:35:11 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mamajuanauk
Master Cruncher
United Kingdom
Joined: Dec 15, 2012
Post Count: 1900
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

All the work units that had two copies still out in progress have been marked for server abort. This ended up being 471 work units.

Thanks,
-Uplinger
Many thanks Uplinger
----------------------------------------
Mamajuanauk is the Name! Crunching is the Game!



[Oct 13, 2014 8:41:54 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Yarensc
Advanced Cruncher
USA
Joined: Sep 24, 2011
Post Count: 136
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Anybody else seeing WU's marked as Too Late when they were completely (seemingly) successfully and returned on time? For reference:

BETA_ betaugm1_ ugm1_ 00049_ 1304_ 1
BETA_ betaugm1_ ugm1_ 00027_ 0195_ 1
[Oct 13, 2014 8:54:43 PM]   Link   Report threatening or abusive post: please login first  Go to top 
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

Yarensc,

I have set all workunits for this to have a total number of workunits sent to 2. That way no additional copies are sent. If someone returns the first result back as aborted, or some other error. The the entire workunit will get marked as an error, which should cause yours to stop, and get thrown into a bucket called Too Late, even though they technically are not. It is something I have done with these beta work units. You should still get credit for the work you have done.

Thanks,
-Uplinger
[Oct 13, 2014 8:57:44 PM]   Link   Report threatening or abusive post: please login first  Go to top 
mt4cancer
Cruncher
Joined: Aug 10, 2011
Post Count: 11
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

I had two betas running on an i7-2600 (8-thread) running Ubuntu 14.04. Both had been running for about 24 hours, but upon reboot of the system, they both reset to zero, and I aborted them.

Result Name: BETA_ betaugm1_ ugm1_ 00011_ 0111_ 0--
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
aborted by user
</message>
<stderr_txt>
Unable to open checkpoint file starting from 0
Unable to open checkpoint file starting from 0

</stderr_txt>
]]>
[Oct 13, 2014 9:58:35 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Yarensc
Advanced Cruncher
USA
Joined: Sep 24, 2011
Post Count: 136
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

If someone returns the first result back as aborted, or some other error. The the entire workunit will get marked as an error, which should cause yours to stop, and get thrown into a bucket called Too Late


Sorry, forgot to mention that there were only two sent out for both of these and I was the first to return a result on both of them, just FYI. Good luck with the final bits of the project!

Edit: they seem to change to valid when the wingman returns.
----------------------------------------
[Edit 1 times, last edit by Yarensc at Oct 13, 2014 10:52:31 PM]
[Oct 13, 2014 10:51:03 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Seoulpowergrid
Veteran Cruncher
Joined: Apr 12, 2013
Post Count: 818
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: New BETA test - Oct 6, 2014 [ Issues Thread ]

6 received
4 valid
1 pending
1 "too late" sent 10/12 returned 10/13 the same dates for sent and returned and same "too late" result as my wingman.
BETA_betaugm1_ugm1_00042_0293

Result log:
<core_client_version>7.2.42</core_client_version>
<![CDATA[
<stderr_txt>
Unable to open checkpoint file starting from 0
500 query sequences compared.
1000 query sequences compared.
....
49500 query sequences compared.
Run complete, CPU time: 21244.075779
05:46:04 (29712): called boinc_finish
----------------------------------------

[Oct 14, 2014 12:24:58 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 148   Pages: 15   [ Previous Page | 6 7 8 9 10 11 12 13 14 15 | Next Page ]
[ Jump to Last Post ]
Post new Thread