Thread Status: Active
Total posts in this thread: 32
Posts: 32   Pages: 4   [ 1 2 3 4 | Next Page ]
This topic has been viewed 4709 times and has 31 replies
Albatros010
Cruncher
Joined: Aug 8, 2007
Post Count: 14
Status: Offline
Suddenly lots of Invalid WUs

Hi,

Suddenly there are lots of invalid WUs. Here's an example:


Result Log

Result Name: FAH2_ 001421_ avx17385-0_ 000004_ 000068_ 145_ 3--


<core_client_version>7.14.2</core_client_version>
<![CDATA[
<stderr_txt>
INFO: result number = 3
%IMPACT-I: Requested file to open for appending md.out Does not exist.
Opening it as a new file.
%IMPACT-I: Softcore binding energy with umax = 1000.00000
%IMPACT-I: Using AGBNP2: Analytical Generalized Born Model + Analytic
Non-Polar Hydration Model
%IMPACT-I: Hybrid potential for binding with lambda = 0.23370
agbnpf_assign_parameters(): info: attempting to load from SQL tables.
[21:09:15] INFO: Checkpointed. Progress 500 of 10000 steps complete CPU time 179.703125
[21:12:18] INFO: Checkpointed. Progress 1000 of 10000 steps complete CPU time 348.000000
[21:15:37] INFO: Checkpointed. Progress 1500 of 10000 steps complete CPU time 516.875000
[21:19:08] INFO: Checkpointed. Progress 2000 of 10000 steps complete CPU time 690.250000
[21:22:35] INFO: Checkpointed. Progress 2500 of 10000 steps complete CPU time 865.359375
[21:26:00] INFO: Checkpointed. Progress 3000 of 10000 steps complete CPU time 1037.734375
[21:29:28] INFO: Checkpointed. Progress 3500 of 10000 steps complete CPU time 1208.531250
[21:33:01] INFO: Checkpointed. Progress 4000 of 10000 steps complete CPU time 1378.250000
[21:36:30] INFO: Checkpointed. Progress 4500 of 10000 steps complete CPU time 1550.875000
[21:40:07] INFO: Checkpointed. Progress 5000 of 10000 steps complete CPU time 1723.640625
INFO: result number = 3
%IMPACT-I: Softcore binding energy with umax = 1000.00000
%IMPACT-I: Using AGBNP2: Analytical Generalized Born Model + Analytic
Non-Polar Hydration Model
%IMPACT-I: Hybrid potential for binding with lambda = 0.23370
agbnpf_assign_parameters(): info: attempting to load from SQL tables.
[21:45:21] INFO: Checkpointed. Progress 5500 of 10000 steps complete CPU time 1893.141125
[21:48:51] INFO: Checkpointed. Progress 6000 of 10000 steps complete CPU time 2066.063000
[21:52:10] INFO: Checkpointed. Progress 6500 of 10000 steps complete CPU time 2237.641125
[21:55:28] INFO: Checkpointed. Progress 7000 of 10000 steps complete CPU time 2412.094250
[21:58:43] INFO: Checkpointed. Progress 7500 of 10000 steps complete CPU time 2585.625500
[22:01:58] INFO: Checkpointed. Progress 8000 of 10000 steps complete CPU time 2762.484875
[22:05:16] INFO: Checkpointed. Progress 8500 of 10000 steps complete CPU time 2938.500500
[22:08:27] INFO: Checkpointed. Progress 9000 of 10000 steps complete CPU time 3112.469250
[22:11:38] INFO: Checkpointed. Progress 9500 of 10000 steps complete CPU time 3283.344250
[22:14:51] INFO: Checkpointed. Progress 10000 of 10000 steps complete CPU time 3455.344250
%IMPACT-I: Species 1 written to SQL file md-out1.dms
%IMPACT-I: Species 2 written to SQL file md-out2.dms
22:14:52 (11384): called boinc_finish(0)

</stderr_txt>
]]>
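For anyone who wants to check their own result logs the same way, here is a minimal Python sketch that summarizes a stderr log like the one above. It rests on an assumption not confirmed anywhere in this thread: that a repeated "INFO: result number" header means the task was resumed from a checkpoint. The function name `summarize_stderr` and the `SAMPLE` excerpt are purely illustrative.

```python
import re

# Matches checkpoint lines in the format shown in the log above.
CHECKPOINT = re.compile(
    r"\[(\d{2}:\d{2}:\d{2})\] INFO: Checkpointed\. "
    r"Progress (\d+) of (\d+) steps complete CPU time ([\d.]+)"
)

# A short excerpt copied from the log above, for illustration only.
SAMPLE = """INFO: result number = 3
[21:40:07] INFO: Checkpointed. Progress 5000 of 10000 steps complete CPU time 1723.640625
INFO: result number = 3
[22:14:51] INFO: Checkpointed. Progress 10000 of 10000 steps complete CPU time 3455.344250
22:14:52 (11384): called boinc_finish(0)
"""


def summarize_stderr(text):
    """Count start headers and checkpoints, and report the last checkpoint."""
    starts = 0
    checkpoints = []
    for raw in text.splitlines():
        line = raw.strip()
        # Assumption: each header line marks one (re)start of the task.
        if line.startswith("INFO: result number"):
            starts += 1
        m = CHECKPOINT.search(line)
        if m:
            checkpoints.append((int(m.group(2)), int(m.group(3)), float(m.group(4))))
    last = checkpoints[-1] if checkpoints else None
    return {
        "starts": starts,
        "restarts": max(starts - 1, 0),
        "checkpoints": len(checkpoints),
        "finished": last is not None and last[0] == last[1],
        "cpu_seconds": last[2] if last else None,
    }


if __name__ == "__main__":
    print(summarize_stderr(SAMPLE))
```

A log that ran all steps and called boinc_finish(0) yet was still marked invalid would point away from a crash on the client side, but this sketch cannot tell why the validator rejected the result.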

Regards Uli
[Nov 12, 2018 9:28:29 PM]
cz50975
Advanced Cruncher
Joined: Dec 9, 2004
Post Count: 91
Status: Offline
Re: Suddenly lots of Invalid WUs

Hi,

All my FAH2 uploads after 15:45 today were validated as INVALID: close to 100 in total, from 4 different machines.

Regards,

MCh
[Nov 12, 2018 10:48:09 PM]
Steve W
Advanced Cruncher
Joined: Dec 9, 2005
Post Count: 110
Status: Offline
Re: Suddenly lots of Invalid WUs

I don't get many FAH2 WUs, but I got one and it is also flagged as invalid: WU FAH2_001752_avx62808-1-1_000003_000024_133.

No restarts, just a continuous run through the WU and it got flagged as invalid.

The above WU currently has 3 invalids, one server aborted and one in progress.

A tech might need to look into it, or pass it back to the scientists to find the problem.
[Nov 12, 2018 10:51:41 PM]
cz50975
Advanced Cruncher
Joined: Dec 9, 2004
Post Count: 91
Status: Offline
Re: Suddenly lots of Invalid WUs

BTW all MIP1 uploads from the same machines after 15:45 validated as VALID.
[Nov 12, 2018 10:59:45 PM]
UBT - JohnR
Cruncher
Joined: Apr 30, 2006
Post Count: 35
Status: Offline
Re: Suddenly lots of Invalid WUs

After about 14.34, all were marked invalid: about 350 WUs, from all my computers.
[Nov 12, 2018 11:47:43 PM]
AMuthig
Advanced Cruncher
USA
Joined: Nov 30, 2013
Post Count: 59
Status: Offline
Re: Suddenly lots of Invalid WUs

Same with me. 600+ invalids for this project only, approximately the same time frame as stated above.
[Nov 13, 2018 1:41:50 AM]
Macroman
Advanced Cruncher
Joined: Jun 4, 2005
Post Count: 112
Status: Offline
Re: Suddenly lots of Invalid WUs

I've had about 30 in the same time period. I'm turning off the project and killing my WUs here.
[Nov 13, 2018 3:41:54 AM]
Sabrina Tarson
Advanced Cruncher
United States
Joined: Jun 27, 2012
Post Count: 149
Status: Offline
Re: Suddenly lots of Invalid WUs

Can confirm: I also have probably 20+ invalid work units from this time. I turned off the project and aborted those in my cache.
[Edit 1 times, last edit by Chase Tarson at Nov 13, 2018 3:55:18 AM]
[Nov 13, 2018 3:53:33 AM]
ca05065
Senior Cruncher
Joined: Dec 4, 2007
Post Count: 325
Status: Offline
Re: Suddenly lots of Invalid WUs

I have also de-selected FAH2 after noticing 50+ invalid work units.
[Nov 13, 2018 7:24:30 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Suddenly lots of Invalid WUs

Several pages of invalid results, with quite a few server-aborted ones in between. That suggests the server side is aware of the problem, yet it is still distributing new tasks that go invalid.

FAH2_ 001551_ avx38748-2_ 000005_ 000010_ 143_ 0-- 3113135 Invalid 11/13/18 02:39:54 11/13/18 07:08:21 2.05 / 2.37 33.7 / 0.0
FAH2_ 001530_ avx38742-5_ 000007_ 000077_ 137_ 2-- 3113135 Invalid 11/13/18 02:32:14 11/13/18 06:59:30 2.00 / 2.31 32.9 / 0.0
FAH2_ 001592_ avx38779-7-1_ 000009_ 000009_ 135_ 2-- 3113135 Invalid 11/13/18 02:12:53 11/13/18 06:49:23 2.07 / 2.39 34.1 / 0.0
FAH2_ 001740_ avx62778-0-2_ 000007_ 000037_ 132_ 1-- 3113135 Invalid 11/13/18 02:06:40 11/13/18 06:42:47 2.02 / 2.32 33.0 / 0.0
FAH2_ 001447_ avx17558-0_ 000001_ 000016_ 143_ 4-- 3113135 Invalid 11/13/18 00:12:18 11/13/18 04:47:29 1.96 / 2.24 31.9 / 0.0
FAH2_ 001516_ avx38719-0_ 000005_ 000093_ 147_ 1-- 3113135 Invalid 11/13/18 00:10:07 11/13/18 04:26:00 1.94 / 2.22 31.6 / 0.0
FAH2_ 001518_ avx38741-0-1_ 000005_ 000042_ 128_ 0-- 3113135 Invalid 11/13/18 00:10:07 11/13/18 04:23:47 2.00 / 2.29 32.6 / 0.0
FAH2_ 001504_ avx38705-1_ 000002_ 000099_ 138_ 3-- 3113135 Server Aborted 11/13/18 01:12:29 11/13/18 03:21:12 0.00 / 0.00 0.0 / 0.0
FAH2_ 001475_ avx17684m-0-1_ 000008_ 000076_ 138_ 0-- 3113135 Invalid 11/12/18 21:43:55 11/13/18 02:39:54 2.19 / 2.52 35.9 / 0.0
FAH2_ 001351_ avx101139-1_ 000007_ 000079_ 149_ 2-- 3113135 Invalid 11/12/18 21:48:11 11/13/18 02:32:14 2.04 / 2.35 33.5 / 0.0
FAH2_ 001475_ avx17684m-0-1_ 000004_ 000096_ 142_ 0-- 3113135 Invalid 11/12/18 21:41:51 11/13/18 02:12:53 2.03 / 2.34 33.3 / 0.0
FAH2_ 001413_ avx17375-2_ 000003_ 000091_ 139_ 3-- 3113135 Invalid 11/12/18 21:27:18 11/13/18 02:06:40 2.02 / 2.32 33.1 / 0.0
FAH2_ 001445_ avx17557-2_ 000008_ 000008_ 147_ 3-- 3113135 Server Aborted 11/12/18 23:47:21 11/13/18 01:12:29 0.00 / 0.00 0.0 / 0.0
FAH2_ 001468_ avx17680-0_ 000001_ 000001_ 140_ 0-- 3113135 Invalid 11/12/18 21:12:25 11/13/18 00:12:18 2.02 / 2.36 33.6 / 0.0
FAH2_ 001763_ gl5243106-0_ 000006_ 000060_ 123_ 2-- 3113135 Invalid 11/12/18 19:32:00 11/13/18 00:10:07 2.02 / 2.35 33.5 / 0.0
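
To count how many pasted results fall into each status, a list like the one above can be tallied with a short Python sketch. The column meanings are my reading of the rows, not something stated in this thread: result name, a device identifier, status, sent time, return time, CPU / elapsed hours, claimed / granted credit. The names `ROW`, `ROWS` and `tally` are hypothetical, and the three sample rows are copied from the list above.

```python
import re
from collections import Counter

# One pasted result row; field names reflect the assumed column meanings.
ROW = re.compile(
    r"^(?P<name>\S.*?--)\s+"
    r"(?P<device>\d+)\s+"
    r"(?P<status>[A-Za-z ]+?)\s+"
    r"(?P<sent>\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2})\s+"
    r"(?P<returned>\d{2}/\d{2}/\d{2} \d{2}:\d{2}:\d{2})\s+"
    r"(?P<cpu>[\d.]+) / (?P<elapsed>[\d.]+)\s+"
    r"(?P<claimed>[\d.]+) / (?P<granted>[\d.]+)$"
)

# Three rows copied from the list above.
ROWS = """\
FAH2_ 001530_ avx38742-5_ 000007_ 000077_ 137_ 2-- 3113135 Invalid 11/13/18 02:32:14 11/13/18 06:59:30 2.00 / 2.31 32.9 / 0.0
FAH2_ 001504_ avx38705-1_ 000002_ 000099_ 138_ 3-- 3113135 Server Aborted 11/13/18 01:12:29 11/13/18 03:21:12 0.00 / 0.00 0.0 / 0.0
FAH2_ 001468_ avx17680-0_ 000001_ 000001_ 140_ 0-- 3113135 Invalid 11/12/18 21:12:25 11/13/18 00:12:18 2.02 / 2.36 33.6 / 0.0
"""


def tally(text):
    """Return (status counts, total claimed credit, total granted credit)."""
    counts = Counter()
    claimed = 0.0
    granted = 0.0
    for raw in text.splitlines():
        m = ROW.match(raw.strip())
        if not m:
            continue  # skip blank lines and anything in another format
        counts[m.group("status")] += 1
        claimed += float(m.group("claimed"))
        granted += float(m.group("granted"))
    return counts, claimed, granted


if __name__ == "__main__":
    counts, claimed, granted = tally(ROWS)
    print(dict(counts), round(claimed, 1), round(granted, 1))
```

Rows that do not fit the assumed layout are skipped silently, so it is worth comparing the total count against the number of lines pasted in.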

[Nov 13, 2018 8:50:53 AM]