Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 60
Posts: 60   Pages: 6   [ Previous Page | 1 2 3 4 5 6 ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 11010 times and has 59 replies Next Thread
phytell
Cruncher
Joined: Sep 8, 2014
Post Count: 39
Status: Offline
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

Can anyone confirm for me whether invalid results for this project are actually doing anything for the research?
If everything that's being sent in is still making a difference then I'm happy, but if invalid means "doesn't matter and will be redone", then I'm wasting a whole bunch of time and there's really no reason to continue working on this project until the invalid issue is resolved (reached silver badge recently on 6 threads - if that's wasted its going to be rather disheartening).
[Dec 10, 2015 12:07:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
petehardy
Senior Cruncher
USA
Joined: May 4, 2007
Post Count: 318
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

Got my first Invalid.

Result Name: FAH2_ avx38783-ls_ 000030_ 0013_ 024_ wcgfahb00020000_ 0-- 

<core_client_version>7.6.9</core_client_version>
<![CDATA[
<stderr_txt>
[02:33:03] INFO:Turning trickle messaging on.
[02:33:03] INFO:Turning intermediate uploads on.
%IMPACT-I: Requested file to open for appending md.out Does not exist.
Opening it as a new file.
%IMPACT-I: Softcore binding energy with umax = 1000.00000
%IMPACT-I: Using AGBNP2: Analytical Generalized Born Model + Analytic
Non-Polar Hydration Model
%IMPACT-I: Hybrid potential for binding with lambda = 0.00330
agbnpf_assign_parameters(): info: attempting to load from SQL tables.
[02:41:27] INFO: Checkpoint skipped. Progress 1000/100000 CPU time 501.671875
[02:49:35] INFO: Checkpointed. Progress 2000 of 100000 steps complete CPU time 987.828125
[02:57:40] INFO: Checkpoint skipped. Progress 3000/100000 CPU time 1472.000000
[03:05:55] INFO: Checkpoint skipped. Progress 4000/100000 CPU time 1961.984375
[03:14:09] INFO: Checkpointed. Progress 5000 of 100000 steps complete CPU time 2455.531250
[03:22:32] INFO: Checkpoint skipped. Progress 6000/100000 CPU time 2951.593750
[03:30:35] INFO: Checkpointed. Progress 7000 of 100000 steps complete CPU time 3433.140625
[03:38:45] INFO: Checkpoint skipped. Progress 8000/100000 CPU time 3923.406250
[03:47:08] INFO: Checkpointed. Progress 9000 of 100000 steps complete CPU time 4423.437500
[03:55:29] INFO: Sending trickle message to server.
[03:55:29] INFO: Starting intermediate upload, index = 1
[03:55:29] INFO: Checkpoint skipped. Progress 10000/100000 CPU time 4923.343750
[04:03:58] INFO: Checkpointed. Progress 11000 of 100000 steps complete CPU time 5431.890625
[04:12:23] INFO: Checkpoint skipped. Progress 12000/100000 CPU time 5932.765625
[04:20:59] INFO: Checkpointed. Progress 13000 of 100000 steps complete CPU time 6448.343750
[04:29:39] INFO: Checkpoint skipped. Progress 14000/100000 CPU time 6967.734375
[04:38:24] INFO: Checkpointed. Progress 15000 of 100000 steps complete CPU time 7491.468750
[04:47:13] INFO: Checkpoint skipped. Progress 16000/100000 CPU time 8019.984375
[04:55:39] INFO: Checkpointed. Progress 17000 of 100000 steps complete CPU time 8521.218750
[05:03:48] INFO: Checkpoint skipped. Progress 18000/100000 CPU time 9009.062500
[05:11:50] INFO: Checkpoint skipped. Progress 19000/100000 CPU time 9487.843750
[05:19:55] INFO: Sending trickle message to server.
[05:19:55] INFO: Starting intermediate upload, index = 2
[05:19:55] INFO: Checkpointed. Progress 20000 of 100000 steps complete CPU time 9971.968750
[05:27:55] INFO: Checkpoint skipped. Progress 21000/100000 CPU time 10451.312500
[05:35:55] INFO: Checkpoint skipped. Progress 22000/100000 CPU time 10929.562500
[05:44:08] INFO: Checkpointed. Progress 23000 of 100000 steps complete CPU time 11419.156250
[05:52:46] INFO: Checkpoint skipped. Progress 24000/100000 CPU time 11931.390625
[06:01:30] INFO: Checkpointed. Progress 25000 of 100000 steps complete CPU time 12454.343750
[06:10:20] INFO: Checkpoint skipped. Progress 26000/100000 CPU time 12980.437500
[06:19:02] INFO: Checkpointed. Progress 27000 of 100000 steps complete CPU time 13499.875000
[06:27:38] INFO: Checkpoint skipped. Progress 28000/100000 CPU time 14014.453125
[06:36:04] INFO: Checkpointed. Progress 29000 of 100000 steps complete CPU time 14513.859375
[06:44:22] INFO: Sending trickle message to server.
[06:44:22] INFO: Starting intermediate upload, index = 3
[06:44:22] INFO: Checkpoint skipped. Progress 30000/100000 CPU time 15012.015625
[06:52:53] INFO: Checkpointed. Progress 31000 of 100000 steps complete CPU time 15519.515625
[07:01:35] INFO: Checkpoint skipped. Progress 32000/100000 CPU time 16038.187500
[07:10:23] INFO: Checkpointed. Progress 33000 of 100000 steps complete CPU time 16565.562500
[07:19:31] INFO: Checkpoint skipped. Progress 34000/100000 CPU time 17113.515625
[07:28:50] INFO: Checkpointed. Progress 35000 of 100000 steps complete CPU time 17665.750000
[07:38:04] INFO: Checkpoint skipped. Progress 36000/100000 CPU time 18218.859375
[07:46:50] INFO: Checkpointed. Progress 37000 of 100000 steps complete CPU time 18743.875000
[07:55:35] INFO: Checkpoint skipped. Progress 38000/100000 CPU time 19267.656250
[08:04:11] INFO: Checkpointed. Progress 39000 of 100000 steps complete CPU time 19776.296875
[08:12:43] INFO: Sending trickle message to server.
[08:12:43] INFO: Starting intermediate upload, index = 4
[08:12:43] INFO: Checkpoint skipped. Progress 40000/100000 CPU time 20286.562500
[08:21:21] INFO: Checkpointed. Progress 41000 of 100000 steps complete CPU time 20804.046875
[08:30:02] INFO: Checkpoint skipped. Progress 42000/100000 CPU time 21323.000000
[08:38:39] INFO: Checkpointed. Progress 43000 of 100000 steps complete CPU time 21834.750000
[08:47:13] INFO: Checkpoint skipped. Progress 44000/100000 CPU time 22341.468750
[08:55:30] INFO: Checkpointed. Progress 45000 of 100000 steps complete CPU time 22833.531250
[09:03:50] INFO: Checkpoint skipped. Progress 46000/100000 CPU time 23323.578125
[09:12:21] INFO: Checkpointed. Progress 47000 of 100000 steps complete CPU time 23831.046875
[09:21:01] INFO: Checkpoint skipped. Progress 48000/100000 CPU time 24349.593750
[09:29:26] INFO: Checkpointed. Progress 49000 of 100000 steps complete CPU time 24852.703125
[09:37:48] INFO: Sending trickle message to server.
[09:37:48] INFO: Starting intermediate upload, index = 5
[09:37:48] INFO: Checkpoint skipped. Progress 50000/100000 CPU time 25352.062500
[09:46:04] INFO: Checkpointed. Progress 51000 of 100000 steps complete CPU time 25844.671875
[09:54:27] INFO: Checkpoint skipped. Progress 52000/100000 CPU time 26341.312500
[10:02:53] INFO: Checkpointed. Progress 53000 of 100000 steps complete CPU time 26842.093750
[10:11:19] INFO: Checkpoint skipped. Progress 54000/100000 CPU time 27348.093750
[10:19:39] INFO: Checkpointed. Progress 55000 of 100000 steps complete CPU time 27848.156250
[10:27:53] INFO: Checkpoint skipped. Progress 56000/100000 CPU time 28339.406250
[10:36:15] INFO: Checkpointed. Progress 57000 of 100000 steps complete CPU time 28831.609375
[10:44:48] INFO: Checkpoint skipped. Progress 58000/100000 CPU time 29343.859375
[10:53:28] INFO: Checkpointed. Progress 59000 of 100000 steps complete CPU time 29861.000000
[11:02:03] INFO: Sending trickle message to server.
[11:02:03] INFO: Starting intermediate upload, index = 6
[11:02:03] INFO: Checkpoint skipped. Progress 60000/100000 CPU time 30374.796875
[11:10:36] INFO: Checkpointed. Progress 61000 of 100000 steps complete CPU time 30886.906250
[11:19:21] INFO: Checkpoint skipped. Progress 62000/100000 CPU time 31407.375000
[11:27:58] INFO: Checkpointed. Progress 63000 of 100000 steps complete CPU time 31919.093750
[11:36:20] INFO: Checkpoint skipped. Progress 64000/100000 CPU time 32419.609375
[11:45:01] INFO: Checkpointed. Progress 65000 of 100000 steps complete CPU time 32937.062500
[11:54:00] INFO: Checkpoint skipped. Progress 66000/100000 CPU time 33474.750000
[12:03:07] INFO: Checkpointed. Progress 67000 of 100000 steps complete CPU time 34021.187500
[12:12:33] INFO: Checkpoint skipped. Progress 68000/100000 CPU time 34585.046875
[12:21:50] INFO: Checkpointed. Progress 69000 of 100000 steps complete CPU time 35138.531250
[12:30:52] INFO: Sending trickle message to server.
[12:30:52] INFO: Starting intermediate upload, index = 7
[12:30:52] INFO: Checkpoint skipped. Progress 70000/100000 CPU time 35679.890625
[12:39:46] INFO: Checkpointed. Progress 71000 of 100000 steps complete CPU time 36210.703125
[12:48:31] INFO: Checkpoint skipped. Progress 72000/100000 CPU time 36733.687500
[12:57:13] INFO: Checkpointed. Progress 73000 of 100000 steps complete CPU time 37251.109375
[13:05:59] INFO: Checkpoint skipped. Progress 74000/100000 CPU time 37771.843750
[13:14:47] INFO: Checkpointed. Progress 75000 of 100000 steps complete CPU time 38299.937500
[13:23:31] INFO: Checkpoint skipped. Progress 76000/100000 CPU time 38821.953125
[13:31:58] INFO: Checkpointed. Progress 77000 of 100000 steps complete CPU time 39325.593750
[13:40:19] INFO: Checkpoint skipped. Progress 78000/100000 CPU time 39825.359375
[13:48:44] INFO: Checkpointed. Progress 79000 of 100000 steps complete CPU time 40326.015625
[13:57:10] INFO: Sending trickle message to server.
[13:57:10] INFO: Starting intermediate upload, index = 8
[13:57:10] INFO: Checkpoint skipped. Progress 80000/100000 CPU time 40832.093750
[14:05:31] INFO: Checkpointed. Progress 81000 of 100000 steps complete CPU time 41331.562500
[14:13:51] INFO: Checkpoint skipped. Progress 82000/100000 CPU time 41829.828125
[14:22:27] INFO: Checkpointed. Progress 83000 of 100000 steps complete CPU time 42343.812500
[14:31:17] INFO: Checkpoint skipped. Progress 84000/100000 CPU time 42872.328125
[14:40:03] INFO: Checkpointed. Progress 85000 of 100000 steps complete CPU time 43397.250000
[14:48:45] INFO: Checkpoint skipped. Progress 86000/100000 CPU time 43916.015625
[14:57:11] INFO: Checkpointed. Progress 87000 of 100000 steps complete CPU time 44418.593750
[15:05:34] INFO: Checkpoint skipped. Progress 88000/100000 CPU time 44914.093750
[15:13:50] INFO: Checkpointed. Progress 89000 of 100000 steps complete CPU time 45409.312500
[15:22:33] INFO: Sending trickle message to server.
[15:22:33] INFO: Starting intermediate upload, index = 9
[15:22:33] INFO: Checkpoint skipped. Progress 90000/100000 CPU time 45917.187500
[15:32:18] INFO:Turning trickle messaging on.
[15:32:18] INFO:Turning intermediate uploads on.
%IMPACT-I: Softcore binding energy with umax = 1000.00000
%IMPACT-I: Using AGBNP2: Analytical Generalized Born Model + Analytic
Non-Polar Hydration Model
%IMPACT-I: Hybrid potential for binding with lambda = 0.00330
agbnpf_assign_parameters(): info: attempting to load from SQL tables.
[15:41:09] INFO: Sending trickle message to server.
[15:41:09] INFO: Starting intermediate upload, index = 9
[15:41:09] INFO: Checkpoint skipped. Progress 90000/100000 CPU time 45928.355625
[15:50:05] INFO: Checkpointed. Progress 91000 of 100000 steps complete CPU time 46461.277500
[15:58:58] INFO: Checkpoint skipped. Progress 92000/100000 CPU time 46991.996250
[16:07:51] INFO: Checkpointed. Progress 93000 of 100000 steps complete CPU time 47523.402500
[16:16:23] INFO: Checkpoint skipped. Progress 94000/100000 CPU time 48033.574375
[16:25:02] INFO: Checkpointed. Progress 95000 of 100000 steps complete CPU time 48550.840000
[16:33:24] INFO: Checkpoint skipped. Progress 96000/100000 CPU time 49052.215000
[16:41:55] INFO: Checkpointed. Progress 97000 of 100000 steps complete CPU time 49561.949375
[16:50:31] INFO: Checkpoint skipped. Progress 98000/100000 CPU time 50077.621250
[16:59:04] INFO: Checkpointed. Progress 99000 of 100000 steps complete CPU time 50588.668125
[17:07:41] INFO: Checkpoint skipped. Progress 100000/100000 CPU time 51104.871250
%IMPACT-I: Species 1 written to SQL file md-out1.dms
%IMPACT-I: Species 2 written to SQL file md-out2.dms
17:07:42 (5520): called boinc_finish(0)

</stderr_txt>
]]>

----------------------------------------

"Patience is a virtue", I can't wait to learn it!
[Dec 11, 2015 4:20:56 AM]   Link   Report threatening or abusive post: please login first  Go to top 
phytell
Cruncher
Joined: Sep 8, 2014
Post Count: 39
Status: Offline
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

Update: results appear to validate if and only if:
(a) all trickles are reported via scheduler request prior to the unit itself
(b) there is a delay between when trickles are reported and when the WU returns
In both cases, the valid units were reported several hours after their trickles, leading me to believe that the delay should be substantial.
Both of these units were calculated offline, with network connection only used to refresh the cache and report completion.
Since invalid units are wasting dozens of hours (I recently returned a 63 hour WU) and nobody seems to be able to tell me whether these units are actually doing anything or not, it looks like this project is going to be require a bit to much management for me.
I'll be watching these forums in hopes of a fix for this, but if none comes, best of luck to the rest of the crunchers.
[Dec 12, 2015 5:27:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

This all just gets more and more stupid as time goes on ....

If it sees a net disconnection that makes the WU invalid it should abort the WU post a message under the NOTICE's tab so we can read it and allow us to correct it or not.....

I wish the people in charge of all this would start doing their jobs

I no longer have questions as to why most people just drop WCG
[Dec 13, 2015 12:41:59 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7846
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

This all just gets more and more stupid as time goes on ....

If it sees a net disconnection that makes the WU invalid it should abort the WU post a message under the NOTICE's tab so we can read it and allow us to correct it or not.....

I wish the people in charge of all this would start doing their jobs

I no longer have questions as to why most people just drop WCG

I am sorry to hear you are having problems with this project as well as CEP2. This is a project which is still rather new, so all of the bugs are not fully crushed. Others have mentioned the problems with having intermittent internet service or other breaks having to do trickles and reporting. By WCG standards these are heavier than normal WU's which take more time than many of the other projects. This is also a more complex project with having to string together the results from 30 or 40 separate WU's to get a complete job finished. It appears like most of the crunchers are sending back valid units, because if there was a high error rate the techs would stop the project to figure out the problem. They do not want to see wasted cycles any more than you do. If this project is not compatible with your setup, for what ever reason, opt out. Choose a diferent project that is not so demanding of your resources. There is no use in getting high blood pressure over something that is bothering you. The techs would like to see all of the projects go smoothly for everyone, but that is probably an unrealistic goal due to the myriad of different setups that are crunching. At the very least just calling the system stupid without offering any possible solutions gets us nowhere.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Dec 13, 2015 3:53:10 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

Sgt Joe it all boils down to one thing .... keeping the numbers up ....

If not having a putter hooked to the net can cause a invalid result they could of told us in the notices section as fast as they do when there is some new video or news feed

If all that matters REALLY is the first save on CHP2 then why not let us be sure to crunch that out and then impose a 18 hour limit and avoid sending valid WU's back out for re-validation until all good again

It's the only valid reason I can find..... If you divide the daily CPU time by the number of computers connected on WCG (data found at) Community page top page they all work something close to 30 min a day each .... REALLY is that all we do ???

Posting the number of active computers just wouldn't look as IMPRESSIVE
----------------------------------------
[Edit 1 times, last edit by Former Member at Dec 13, 2015 7:40:39 PM]
[Dec 13, 2015 7:20:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7846
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

Sgt Joe it all boils down to one thing .... keeping the numbers up ....

If not having a putter hooked to the net can cause a invalid result they could of told us in the notices section as fast as they do when there is some new video or news feed

If all that matters REALLY is the first save on CHP2 then why not let us be sure to crunch that out and then impose a 18 hour limit and avoid sending valid WU's back out for re-validation until all good again

It's the only valid reason I can find..... If you divide the daily CPU time by the number of computers connected on WCG (data found at) Community page top page they all work something close to 30 min a day each .... REALLY is that all we do ???

Posting the number of active computers just wouldn't look as IMPRESSIVE

These figures are pretty old news. There are about 11% of the total users who are active. There is about 8% of the total hosts (about 2.5 million) which are active( about 200,000). Users come and go. For instance on the day I started, July 4, 2006, out of the 82 people who started on that day only 8 remain active, slightly less than 10%. I do agree the total hosts figure is deceptive. Just for me I have 48 device installations listed, but have retired 39 of them, either because they went belly up, or because they were too old to be effective crunchers any more. If you have any ideas on how to retain crunchers or make them more involved, don't hesitate to speak up. One of the areas frequently mentioned is more and better communication from the project scientists and better communication from the techs when needed. However, they have limited time, a limited budget and they need their time off to be with their families. I think in most instances they are reasonably prompt with addressing problems and keeping us informed. The uptime of the projects here is probably the envy of many other projects.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Dec 14, 2015 1:15:38 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

If you have any ideas on how to retain crunchers or make them more involved, don't hesitate to speak up.


ok ... if you want people to hang around don't waste their time and resorces

1) Fix the problem with cutting off CHP2 before the first save point avoiding unneeded time run errors that cause an unneeded error to the computers record that causes units to be validated..... I see it as better for my computers tracking record and better for WCG as a whole to just abort the WU if after 17hr + if it hadn't saved the first job ... really that's a very sad way fix that problem because no one else can seem to make an effort

2) When there is a problem noted like the computer has to be connected to send up trickles or it comes up invalid just stand up and say Huston We Have A Problem and tell us in the notices section of the program and/or in the event log....don't allow our time to be wasted and force us to research it here not all systems can be connected 24/7....we shouldn't have to research what known problems are WEE SHOULD BE TOLD!!!

Well they do any of that?? I'm betting no

As far as a computer getting to old to make an effort if it has the ability to run it even if it's slower then others ... please be real about managing resources and do consider just what your saying there ... other wise no harm done to me at least I can rebuild them and over clock if I want to I just don't want to

Good thing I can edit this stuff or I'd be banned for sure
----------------------------------------
[Edit 7 times, last edit by Former Member at Dec 14, 2015 7:33:29 PM]
[Dec 14, 2015 6:30:32 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7846
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

ok ... if you want people to hang around don't waste their time and resorces

I agree.

You have made two suggestions. On suggestion #1, if it is possible to do just that, I would be the first to applaud. There may be reasons it is not possible so if one of the scientists or techs could offer an explanation, that would nice. I don't think this problem is widespread, but it does occur due to, as I understand it, either the computer is too slow for the molecule size or the calculations lead to a divergent condition.
On suggestion #2, it would have been nice if this would have been caught in the Beta, but it wasn't. I agree that if intermittent internet connectivity is a problem for this project, it should be noted in the project requirements page.

As far as a computer getting to old to make an effort if it has the ability to run it even if it's slower then others ... please be real about managing resources and do consider just what your saying there

I retired some machines as being too old because the electricity use per unit crunched was too high. These were PII's, PIII's, P4's. They were repurposed to other uses if they were still operable. I obtained most of them for next to nothing, used them while they were useful and upgraded when I was able to obtain newer machines others were getting rid of. I know a person who deals in older lab and diagnostic equipment. Sometimes they need older machines to replace the computer components which have gone bad or only certain models work with certain machines. I have given some of these older machines to them in order to help them rehab the equipment and extend its useful life.
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Dec 14, 2015 8:55:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Does offline calculation results in invalids?

I just aborted a CEP2 WU that ran 17.78 hr with no save point and I looked at one that I spent 18 hours on and a different computer spent 18 on it also with only one hour into the 2nd job. So the problem isn't the computers running the WU's it's that unholy 18 hour limit and there is no good reason to pick up the 18 hour limit before the first job is done

No harm I would of received nothing for it and my computer would of taken a hit

As far as FAH2 goes I can keep it connected but of the 3 WU's that came up bad one I can't find I did get credit for the other two but they both got sent out twice.... so that's 3 times the CPU time for any one of them.....Dumb very dumb and both problems need to be corrected.... But I bet they never will
----------------------------------------
[Edit 2 times, last edit by Former Member at Dec 16, 2015 2:36:46 PM]
[Dec 16, 2015 2:28:46 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 60   Pages: 6   [ Previous Page | 1 2 3 4 5 6 ]
[ Jump to Last Post ]
Post new Thread