Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 8
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 4584 times and has 7 replies Next Thread
starhuggrrr
Cruncher
Joined: Jun 17, 2006
Post Count: 22
Status: Offline
Reply to this Post  Reply with Quote 
Output file absent??

I've been noticing that the amount of time I have BOINC/World Community Grid running does not always correspond to the statistics graph. I run BOINC usually only when I'm not using the computer and when I am using the computer I have BOINC shut down and not even in memory. I Exit, and then restart it when I'm finished at the end of the day. Usually it's running for a good 8-10 hours at 100% throughout the night before I shut it down in the morning.

I am running only World Community Grid on it and I allow all WCG projects to run, according to whatever WCG serves me. I'm on a Vista Home Premium OS with a quad core Intel chip, 4GB RAM installed (3GB usable), with Bit Defender 2009 as my antivirus (which has never reported having any issues with WCG or BOINC).

Yesterday I was out all day and so I have had it running nonstop from Oct.17 12:34 AM until now, Oct.18 10:26 AM. I checked email and did a bit of online surfing yesterday morning and late last night; otherwise it's been the only thing running (other than my antivirus updating occasionally). And yet, the graph shows that less work was done yesterday (almost 24 hours) than today! (only 10+ hours)

When I looked back over the messages there are 6 instances of "Output file blahblah absent." I'm thinking that's where the problem lies and it may have been happening before now. I haven't noticed this before, but then I wasn't looking for it and they're hard to spot unless you really look for them. I can copy all messages here if that will help diagnose the problem, but it's huge (runs 10 pages when I paste it into a Word document and even scrunch the font size).

I've recently started running WCG again because there seemed to be problems when I first got this new computer and ran it concurrent with other use on a percentage basis. (The problems don't seem to happen when I run it only when I'm not using the computer, period, so that's how I'm getting around it - manually launch BOINC when I'm finished for the day.) But I'm feeling quite frustrated that the work seems to be being sabotaged somehow by problems unknown. It's maddening to feel like I'm going out of my way to keep WCG active when some of the work is never getting back to WCG.

Could this be a Vista problem? (We love to blame everything on Vista tongue but only because it's so often deserved!) I tried to do a search for the error message (the title of this post) in the forums here but didn't see much that seemed relevant.

Could someone please advise either what more information I should post here to be able to diagnose the problem, or suggestions about what I can do to resolve the problem?

Thanks very much.
[Oct 18, 2009 2:47:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Output file absent??

Can you please visit the My Grid > Result Status page and click on an error link and post content.

Were they all the same science or different sciences? If different, please post an eeror log for each of these.

ttyl
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
[Oct 18, 2009 3:21:21 PM]   Link   Report threatening or abusive post: please login first  Go to top 
starhuggrrr
Cruncher
Joined: Jun 17, 2006
Post Count: 22
Status: Offline
Reply to this Post  Reply with Quote 
Re: Output file absent??

Hi Sekerob,

Thanks for your reply. I was able to filter my results to errors only. Curiously, although I see 6 "output file absent" messages in my list for Oct.17-18, there's only one error listed for that time.

When I click on one of the "error" links in the error listing, I get the following error code; it looks like other errors (on previous dates) are the same:
<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
Incorrect function. (0x1) - exit code 1 (0x1)
</message>
<stderr_txt>
ERROR:: Exit at: .\dock_structure.cc line:401

</stderr_txt>
]]>


When I click on the project number for the Oct.17 error line, I get a list of messages for that job, which I'll copy here.
Project Name: Human Proteome Folding - Phase 2
Created: 10/15/09
Name: mv321_00008
Minimum Quorum: 15
Replication: 19
The large number of copies sent out for this workunit is due to the unique nature of this project.
We encourage you to read the FAQs about this project for more information.


Result Name App Version Number Status Sent Time Time Due /
Return Time CPU Time (hours) Claimed/ Granted BOINC Credit
mv321_ 00008_ 22-- - In Progress 10/18/09 15:38:40 10/22/09 15:38:40 0.00 0.0 / 0.0
mv321_ 00008_ 21-- 603 Pending Validation 10/17/09 21:36:16 10/18/09 09:12:16 7.75 144.6 / 0.0
mv321_ 00008_ 20-- 603 Pending Validation 10/17/09 14:41:17 10/18/09 08:18:57 5.42 83.2 / 0.0
mv321_ 00008_ 16-- - In Progress 10/17/09 12:50:16 10/27/09 12:50:16 0.00 0.0 / 0.0
mv321_ 00008_ 9-- 603 Error 10/17/09 11:19:28 10/17/09 14:34:06 0.02 0.4 / 0.0
mv321_ 00008_ 10-- - In Progress 10/17/09 11:19:23 10/27/09 11:19:23 0.00 0.0 / 0.0
mv321_ 00008_ 8-- 603 Error 10/17/09 11:18:51 10/18/09 15:31:23 0.03 0.4 / 0.0
mv321_ 00008_ 13-- - In Progress 10/17/09 11:16:27 10/27/09 11:16:27 0.00 0.0 / 0.0
mv321_ 00008_ 2-- 603 Pending Validation 10/17/09 11:08:56 10/17/09 20:12:52 6.99 119.6 / 0.0
mv321_ 00008_ 0-- 603 Error 10/17/09 11:07:07 10/17/09 21:35:44 0.03 0.4 / 0.0
mv321_ 00008_ 18-- - In Progress 10/17/09 11:00:23 10/27/09 11:00:23 0.00 0.0 / 0.0
mv321_ 00008_ 3-- - In Progress 10/17/09 10:58:01 10/27/09 10:58:01 0.00 0.0 / 0.0
mv321_ 00008_ 1-- - In Progress 10/17/09 10:57:57 10/27/09 10:57:57 0.00 0.0 / 0.0
mv321_ 00008_ 4-- - In Progress 10/17/09 10:57:48 10/27/09 10:57:48 0.00 0.0 / 0.0
mv321_ 00008_ 17-- - In Progress 10/17/09 10:55:54 10/27/09 10:55:54 0.00 0.0 / 0.0
mv321_ 00008_ 5-- 603 Pending Validation 10/17/09 10:55:30 10/18/09 04:17:28 5.01 81.3 / 0.0
mv321_ 00008_ 19-- 603 Pending Validation 10/17/09 10:53:06 10/17/09 19:48:14 5.78 105.5 / 0.0
mv321_ 00008_ 11-- 603 Pending Validation 10/17/09 10:53:06 10/17/09 22:27:35 6.08 151.7 / 0.0
mv321_ 00008_ 15-- 603 Pending Validation 10/17/09 10:51:45 10/17/09 19:14:15 4.16 89.2 / 0.0
mv321_ 00008_ 12-- 603 Pending Validation 10/17/09 10:51:41 10/18/09 01:10:22 8.60 103.1 / 0.0
mv321_ 00008_ 6-- - In Progress 10/17/09 10:51:35 10/27/09 10:51:35 0.00 0.0 / 0.0
mv321_ 00008_ 7-- 603 Error 10/17/09 10:49:10 10/17/09 10:50:35 0.00 0.0 / 0.0
mv321_ 00008_ 14-- 603 Pending Validation 10/17/09 10:46:33 10/18/09 13:15:17 11.85 77.9 / 0.0


As to whether they're all from the same science/project, all of the project job numbers start with "mv" so does that mean they're for the same project? The numbers after "mv" are all different though, so it doesn't seem that the same project job is producing repeated errors.

From what I can see, it's either only or mostly the "mv" projects that are going absent since after Oct.12. Although most of the "mv" jobs seem to suffer from this problem, at least one I'm seeing from today completed and was uploaded.

I'll copy just a section of the message list from today, so you'll get an idea of what's happening. It looks like most or all of the absent-file jobs are only running for a minute or two before they report as finished and then the output file is reported as absent. I've coloured the start and finishe message of the same jobs in the same colour so they're easier to spot.
2009-10-18 5:48:35 AM|World Community Grid|Starting mv276_00059_18
2009-10-18 5:48:37 AM|World Community Grid|Starting task mv276_00059_18 using hpf2 version 603
2009-10-18 5:48:38 AM|World Community Grid|Sending scheduler request: To fetch work. Requesting 1239 seconds of work, reporting 0 completed tasks
2009-10-18 5:48:39 AM|World Community Grid|Started upload of CMD2_0131-WWP1A.clustersOccur-3DEM_B.clustersOccur_62_0_0
2009-10-18 5:48:41 AM|World Community Grid|Finished upload of CMD2_0131-WWP1A.clustersOccur-3DEM_B.clustersOccur_62_0_0
2009-10-18 5:48:43 AM|World Community Grid|Scheduler request succeeded: got 1 new tasks
2009-10-18 5:48:46 AM|World Community Grid|Started download of 088838286bd4937b94032a73b2c0ff8a.dat
2009-10-18 5:48:46 AM|World Community Grid|Started download of d539ceceed20c3ddf255653ed24e16e0.dat
2009-10-18 5:48:47 AM|World Community Grid|Finished download of 088838286bd4937b94032a73b2c0ff8a.dat
2009-10-18 5:48:47 AM|World Community Grid|Finished download of d539ceceed20c3ddf255653ed24e16e0.dat
2009-10-18 5:48:47 AM|World Community Grid|Started download of 49392f64dcc6d9b3d7d1261dc2bc6009.pdb.gzb
2009-10-18 5:48:47 AM|World Community Grid|Started download of 8172aa2de6702a469031a1ce532c9c4c.pdb.gzb
2009-10-18 5:48:48 AM|World Community Grid|Finished download of 49392f64dcc6d9b3d7d1261dc2bc6009.pdb.gzb
2009-10-18 5:48:48 AM|World Community Grid|Finished download of 8172aa2de6702a469031a1ce532c9c4c.pdb.gzb
2009-10-18 5:48:48 AM|World Community Grid|Started download of 5a869e0e46db59f34f209c7595136598.dat.gzb
2009-10-18 5:48:50 AM|World Community Grid|Finished download of 5a869e0e46db59f34f209c7595136598.dat.gzb
2009-10-18 5:48:59 AM|World Community Grid|Sending scheduler request: To fetch work. Requesting 287 seconds of work, reporting 1 completed tasks
2009-10-18 5:49:04 AM|World Community Grid|Scheduler request succeeded: got 1 new tasks
2009-10-18 5:49:07 AM|World Community Grid|Started download of batch00367_R00367_368f21c328aa602a19c86398d2a9b5d6.sequence
2009-10-18 5:49:07 AM|World Community Grid|Started download of batch00367_R00367_368f21c328aa602a19c86398d2a9b5d6.dist.gzb
2009-10-18 5:49:08 AM|World Community Grid|Finished download of batch00367_R00367_368f21c328aa602a19c86398d2a9b5d6.sequence
2009-10-18 5:49:08 AM|World Community Grid|Finished download of batch00367_R00367_368f21c328aa602a19c86398d2a9b5d6.dist.gzb
2009-10-18 5:50:06 AM|World Community Grid|Computation for task mv276_00059_18 finished
2009-10-18 5:50:06 AM|World Community Grid|Output file mv276_00059_18_0 for task mv276_00059_18 absent

2009-10-18 5:50:06 AM|World Community Grid|Starting mv276_00040_0
2009-10-18 5:50:07 AM|World Community Grid|Starting task mv276_00040_0 using hpf2 version 603

2009-10-18 5:51:07 AM|World Community Grid|Sending scheduler request: To fetch work. Requesting 21628 seconds of work, reporting 1 completed tasks
2009-10-18 5:51:12 AM|World Community Grid|Scheduler request succeeded: got 1 new tasks
2009-10-18 5:51:14 AM|World Community Grid|Started download of 468787ae5c23c219a17f395aa048b3d8.dat
2009-10-18 5:51:14 AM|World Community Grid|Started download of ff956bb15417fd3e0fc60680441f0fcc.dat
2009-10-18 5:51:15 AM|World Community Grid|Finished download of 468787ae5c23c219a17f395aa048b3d8.dat
2009-10-18 5:51:15 AM|World Community Grid|Finished download of ff956bb15417fd3e0fc60680441f0fcc.dat
2009-10-18 5:51:15 AM|World Community Grid|Started download of 2757256b33dbf34a1785dc613ddfcde9.pdb.gzb
2009-10-18 5:51:15 AM|World Community Grid|Started download of 83fe29c2504d849998dcac85e0f0e269.pdb.gzb
2009-10-18 5:51:17 AM|World Community Grid|Finished download of 2757256b33dbf34a1785dc613ddfcde9.pdb.gzb
2009-10-18 5:51:17 AM|World Community Grid|Finished download of 83fe29c2504d849998dcac85e0f0e269.pdb.gzb
2009-10-18 5:51:17 AM|World Community Grid|Started download of 61a666869af84b1acd3c8f99caff4639.dat.gzb
2009-10-18 5:51:18 AM|World Community Grid|Finished download of 61a666869af84b1acd3c8f99caff4639.dat.gzb
2009-10-18 5:51:27 AM|World Community Grid|Sending scheduler request: To fetch work. Requesting 10218 seconds of work, reporting 0 completed tasks
2009-10-18 5:51:32 AM|World Community Grid|Scheduler request succeeded: got 1 new tasks
2009-10-18 5:51:34 AM|World Community Grid|Computation for task mv276_00040_0 finished
2009-10-18 5:51:34 AM|World Community Grid|Output file mv276_00040_0_0 for task mv276_00040_0 absent

2009-10-18 5:51:34 AM|World Community Grid|Starting mv277_00039_4
2009-10-18 5:51:34 AM|World Community Grid|Starting task mv277_00039_4 using hpf2 version 603

2009-10-18 5:51:35 AM|World Community Grid|Started download of X0000092580124200710031229_X0000092580124200710031229.jp2
2009-10-18 5:51:37 AM|World Community Grid|Finished download of X0000092580124200710031229_X0000092580124200710031229.jp2
2009-10-18 5:53:20 AM|World Community Grid|Computation for task mv277_00039_4 finished
2009-10-18 5:53:20 AM|World Community Grid|Output file mv277_00039_4_0 for task mv277_00039_4 absent



Is that enough to figure out what the problem is? Please let me know if there's more information I can provide that will be useful. Thanks very much for your help!
[Oct 18, 2009 4:47:11 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Output file absent??

Hi,

Yes the information and the log is enough to pronounce verdict. Your Vista OSsed PC has a problem, of undiagnosable nature with Human Proteome Folding and would thus suggest you de-select this project in the Device Profile of on My Projects. BUT, since the error occurs in the first few minutes and seemingly your PC seems to succeed at times to complete these jobs, you could also leave it be, but only if the failure percent is not very high.

mv... are all hpf2 jobs. Click on the Result name link and it brings up the quorum detail with the project name at top similar to what you posted.

Why you had 6 messages and only 1 error listed for the 17th/18th I don't know, possibly the "ready to report" parts were not cleared yet in your client. Did you hit the Return time header to sort by date? Here's a prepped filter/sort:

https://secure.worldcommunitygrid.org/ms/view...eturnedTime&pageNum=1

Hope there are not many.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Oct 18, 2009 5:07:46 PM]
[Oct 18, 2009 5:06:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
LUFTY
Cruncher
Joined: Apr 27, 2007
Post Count: 25
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Output file absent??

Dear All

I am right in thinking that a quad core processor running 100% for 24 hours should record 96 hours of runtime as this does not seem to be the case ?

Kind regards


Lufty
[Oct 18, 2009 6:24:45 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Reply to this Post  Reply with Quote 
Re: Output file absent??

Oh boy this is going off onto the first branch ;>)

Well, if you would get error free crunching, let it run for a couple of weeks to get a steady stream of validated work going, then take the average of 1 week from your My Statistics > My Statistics History, you'd get close. But, no PC has 100% disposal of the CPU time for crunching. The OS and the client itself does housekeeping too, so I guess, 95%-97%+ if left alone is achievable. My quad, though substantially used does about 3:18 days on average, 6 hours daily going towards other things.
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Oct 18, 2009 6:36:12 PM]
[Oct 18, 2009 6:34:25 PM]   Link   Report threatening or abusive post: please login first  Go to top 
starhuggrrr
Cruncher
Joined: Jun 17, 2006
Post Count: 22
Status: Offline
Reply to this Post  Reply with Quote 
Re: Output file absent??

Hi Sekerob,

Thanks very much for your feedback. When I sort it by Return date, I do see 6 messages for 17-18 during the time period in question (plus one more for today since I first posted. Sigh...).

Interestingly, before this I see 6 errors for jobs starting with "mu" for Oct.10 and previous which are also hpf2 jobs, so it seems there's a conflict with this project all round. When I filter for all hpf2 jobs that are Valid, I see only 4 and they're all up to Oct.12. So clearly there are more of these jobs failing than succeeding. (I only restarted running WCG on Sep.30, I think.)

I can certainly deselect that project, but do you think these errors imply that my Vista installation has a problem that could be causing problems with other programs too? I do get weirdness happening occasionally but since this is Vista after all, I just put it down to the usual Vista shenanigans. Should I be concerned about more than just this problem?

Thx.
[Oct 18, 2009 11:03:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
starhuggrrr
Cruncher
Joined: Jun 17, 2006
Post Count: 22
Status: Offline
Reply to this Post  Reply with Quote 
Re: Output file absent??

Hi Lufty,

Don't forget I only copied a small portion of the messages for that time period.
[Oct 18, 2009 11:05:14 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread