Total posts in this thread: 13
This topic has been viewed 4453 times and has 12 replies
Lighthouse
Senior Cruncher
Joined: Nov 20, 2004
Post Count: 283
Status: Offline
Error: exceeds size limit

I am getting a large number (3 out of every 4) of bad results on only one computer:
Vista/Q6600, running HFCC.

As an example, the "stdoutdae.txt" log file states, in part:

19-Nov-2009 09:38:11 [World Community Grid] Starting HFCC_s1_00236214_s1_0000_0
19-Nov-2009 09:38:11 [World Community Grid] Starting task HFCC_s1_00236214_s1_0000_0 using hfcc version 610


19-Nov-2009 21:32:38 [World Community Grid] Computation for task HFCC_s1_00236214_s1_0000_0 finished
19-Nov-2009 21:32:38 [World Community Grid] Output file HFCC_s1_00236214_s1_0000_0_1 for task HFCC_s1_00236214_s1_0000_0 exceeds size limit.
19-Nov-2009 21:32:38 [World Community Grid] File size: 13625052.000000 bytes. Limit: 10485760.000000 bytes

Please include in your response "simple" directions. I realize that this may not be enough information, but you will need to tell me what you need to know and where to look for it. Assume that I know nothing about error messages, where to find files, etc.
----------------------------------------

----------------------------------------
[Edit 1 times, last edit by Lighthouse at Nov 20, 2009 9:08:57 AM]
[Nov 20, 2009 9:05:14 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Error: exceeds size limit

Ah, now we're talking with some tangibles. This has nothing to do with your client/install; it just seems that this batch of results produces output much larger than the preset limits permit. That is something the techs have to look at and tweak at their end.

Unfortunately, they won't be in for a few hours, so I can only suggest temporarily switching to the HCC project till it blows over. The quickest way is via My Grid > My Projects, which applies the selection change to all your devices, if all are affected.

edit: batch of course
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Nov 20, 2009 4:16:14 PM]
[Nov 20, 2009 9:18:13 AM]
Lighthouse
Senior Cruncher
Joined: Nov 20, 2004
Post Count: 283
Status: Offline
Re: Error: exceeds size limit

As always, thanks Sekerob. Advice taken.

I found it curious, though, that only one computer is having this issue. I have three computers running right now and the others are not getting errors. Just a coincidence that one got a "bad" batch?

I searched through the log file and found the largest File size that I had generated. Perhaps this number will help the techs?


19-Nov-2009 12:48:54 [World Community Grid] Computation for task HFCC_s1_00179312_s1_0000_0 finished
19-Nov-2009 12:48:54 [World Community Grid] Output file HFCC_s1_00179312_s1_0000_0_1 for task HFCC_s1_00179312_s1_0000_0 exceeds size limit.
19-Nov-2009 12:48:54 [World Community Grid] File size: 23037250.000000 bytes. Limit: 10485760.000000 bytes
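For anyone who would rather not search the log by hand, here is a minimal Python sketch of the same search. It assumes the log lines look exactly like the ones quoted in this thread; the function name `oversized_outputs` and the `sample` list are made up for illustration and are not part of BOINC.

```python
import re

# Pattern for the "File size: ... bytes. Limit: ... bytes" lines quoted in this thread.
SIZE_LINE = re.compile(r"File size: (\d+(?:\.\d+)?) bytes\. Limit: (\d+(?:\.\d+)?) bytes")

def oversized_outputs(log_lines):
    """Return (size_bytes, limit_bytes, line) for each size report, largest first."""
    hits = []
    for line in log_lines:
        match = SIZE_LINE.search(line)
        if match:
            hits.append((float(match.group(1)), float(match.group(2)), line.strip()))
    return sorted(hits, key=lambda hit: hit[0], reverse=True)

# The two size reports quoted in this thread, used as sample input.
sample = [
    "19-Nov-2009 21:32:38 [World Community Grid] File size: 13625052.000000 bytes. Limit: 10485760.000000 bytes",
    "19-Nov-2009 12:48:54 [World Community Grid] File size: 23037250.000000 bytes. Limit: 10485760.000000 bytes",
]

for size, limit, line in oversized_outputs(sample):
    print(f"{size / limit:.2f}x the limit: {line}")
```

To run it on a real log, pass an open file instead of `sample`, for example `oversized_outputs(open("stdoutdae.txt"))`; where that file lives depends on your BOINC installation.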
----------------------------------------

[Nov 20, 2009 3:29:24 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Error: exceeds size limit

Lighthouse,

I have taken a look at the logs for a few of your work units on that machine. It appears that you have restarted the work unit 133 times in one case. Each time a work unit restarts, the Autodock application prints about 4000 lines of information to make sure that the system is still properly set up for the scientists.
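To put rough numbers on that, here is a small Python sketch of the arithmetic. The restart count and lines-per-restart come from the figures above, and the limit is the one shown in the quoted log. The average line length (`ASSUMED_BYTES_PER_LINE`) is purely an assumption for illustration, not a measured value.

```python
import math

RESTARTS = 133                   # restarts seen for one work unit
LINES_PER_RESTART = 4000         # "about 4000 lines" printed per restart
LIMIT_BYTES = 10 * 1024 * 1024   # 10485760, the limit shown in the log

# Assumption for illustration only: the real average line length is not known.
ASSUMED_BYTES_PER_LINE = 40

restart_lines = RESTARTS * LINES_PER_RESTART
restart_bytes = restart_lines * ASSUMED_BYTES_PER_LINE

print(restart_lines)                # 532000
print(restart_bytes)                # 21280000
print(restart_bytes > LIMIT_BYTES)  # True

# Number of restarts at which the restart output alone would pass the limit,
# under the same assumed line length.
restarts_to_limit = math.ceil(LIMIT_BYTES / (LINES_PER_RESTART * ASSUMED_BYTES_PER_LINE))
print(restarts_to_limit)            # 66
```

Under that assumed line length, the restart output alone would be about twice the limit, which is at least in the same range as the file sizes reported earlier in the thread.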

This issue is kind of unique to the HFCC project because these work units that are running have 255 runs in them which is the most we have put in a single Autodock job at World Community Grid. I say kind of because if the other Autodock programs were to use 255 runs it would be similar, but at the moment FA@H runs less than 100 (usually around 50) per work unit.

I'm not sure what is causing your computer to restart these work units so many times. It could be anything from your computer being shut down and restarted many times in a day to having multiple BOINC projects selected (meaning something other than World Community Grid using your computer).

Can you please post more from your messages tab? I am curious to see what is causing the restarts on your application.

Thanks,
-Uplinger
[Nov 20, 2009 4:02:54 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Error: exceeds size limit

Sorry Lighthouse, that was unexpected. I wonder if you have "Leave applications in memory while preempted" switched on for that client. If not, each time a job is paused it is unloaded, and it then resumes from the last checkpoint.
[Nov 20, 2009 4:27:41 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Error: exceeds size limit

Sek,

When I was looking into the results for Lighthouse, it appeared his machine was set to the Home profile, which was set to standard. This allows the agent to run while the computer is active. But it also removes apps from memory, which is why I think he may have more than one BOINC project attached. I don't know for sure at the moment.

-Uplinger
[Nov 20, 2009 5:05:52 PM]
adrianxw
Senior Cruncher
Denmark
Joined: Apr 13, 2008
Post Count: 196
Status: Offline
Re: Error: exceeds size limit


may have more than one BOINC project attached

Uplinger

Are you saying that connecting to more than WCG as your BOINC source could cause problems? I typically have 10+ BOINC sources running and have not seen this issue.
----------------------------------------
[Edit 4 times, last edit by adrianxw at Nov 20, 2009 8:52:26 PM]
[Nov 20, 2009 8:42:51 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Error: exceeds size limit

Not specifically. I am trying to figure out if he has multiple BOINC projects selected. If so, and his preferences are to switch between applications every 15 minutes, it could cause the application to stop, unload from memory, and then start again. It all depends on the logs I'm waiting on to figure out what is causing the computer to restart from checkpoint so often. I was just mentioning being attached to other BOINC projects as a possible cause of this, but without the logs it's all speculation :) And if that is the case, then we will want to check how often it is told to switch between applications.

The current limit for result logs to be sent back is 10 MB before compression. This is to help catch issues that may be causing the computer to restart frequently on a work unit. Again, there are many ways this could happen, but I'm trying to rule out one scenario. Logs will help rule out multiple scenarios...

I hope this helps, if not feel free to ask more questions :)

-Uplinger
[Nov 20, 2009 8:51:29 PM]
Lighthouse
Senior Cruncher
Joined: Nov 20, 2004
Post Count: 283
Status: Offline
Re: Error: exceeds size limit

A bit more info:

I run WCG exclusively - there are no other grid computing projects.

I want computing to stop when I use the computer. In addition, I would like to throttle back CPU usage to around the 70-85% range.

The computer had been set to my "work" profile when these errors were generated. I changed it to "Home" only this morning. BUT, I had likely changed some of the settings locally, so even my Work profile is not entirely correct.

I just looked at current settings:
1. Switch between applications now 60 mins. I remember that it had been 30 minutes.

2. Leave applications in memory is NOT checked.

What are your suggested settings? The "default" may not be what I want, because I want to throttle it back, and pause when I am using it.
----------------------------------------

[Nov 21, 2009 3:58:08 AM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Error: exceeds size limit

Hi lighthouse,

As per the suspicion in my post of Nov 20, 2009 5:27:41 PM, the preventative action in your setup is indeed to tick the LAIM (Leave applications in memory) box. It will increase your client's crunching efficiency quite a bit by the looks of it, as each time BOINC is allowed to resume, the science will start right from the point it was paused and not from the last checkpoint.

Let us know, but I think it's safe to say that you can fully resume HFCC crunching.
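
For those who prefer to set this locally rather than through the website profile, here is a hedged Python sketch that builds the contents of a BOINC `global_prefs_override.xml` file. It assumes your client reads such an override file from its data directory. The helper name `build_prefs_override` is made up for illustration, and the values (60-minute switch interval, 80% CPU, suspend while the computer is in use) are only an example based on the preferences described earlier in this thread, not an official recommendation.

```python
import xml.etree.ElementTree as ET

def build_prefs_override(leave_in_memory, switch_minutes, cpu_limit_percent, run_if_user_active):
    """Build the XML text for a local preferences override (illustrative helper)."""
    root = ET.Element("global_preferences")
    # 0 = suspend computing while the computer is in use
    ET.SubElement(root, "run_if_user_active").text = "1" if run_if_user_active else "0"
    # 1 = keep paused applications in memory (the LAIM box)
    ET.SubElement(root, "leave_apps_in_memory").text = "1" if leave_in_memory else "0"
    # "Switch between applications every N minutes"
    ET.SubElement(root, "cpu_scheduling_period_minutes").text = str(switch_minutes)
    # CPU throttle, as a percentage
    ET.SubElement(root, "cpu_usage_limit").text = str(cpu_limit_percent)
    return ET.tostring(root, encoding="unicode")

# Example values only: LAIM on, switch every 60 minutes, 80% CPU, pause when in use.
override_xml = build_prefs_override(
    leave_in_memory=True,
    switch_minutes=60,
    cpu_limit_percent=80,
    run_if_user_active=False,
)
print(override_xml)
```

The printed text would be saved as `global_prefs_override.xml` in the BOINC data directory, after which the client has to be told to re-read its local preferences. If you are unsure, ticking the LAIM box in the preferences as described above achieves the main fix without touching any files.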
[Nov 21, 2009 8:14:12 AM]