Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 31
Posts: 31   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 5786 times and has 30 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: The restarting problem--maybe a log needed for the app?

I have been browsing through the forums today and it has become abundantly clear that there is a lot of pent up frustration on the part of the people donating their resources to this project.
A number of which appear to be ready to pull the plug as they believe that their efforts are going awry.
If the Admin does not start corresponding with these well meaning individuals soon , I am afraid that not getting an answer to the reason why donators are getting so many aborted work units could seriously damage the effectiveness of this project.
I am sure that there are a lot of experienced users donating time to this project that understand the inherant problems that a project of this size can cause, but for rookies it is a lot more difficult to continue without answers.I feel you must give priority to users questions to ensure continued dedication to this excellent project
I am sure your attention to these problems will be very much appreciated by all participants and put a lot of peoples minds at rest

The more comments to this post, the more effective it will be
Thank you in advance for your comments
Graham [Team Europe}
[Nov 22, 2004 11:50:46 AM]   Link   Report threatening or abusive post: please login first  Go to top 
joatmon
Senior Cruncher
Joined: Nov 17, 2004
Post Count: 185
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: The restarting problem--maybe a log needed for the app?

my agent is now also experiencing the status reset. None of my settings have changed since tha tlast successful workunit completion.

there are a number of files in the WCG agent directory on my PC, which of these are the work unit? I have a ud_XXXXX that corresponds to the time of the last reset, and there is a ud_YYYYY.dll that corresponds to the download time of the lat successfully completed workunit download.


I am a newbie at this. Fine if it's all behaving normally and as designed, but it seems like a lot of us aren't really sure that this reset behavior is correct behavior. Some kind of log file would be useful. If this really is designed to crunch for a couple of hours, give up, and start a new one, then give us points for the effort, and also consider some sort of log file so we can tell what's going on and don't just up and quit because the software appears to not be accomplishing anything.
[Nov 22, 2004 1:07:02 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: The restarting problem--maybe a log needed for the app?

The work unit is placed within a folder named ud_21380_0.dir created by the executable with the same name.

Within it are the working unit files including random number generator for seeding the rosetta program and caching of results which are processed and sent back to the server. There is a file with a .pdb extension that corresponds to the protein data sequence - however it is inaccessible.

Other .ud extension files created once the first result is returned to the server house the users profile information/settings/cpu time etc...

If anyone has found any other info on the workings of the program that are out of the norm ... post it ... something more than just 'my WU terminated after x%' would be useful as we are all aware of this issue.

ATTN Admin - An Error Log would be very useful, as suggested.

As a researcher myself I understand the scale of the project... which in itself is very ambitious... and would like to continue contributing to its success. However I do not understand how global participation can be achieved and project deemed fit to commence without the following:


- Cross platform support - Linux, Mac OS etc...
- Mutiple CPU support
- Error log reporting
- Multi lingual support


These are simple implementations to a program that should have been available at the time of release seeing little user input is required in the first place. The extra time spent developing the application is insignificant to the amount of processing time already lost due its poor functionality.

I hope those involved can rectify what seems like a 'rushed' job.
----------------------------------------
[Edit 3 times, last edit by Former Member at Nov 22, 2004 1:37:47 PM]
[Nov 22, 2004 1:17:00 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: The restarting problem--maybe a log needed for the app?

After successfully completing 3 tasks taking a total of 4.25 days, my machine has also been aborting wu's (17 hrs worth so far). Statements have been made that these are caused by proteins that cannot be folded. While this may be true, I question why this is happening to everybody (or at least a lot of people) at the same time, especially after a period of a few days of success? Have WCG staff actually tried folding some of these proteins to verify that this behaviour is correct? Is it possible that our software has been automatically updated by a buggy version? I understand that this type of update is indeed possible.

The issue of receiving CPU credit is to me not important, as long as we are indeed doing something useful. I think that WCG owes the community some reasoned answers, and quickly, before it dissipates . We are, after all, the ones donating our computer resources to the cause (including increased power usage), and accepting the risk that our computers may fail due to thermal stress.
[Nov 22, 2004 2:17:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
confused Re: From the Application Developer

Hi Rick,

You say I should get credit for the non converged but I don't
In the statistics I have dropped down from 70h of cpu-time per day to 20h.
Eventhough my 4 devices has been running all the time.
I also dropped down from 2000 to 600 point per day.

I now have 3 devices that only go to approx. 4% and then gets a new work unit.

How many "dud" work units can I expect? and why don't I get the points for CPU time?
[Nov 22, 2004 2:30:22 PM]   Link   Report threatening or abusive post: please login first  Go to top 
joatmon
Senior Cruncher
Joined: Nov 17, 2004
Post Count: 185
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: From the Application Developer

Since I am not getting any points or accumulated CPU time for the 1 or 2 % workunits, I can only assume that my agent is terminating abnormally. Either I am wasting my computer's time or the project's workunits.

My first two workunits processed to 100%. Since then I have had about 30 in a row that reset after reaching the small percentage complete. not very feasible for a random event, tends to indicate that any workunit I download after a certain time will not be workable.

Is anyone out ther eable to download a new workunit and have ti process to 100% completion?

Either my computers, my configuration or the WCG software/workunits them selves have a problem here. The points don't matter, but are an indication that my PC's effort is actually making a contribution. Without that, all I know is my CPU gets hot and the fan runs all the time for no apparent reason. I know how to fix that, and have now disabled the WCG agent on my machines. If there is an upgrade or diagnostic procedure then I may try that, ubt I see no reason to continue like this anymore.

Good luck with the project, I would help if it could be more automatic and actually do something.
[Nov 22, 2004 6:08:58 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: From the Application Developer

I've been getting the same results as the previous posters - the first task completed, then numerous aborts, each time reloading about a 1Mbyte file. I have gotten no points for the last several days, despite virtually continuous crunching. I understand that not all the protein folding problems complete, but not getting any "work units" for the work to find this out doesn't make any sense.

I'm deinstalling the WCG program, even though I think the idea has a lot of promise. Email me when the work-unit situation is fixed, then I'll try it again.
[Nov 22, 2004 6:31:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Have some faith

Rick,

This is a helpful post (though I think there are still valid concerns out there). My question is, why was this not posted in the "Member News" or "Known Issues" section instead of buried in a thread?
[Nov 22, 2004 6:48:48 PM]   Link   Report threatening or abusive post: please login first  Go to top 
joatmon
Senior Cruncher
Joined: Nov 17, 2004
Post Count: 185
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: The restarting problem--maybe a log needed for the app?

admins, thanks for identifying the problem , my machine is now back above the critical three percent smile

a log file is still a good idea, in case something like this comes up again.
[Nov 23, 2004 3:13:30 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: The restarting problem--maybe a log needed for the app?

I am almost certain this problem is caused by changes to the amount of available hard disk space. My first package went ok. Next stopped. Re-installing meant first package worked, next failed. I reduced the amount of disk space available (I have 10 Gb, but indicated that I have less than one, and everything now seems to work again.
[Nov 23, 2004 11:36:13 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 31   Pages: 4   [ Previous Page | 1 2 3 4 | Next Page ]
[ Jump to Last Post ]
Post new Thread