Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Retired Forums Forum: Member-to-Member Support [Read Only] Thread: Could we make sure to get enough work for several days ? |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 6
|
Author |
|
debrouxl
Advanced Cruncher France Joined: Dec 31, 2004 Post Count: 61 Status: Offline Project Badges: |
I have a notebook (P4 2.6 GHz, 512 MB RAM), which can run computations nearly 24/7.
----------------------------------------However, I can hardly connect it to Internet from monday morning to friday evening, because I have no direct Internet access where I live (I'm a student). Therefore, unless it is given a WU that requires days to complete, my computer has usually nothing to do starting from Tuesday, since most WUs require less than one day to complete. Therefore, it would be great if I could try to queue work for several days (by downloading more than one WU and/or selecting WUs that take a long time). Of course, from the server side, it requires means to evaluate the time needed to complete a task - and unfortunately, file size is not the relevant criteria, since the smallest VU I got (the one I'm talking below) took by far the highest amount of time. I know the Mersenne project, for example, allows both solutions I'm describing above. At least, the result file's naming convention ("result_0") looks like this could be possible. On a side note, the day before yesterday, I think I lost a WU which was ~95 % done, and required ~93 hours to reach that point... I know that the completion estimation is not extremely accurate, but like all other 11 previous WUs, the WU's progress rate had been linear until then, it should therefore have taken several more hours. tk* files in the WCG directory, and the subdirectory where the Rosetta program lies, were completely removed. I didn't check whether there was a result_0 file, but I guess there wasn't, since this WU doesn't appear in my points or # of returned results. During the ~30 minutes before I saw that unexpected completion, the only two significant events I can remember of are: * me saving (Windows copy & paste to another folder) tklg.ud, tkst.ud, tkop.ud~ totaling 12.4 MB. * another program I'm beta-testing crashing. I thought I might have made cut & paste instead of copy & paste. But checking again while another WU was running, showed that while the files are not locked and can therefore be moved, the program survives to them being moved. Similarly, it seems the other program crashing didn't trigger any corruption (but of course, this is not reproducable). The corresponding WU number (ud_xxxxxx file in the WCG directory) is very near from 282916 (which is that of a task I incorrectly received, and which was therefore canceled before I received the one I think I lost - but the file remains on my HD). |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I have the same issue as I have a laptop which I cannot connect to the Internet on a regular basis.
I use a tool called udmon which can be found here . Please be aware that this is not a supported or even recommended solution by WCG. You should understand that you could corrupt a WU which means that you will get no points for it and also elongate the phase of the project. However, I have never had a problem either whilst doing WCG or UD Cancer before it. One side effect you will see is that the number of device installs will be shown incorrectly on your member page. This is because each new WU is seen as a new device when downloaded. In addition, you have to be careful not to have to many WU cached. If you get a monster WU which takes a long time to complete, you may be in the situation that a WU in the cache has already run out of wall time (3 weeks) by the time your machine gets around to dealing with it. |
||
|
debrouxl
Advanced Cruncher France Joined: Dec 31, 2004 Post Count: 61 Status: Offline Project Badges: |
Thanks for the information, I've just downloaded udmon.
---------------------------------------- |
||
|
Alther
Former World Community Grid Tech United States of America Joined: Sep 30, 2004 Post Count: 414 Status: Offline Project Badges: |
You should also be aware that the workunits do not progress linearly. They can, but many times they do not. They usually hit a stretch than takes a little longer than other parts. Likewise, some sections can finish much more quickly than other parts.
----------------------------------------The bottom line is that while you can estimate what the finishing time will be based on time taken to reach a certain percentage, it doesn't mean that the rest of the workunit will progress at the same rate.
Rick Alther
Former World Community Grid Developer |
||
|
debrouxl
Advanced Cruncher France Joined: Dec 31, 2004 Post Count: 61 Status: Offline Project Badges: |
Alright, it seems to work fine. I wish I had knew of it and used it before: it may have been able to recover the 100-hour WU I lost...
---------------------------------------- |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Alright, it seems to work fine. I wish I had knew of it and used it before: it may have been able to recover the 100-hour WU I lost... I have never tried to restore a backup so I cannot say if this works. I have simply used it for the caching ability to ensure that I can give as much free CPU time as possible. |
||
|
|