Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Completed Research Forum: Computing for Clean Water Forum Thread: Message from server: Resent lost task |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 21
|
Author |
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I did redo them but in the process I lost 100+ hours of crunching time.
52 results just lost out there somewhere Makes me wonder how many others got lost for other crunchers |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
bugg,
Was told where to look and hack files out v.v. the ever returning old beta sticky files [from a stopped BOINC system], but not going to tell you... You're bound to cause damage to critical control files. The way to get rid of them properly, is: 1. First hit the Update button on the BOINC Manager to clear any waiting Ready to Report, put on the blindfold after positioning mouse over project Reset button, then hit left button of mouse. All file references to tasks will be removed for WCG. 2. Run the buffer empty, then do a project update to clear any Ready to Report and then a project Reset. In both cases, all selected sciences will redownload the permanent science related files and download new work. What you still had is declared aborted or error. Any work downloaded after 00:00 UTC goes against the daily quota of 80 per core... well who ever does manage to process 80 per core per day [maybe when the GPU project goes life in the tested WU build size, at which time there may have to be an alteration to that server control] --//-- |
||
|
Bugg
Senior Cruncher USA Joined: Nov 19, 2006 Post Count: 271 Status: Offline Project Badges: |
Well, that didn't seem to do the trick. Here's what it looked like in my event log:
----------------------------------------3/18/2012 1:00:08 PM | World Community Grid | update requested by user 3/18/2012 1:00:12 PM | World Community Grid | Sending scheduler request: Requested by user. 3/18/2012 1:00:12 PM | World Community Grid | Not reporting or requesting tasks 3/18/2012 1:00:14 PM | World Community Grid | Scheduler request completed 3/18/2012 1:00:45 PM | World Community Grid | Resetting project 3/18/2012 1:04:46 PM | World Community Grid | work fetch resumed by user 3/18/2012 1:04:49 PM | World Community Grid | Sending scheduler request: To fetch work. 3/18/2012 1:04:49 PM | World Community Grid | Requesting new tasks for ATI GPU 3/18/2012 1:04:51 PM | World Community Grid | Scheduler request completed: got 0 new tasks 3/18/2012 1:04:51 PM | World Community Grid | Didn't resend lost task c4cw_target05_160338721_0 (expired) 3/18/2012 1:04:51 PM | World Community Grid | Didn't resend lost task c4cw_target05_160364807_0 (expired) 3/18/2012 1:04:51 PM | World Community Grid | Didn't resend lost task c4cw_target05_160458750_0 (expired) 3/18/2012 1:04:51 PM | World Community Grid | Didn't resend lost task c4cw_target05_160456102_0 (expired) 3/18/2012 1:04:51 PM | World Community Grid | No tasks sent 3/18/2012 1:04:51 PM | World Community Grid | No tasks are available for Computing for Clean Water 3/18/2012 1:04:51 PM | World Community Grid | Tasks are committed to other platforms 3/18/2012 1:05:06 PM | World Community Grid | Sending scheduler request: To fetch work. 3/18/2012 1:05:06 PM | World Community Grid | Requesting new tasks for CPU 3/18/2012 1:05:08 PM | World Community Grid | Scheduler request completed: got 4 new tasks 3/18/2012 1:05:08 PM | World Community Grid | Didn't resend lost task c4cw_target05_160338721_0 (expired) 3/18/2012 1:05:08 PM | World Community Grid | Didn't resend lost task c4cw_target05_160364807_0 (expired) 3/18/2012 1:05:08 PM | World Community Grid | Didn't resend lost task c4cw_target05_160458750_0 (expired) 3/18/2012 1:05:08 PM | World Community Grid | Didn't resend lost task c4cw_target05_160456102_0 (expired) 3/18/2012 1:05:10 PM | World Community Grid | Started download of wcg_c4cw_lmps_6.41_windows_x86_64 3/18/2012 1:05:10 PM | World Community Grid | Started download of wcg_c4cw_graphics_6.41_windows_x86_64 3/18/2012 1:05:14 PM | World Community Grid | Finished download of wcg_c4cw_graphics_6.41_windows_x86_64 3/18/2012 1:05:14 PM | World Community Grid | Started download of c4cw_image01_6.41.tga 3/18/2012 1:05:15 PM | World Community Grid | Finished download of c4cw_image01_6.41.tga 3/18/2012 1:05:15 PM | World Community Grid | Started download of c4cw_image04_6.41.tga 3/18/2012 1:05:17 PM | World Community Grid | Finished download of c4cw_image04_6.41.tga 3/18/2012 1:05:17 PM | World Community Grid | Started download of c4cw_image02_6.41.tga 3/18/2012 1:05:20 PM | World Community Grid | Finished download of c4cw_image02_6.41.tga 3/18/2012 1:05:20 PM | World Community Grid | Started download of c4cw_image07_6.41.tga 3/18/2012 1:05:21 PM | World Community Grid | Finished download of wcg_c4cw_lmps_6.41_windows_x86_64 3/18/2012 1:05:21 PM | World Community Grid | Finished download of c4cw_image07_6.41.tga 3/18/2012 1:05:21 PM | World Community Grid | Started download of c4cw_image06_6.41.tga 3/18/2012 1:05:21 PM | World Community Grid | Started download of c4cw_image03_6.41.tga 3/18/2012 1:05:22 PM | World Community Grid | Finished download of c4cw_image06_6.41.tga 3/18/2012 1:05:22 PM | World Community Grid | Finished download of c4cw_image03_6.41.tga 3/18/2012 1:05:22 PM | World Community Grid | Started download of c4cw_image05_6.41.tga 3/18/2012 1:05:24 PM | World Community Grid | Finished download of c4cw_image05_6.41.tga 3/18/2012 1:05:24 PM | World Community Grid | Starting task c4cw_target05_175508833_0 using c4cw version 641 3/18/2012 1:05:24 PM | World Community Grid | Starting task c4cw_target05_175495684_0 using c4cw version 641 3/18/2012 1:05:24 PM | World Community Grid | Starting task c4cw_target05_175492891_0 using c4cw version 641 3/18/2012 1:05:24 PM | World Community Grid | Starting task c4cw_target05_175482932_0 using c4cw version 641 You can see how many times those "lost tasks" are being mentioned. It's every work fetch/reporting, basically. i5-12600K (3.7GHz), 32GB DDR5, Win11 64bit Home |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Oink. Well then we have to get nasty nasty. Detach from WCG and reattach [set work fetch to no and clear all tasks you have first]. Not going to say that I'll eat my hat, but I'll prepare it with the appropriate softeners and spices anyhow ;P
------------------------------------------//-- [Edit 1 times, last edit by Former Member at Mar 18, 2012 6:20:14 PM] |
||
|
Bugg
Senior Cruncher USA Joined: Nov 19, 2006 Post Count: 271 Status: Offline Project Badges: |
Well, finished the work I had and it uploaded, then reported. I then detached from WCG and reattached and...
----------------------------------------3/16/2012 6:33:26 PM | | Starting BOINC client version 6.12.34 for windows_x86_64 3/16/2012 6:33:26 PM | | log flags: file_xfer, sched_ops, task 3/16/2012 6:33:26 PM | | Libraries: libcurl/7.21.6 OpenSSL/1.0.0d zlib/1.2.5 3/16/2012 6:33:26 PM | | Data directory: C:\ProgramData\BOINC 3/16/2012 6:33:26 PM | | Running under account Doug 3/16/2012 6:33:26 PM | | Processor: 4 GenuineIntel Intel(R) Core(TM) i5-2500K CPU @ 3.30GHz [Family 6 Model 42 Stepping 7] 3/16/2012 6:33:26 PM | | Processor: 256.00 KB cache 3/16/2012 6:33:26 PM | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 syscall lm vmx tm2 popcnt aes pbe 3/16/2012 6:33:26 PM | | OS: Microsoft Windows 7: Home Premium x64 Edition, Service Pack 1, (06.01.7601.00) 3/16/2012 6:33:26 PM | | Memory: 7.98 GB physical, 15.96 GB virtual 3/16/2012 6:33:26 PM | | Disk: 465.66 GB total, 387.98 GB free 3/16/2012 6:33:26 PM | | Local time is UTC -5 hours 3/16/2012 6:33:26 PM | | ATI GPU 0: ATI Radeon HD 5700 series (Juniper) (CAL version 1.4.1664, 1024MB, 1360 GFLOPS peak) 3/16/2012 6:33:26 PM | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 1882039; resource share 100 3/16/2012 6:33:26 PM | World Community Grid | General prefs: from World Community Grid (last modified 16-Mar-2012 02:28:50) 3/16/2012 6:33:26 PM | World Community Grid | Host location: none 3/16/2012 6:33:26 PM | World Community Grid | General prefs: using your defaults 3/16/2012 6:33:26 PM | | Reading preferences override file 3/16/2012 6:33:26 PM | | Preferences: 3/16/2012 6:33:26 PM | | max memory usage when active: 7765.95MB 3/16/2012 6:33:26 PM | | max memory usage when idle: 8174.68MB 3/16/2012 6:33:26 PM | | max disk usage: 10.00GB 3/16/2012 6:33:26 PM | | max download rate: 9999002 bytes/sec 3/16/2012 6:33:26 PM | | max upload rate: 9999002 bytes/sec 3/16/2012 6:33:26 PM | | (to change preferences, visit the web site of an attached project, or select Preferences in the Manager) 3/16/2012 6:33:26 PM | | Not using a proxy 3/16/2012 6:33:35 PM | World Community Grid | work fetch resumed by user 3/16/2012 6:33:37 PM | World Community Grid | Sending scheduler request: To fetch work. 3/16/2012 6:33:37 PM | World Community Grid | Requesting new tasks for ATI GPU 3/16/2012 6:33:40 PM | World Community Grid | Scheduler request completed: got 0 new tasks 3/16/2012 6:33:40 PM | World Community Grid | Didn't resend lost task c4cw_target05_160338721_0 (expired) 3/16/2012 6:33:40 PM | World Community Grid | Didn't resend lost task c4cw_target05_160364807_0 (expired) 3/16/2012 6:33:40 PM | World Community Grid | Didn't resend lost task c4cw_target05_160458750_0 (expired) 3/16/2012 6:33:40 PM | World Community Grid | Didn't resend lost task c4cw_target05_160456102_0 (expired) 3/16/2012 6:33:40 PM | World Community Grid | No tasks sent 3/16/2012 6:33:40 PM | World Community Grid | No tasks are available for Computing for Clean Water 3/16/2012 6:33:40 PM | World Community Grid | Tasks are committed to other platforms i5-12600K (3.7GHz), 32GB DDR5, Win11 64bit Home |
||
|
KWSN - A Shrubbery
Master Cruncher Joined: Jan 8, 2006 Post Count: 1585 Status: Offline |
I'm pretty sure it's server side. As long as the grid knows that computer it's going to give you that message. I have one system myself getting a similar message. At this point, I think it's going to have to time out on the server before we can hope for any resolution. Short of that, the techs are going to have to do some sort of purge as I'm sure it affects many computers.
----------------------------------------Distributed computing volunteer since September 27, 2000 |
||
|
Bugg
Senior Cruncher USA Joined: Nov 19, 2006 Post Count: 271 Status: Offline Project Badges: |
Well, after it running for a while after doing what Rob suggested, those message seem to not be appearing anymore in my event log. I also just checked my Results Status page on the site here and those 4 wu's are also now finally gone from being In Progress status. YAY! They are, however, showing up as ERRORed wu's now with the status as Detached. I hope that means someone else gets them so they get done. I'm assuming this is the case for that status. :)
----------------------------------------i5-12600K (3.7GHz), 32GB DDR5, Win11 64bit Home |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Yes, after the detach/attach, there will have been a last time clearing round of remnants. It's an oddity [to mortals] to do with getting both sides in sync. It's similar as detaching and attaching with tasks still on the system. On detach, the tasks continue to show as "In progress" [the client has not told the server it did so, going AWOL... detaching can be done off-line as well]. On attach and after the a normal server connect will you see "Detached" as status on the Result Status page.
All is well now... but it is a nuisance that these strong measures are needed bar hacking. --//-- |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
P.S. On the part of them 4 still showing as "In Progress" until the detach/attach. Somehow was convinced you indicated they were gone gone from the RS pages too. It does explain why they would not clear off the client, but it is a bug... marked (expired), the server would have to be told which the client seemingly does not. The test client I have lost them right at the startup of server 700 [first hour], but recovered them with a new ''sent''time stamp, so they moved from the last page to the top page when sorting [which is why I thought they'd disappeared without trace]. The new timestamp the sign on the Result Status page they were lost and recovered. 12 crunched twice on the same host.
--//-- |
||
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges: |
There was a bug in the new server code. I just submitted the fix.
When the server determines that a 'lost' result (when the result list from the client is missing entries that are listed in the result list from the database, the ones not in the client list are called 'lost') is too close to its deadline or otherwise not required anymore, then the result is expired. This means that the deadline is set to the current time and the workunit is evaluated to see if new results should be sent, etc. However, the deadline line for the result was being set to the current time + grace period deadline and the workunit was being transitioned at the current time + the grace period deadline. This would result in a series of client requests and server responses that would continually advance the deadline for the result and would continually sent a message to the user saying that the result had been 'lost. This should now be fixed. |
||
|
|