Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: New Beta Test starting Oct 31, 2013 [Issues Thread] |
No member browsing this thread |
Thread Status: Active Total posts in this thread: 211
|
Author |
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1316 Status: Offline Project Badges: |
The first 68 minutes progress 0.5%, using a full core, but BoincTasks saw only 30 seconds CPU-use during that elapsed time
----------------------------------------The task restarted itself and then there was normal progress and checkpointing after 18 minutes initialising. Windows 7, MS Security Essentials. Finally exceeds size limit, just like the wingmen. 01 Nov 14:51:26 Starting task BETA_BETA_9999987_0489_3 using beta17 version 719 in slot 12 01 Nov 15:59:05 Task BETA_BETA_9999987_0489_3 exited with zero status but no 'finished' file 01 Nov 15:59:05 If this happens repeatedly you may need to reset the project. 01 Nov 15:59:05 Restarting task BETA_BETA_9999987_0489_3 using beta17 version 719 in slot 12 01 Nov 18:16:19 Computation for task BETA_BETA_9999987_0489_3 finished 01 Nov 18:16:19 Output file BETA_BETA_9999987_0489_3_0 for task BETA_BETA_9999987_0489_3 exceeds size limit. 01 Nov 18:16:19 File size: 29188128.000000 bytes. Limit: 10485760.000000 bytes |
||
|
OldChap
Veteran Cruncher UK Joined: Jun 5, 2009 Post Count: 978 Status: Offline Project Badges: |
Had similar experience to KWSN - A Shrubbery above getting 98%+ cpu usage running 9 + some FAHV and 8 at a time on E5 26 xeon and 3770K, the former is native linux mint the latter running linux mint in virtualbox VM on windows7 64 host. both machines have 16GB ram
----------------------------------------[Edit 2 times, last edit by OldChap at Nov 1, 2013 5:56:37 PM] |
||
|
RichSavarie
Cruncher Canada Joined: Aug 9, 2005 Post Count: 49 Status: Offline Project Badges: |
I've had one Beta WU for nearly a day and I just noticed in the log that it has been restarting itself every ~10mins or so. Each time it restarts, the "estimated completion time" resets to about 10hrs. Absolutely no progress has been made. Do I abort or just let it go? Rich, What OS are you running and do you have any security software installed on your computer? If so can you check to see if the BOINC data directory is excluded. It sounds like an outside source is killing the process to have it restart. Thanks, -Uplinger I'm running on 64-bit Windows 7. Here are the specs as shown in the WCG/BOINC log: Processor: 4 GenuineIntel Intel(R) Core(TM) i7 CPU M 620 @ 2.67GHz [Family 6 Model 37 Stepping 2] I also just checked my security software and there was no mention of WCG/BOINC anywhere. No blocks or exclusions. I've never had issues running projects before so I didn't think it was necessary to play with it. However I have just added C:\ProgramData\BOINC to the exclusions list to see if this makes a difference. |
||
|
branjo
Master Cruncher Slovakia Joined: Jun 29, 2012 Post Count: 1892 Status: Offline Project Badges: |
Since I am at work, I can't check the progress, so just remote observations for now: - Full (down)load: 8 for i7 + 4 for i5. - 2 already errored: 1 on my i7-3770 Win7 64b 7.2.26 after 2.63 hours, 1 on my i5-2500S MAC OS X 10.9. (Mavericks) 7.0.65 after 2.68 h. Wingmen still In progress. - The other 10 In progress. ... Good luck and cheers ETA1: 1 on Win Valid (CPU Time 1.30 h), 2 on Mac PVal (CPU time 3.57 and 3.48 h) ETA2: methink 10 days deadline for Betas is a bit long ETA3: the last 2 WU's I caught unfinished when I came back from work were resends (one on Win, the second one on MAC), so both of them errored out. But checkpoints worked fine and the RAM usage was around 250 MB. ETA4: got another resend, Mac OS X, checkpoints OK (ca. 10 mins each), RAM <200 MB, run time:elapsed time like any other WCG WU (ca. 97%), errored out of course pi 1 nov 19:40:17 2013 | World Community Grid | Output file BETA_BETA_9999988_0343_2_0 for task BETA_BETA_9999988_0343_2 exceeds size limit. pi 1 nov 19:40:17 2013 | World Community Grid | File size: 54785988.000000 bytes. Limit: 10485760.000000 bytes Crunching@Home since January 13 2000. Shrubbing@Home since January 5 2006 |
||
|
vepaul
Senior Cruncher Belgium Joined: Nov 17, 2004 Post Count: 261 Status: Offline Project Badges: |
I had 5 : 2 still running, 3 ended in error
Running : 1 since 31/10/13 7:06:34, the other since 1/11/13 10:08:17 Errors (all appl 719) 31/10/13 07:06:20-22:04:29; next 31/10/13 22:11:02- 05:11:54 (next day); 01/11/13 05:11:57-10:08:13 However, BOINC credits are "accordés" for the WUs ended in error. Strange, no ? Paul |
||
|
branjo
Master Cruncher Slovakia Joined: Jun 29, 2012 Post Count: 1892 Status: Offline Project Badges: |
No, it is not strange - WCG usually rewards its crunchers for their effort especially if the error is not caused by them/us
----------------------------------------We are also rewarded with full run time for these errors Cheers Crunching@Home since January 13 2000. Shrubbing@Home since January 5 2006 [Edit 1 times, last edit by branjo at Nov 1, 2013 9:10:44 PM] |
||
|
darth_vader
Veteran Cruncher A galaxy far, far away... Joined: Jul 13, 2005 Post Count: 514 Status: Offline Project Badges: |
If you are encountering a restart issue of your result, please let us know a few things. 1. What OS are you running (ex. Windows 8 64 bit) 2. Do you have security software on your computer. One report from Gil was McAfee. 3. Check your security software to see if you can exclude either this application or the boinc data directory. On windows this is usually C:/ProgramData/BOINC/. 1. Win7 64-bit on i7 processor (8 threads) 2. Yes. Symantec. 3. Already excluded, so not a factor All three betas I received had the restart issue to some degree or another. Two errored due to the output file size. All three eventually ran to completion, but the elapsed time is very much under reported. Here's one example. It ran about 9 hours, but stats show 1.71 hours. Result Name: BETA_ BETA_ 9999984_ 0055_ 1-- <core_client_version>6.10.58</core_client_version> <![CDATA[ <stderr_txt> 4551 [17:08:03]: Computing pass 14552 [17:08:03]: Computing pass 14553 [17:08:04]: Computing pass 14554 .... [17:19:26]: Computing pass 16443 [17:19:26]: Computing pass 16444 [17:19:26]: Computing pass 16445 [17:19:27]: Computing pass 16446 Run complete, CPU time: 6168.295140 17:19:37 (8316): called boinc_finish </stderr_txt> <message> <file_xfer_error> <file_name>BETA_BETA_9999984_0055_1_0</file_name> <error_code>-131</error_code> </file_xfer_error> </message> ]]> - D |
||
|
RichSavarie
Cruncher Canada Joined: Aug 9, 2005 Post Count: 49 Status: Offline Project Badges: |
Yeah, so adding BOINC/WCG to my security software's exceptions list hasn't helped at all. The Beta WU is still constantly restarting itself every ten minutes or so. No useful messages being logged as far as I can tell except that it is being restarted.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
My output file exceeded the size limit just barely. Saw this mentioned upthread and wanted to report, if cases were being counted.
11/1/2013 3:20:56 PM World Community Grid Computation for task BETA_BETA_9999985_0158_1 finished 11/1/2013 3:20:56 PM World Community Grid Output file BETA_BETA_9999985_0158_1_0 for task BETA_BETA_9999985_0158_1 exceeds size limit. 11/1/2013 3:20:56 PM World Community Grid File size: 10998631.000000 bytes. Limit: 10485760.000000 bytes Win 7 Pro 64-bit Have barely gotten any betas in years of using WCG, so was glad to try! |
||
|
Ian_UK
Senior Cruncher England Joined: Oct 15, 2006 Post Count: 153 Status: Offline Project Badges: |
I had 2:10 units that error'd out due to too big a file size (Error code -131); believe both should have successfully completed. I understand Betas are meant to find problems, but my concern is whether this really indicative of a WCG procedural problem (i.e. nothing to do with research units themselves). We had similar types of issues with the HCC project timing units out at the end, as believe WCG got its calculations wrong (from memory all close (< 5%?) of completing OK) . The HCC (and these) units all error'd out very close to completing normally (or completed OK and then reported errors as didn’t comply with estimate).
----------------------------------------I personally would prefer less reliance on accurate estimates for file/time length (i.e. need to add larger safety margin), rather than risk wasting our efforts. Example of error: 01/11/2013 20:57:25 | World Community Grid | Computation for task BETA_BETA_9999987_0768_0 finished 01/11/2013 20:57:25 | World Community Grid | Output file BETA_BETA_9999987_0768_0_0 for task BETA_BETA_9999987_0768_0 exceeds size limit. 01/11/2013 20:57:25 | World Community Grid | File size: 110000046.000000 bytes. Limit: 10485760.000000 bytes Science isn't about boundaries. |
||
|
|