Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
World Community Grid Forums
Category: Beta Testing Forum: Beta Test Support Forum Thread: New Beta starting Aug 23, 2011 |
No member browsing this thread |
Thread Status: Locked Total posts in this thread: 164
|
Author |
|
Harriscott
Advanced Cruncher Joined: Jan 15, 2008 Post Count: 58 Status: Offline Project Badges: |
mfbabb2, sorry to gloat, but I got 12! I was just lucky to read the forum at the right time, and switch the profile to beta starvation mode. I wasn't so lucky the last couple of beta releases, and got Zippo.
|
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I've got a beta with some of the same responses that BSD noted yesterday. This is BETA_BETA_tk_0000001_0681_0 - it looked normal in the queue when I went to bed last night, but this morning it was reporting this:
Elapsed: 22:02:48 Percentage: 7.500% To completion: 56:42:56 After suspending and rebooting, it looks like this: Elapsed: 00:30:42 Percentage: 6.562% To completion: 07:03:01 A strange thing about this unit is that the "To completion" time is ticking up and down in sort of a descending rollercoaster pattern. I don't know that I've seen this behavior in other beta units. |
||
|
sk..
Master Cruncher http://s17.rimg.info/ccb5d62bd3e856cc0d1df9b0ee2f7f6a.gif Joined: Mar 22, 2007 Post Count: 2324 Status: Offline Project Badges: |
'in the queue when I went to bed'
'this morning it was reporting this: Elapsed: 22:02:48' That was some sleep |
||
|
Jason1478963
Senior Cruncher United States Joined: Sep 18, 2005 Post Count: 295 Status: Offline Project Badges: |
I had two XP machines crash/lockup overnight when I left them running these beta. The linux machines seem to be running better with less invalid or inconclusive.
---------------------------------------- |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I wish, skgiven! I'd like credit for those 22 hours instead of the 6 I actually got :)
That unit appears to be running normally now after the reboot. Not sure whether it will make the deadline or not, though. |
||
|
Rickjb
Veteran Cruncher Australia Joined: Sep 17, 2006 Post Count: 666 Status: Offline Project Badges: |
@BobCat13 & others reporting on suspend/resume behaviour:
I suggest that you state whether you have "Leave applications in memory while suspended" (aka LAIM) on or off, because this makes a huge difference to the actions being performed by the WCG, BOINC and o/s software. Also, but not with these beta WUs, I have found that under some versions of Windows (2K/XP) with LAIM off, processes do not always disappear from the system (Windows Task Manager) immediately, and they may resume as if LAIM was on, instead of from the last checkpoint. Thus, if you are testing suspend/resume with LAIM off, it is best to use a process listing utility, eg ps or top (*nix) or task mangler (Windows) to ensure that before you resume them, your suspended processes have disappeared from the o/s process list. Sorry, I didn't get around to testing suspend/resume with these WUs on my XP machines. All 19 WUs that I received ran faultlessly as far as I could tell, and are all Valid or PV (XP-32/BOINC 6.2.19, XP-64/BOINC 6.2.19-x64, Win7-x64/BOINC 6.10.58-x64). |
||
|
KWSN - A Shrubbery
Master Cruncher Joined: Jan 8, 2006 Post Count: 1585 Status: Offline |
As stated in the original post, I have LAIM on. This behavior was observed while running task manager. I can't imagine another way to determine what process is using the CPU without using a utility to view it. The vina task continued to run while the client reported the task as suspended. CPU time continued to increase while elapsed time stayed still. Meanwhile, the client would activate another task and elapsed time would increment without any CPU time accumulating.
----------------------------------------Please trust that other people do actually understand their systems well enough to report aberrant behavior. I certainly wouldn't be actively participating in Beta if I didn't. Nevertheless, this is clearly a problem with the application. The techs are aware of it and are testing solutions. Obviously, this one didn't work. Distributed computing volunteer since September 27, 2000 |
||
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Had two on my Athlon 7750 WinXP, but the system was blocked by them - not responding for seconds. When both WUs started all was frozen for nearly a minute. Found this message in the messsage tab (for the first time):
26.08.2011 19:30:39|World Community Grid|Starting BETA_BETA_00001870_0000002_0397_1 26.08.2011 19:30:39|World Community Grid|Starting task BETA_BETA_00001870_0000002_0397_1 using beta13 version 615 26.08.2011 19:30:39|World Community Grid|Starting BETA_BETA_00001870_0000002_0846_0 26.08.2011 19:30:40|World Community Grid|Starting task BETA_BETA_00001870_0000002_0846_0 using beta13 version 615 26.08.2011 19:31:05||Can't rename current state file to previous state file; Zugriff verweigert (0x5) 26.08.2011 19:31:17||Can't delete previous state file; Zugriff verweigert (0x5) 26.08.2011 19:31:30||Can't rename current state file to previous state file; Zugriff verweigert (0x5) 26.08.2011 19:31:38||Can't delete previous state file; Zugriff verweigert (0x5) 26.08.2011 19:35:26|World Community Grid|[checkpoint_debug] result BETA_BETA_00001870_0000002_0846_0 checkpointed 26.08.2011 19:36:14|World Community Grid|[checkpoint_debug] result BETA_BETA_00001870_0000002_0397_1 checkpointed So maybe the boinc manager (v5.10.45) was blocked by them as well... But at least both validated. |
||
|
Crystal Pellet
Veteran Cruncher Joined: May 21, 2008 Post Count: 1316 Status: Offline Project Badges: |
Nevertheless, this is clearly a problem with the application. The techs are aware of it and are testing solutions. Obviously, this one didn't work. IMO v6.15 is worse than the previous versions on Windows. Before this version no errors at all, now 4 with a Windows APPCRASH on 3 different computers, where in BOINC the task runs on without using cpu, but of course increasing elaps time, blocking an other task to start. Edit: It's getting even worse, 5th error BETA_BETA_00001592_0000002_0093_2 using beta13 version 615 caused a BSOD . Restarted after the boot without using saved checkpoint with 0 seconds cpu at 0% with over 6 hours elapsed time Workunit Status Project Name: Beta Test Created: 08/26/2011 02:03:13 Name: BETA_BETA_00001592_0000002_0093 Minimum Quorum: 2 Replication: 3 Result Name App Version Number Status Sent Time Time Due / Return Time CPU Time (hours) Claimed/ Granted BOINC Credit BETA_ BETA_ 00001592_ 0000002_ 0093_ 2-- - In Progress 27/08/11 05:10:30 28/08/11 19:34:30 0.00 0.0 / 0.0 BETA_ BETA_ 00001592_ 0000002_ 0093_ 1-- 615 Inconclusive 26/08/11 02:08:08 26/08/11 23:45:30 6.05 85.8 / 0.0 BETA_ BETA_ 00001592_ 0000002_ 0093_ 0-- 615 Inconclusive 26/08/11 02:07:53 27/08/11 05:09:45 5.26 125.4 / 0.0 [Edit 2 times, last edit by Crystal Pellet at Aug 27, 2011 11:52:54 AM] |
||
|
BobCat13
Senior Cruncher Joined: Oct 29, 2005 Post Count: 295 Status: Offline Project Badges: |
@BobCat13 & others reporting on suspend/resume behaviour: I suggest that you state whether you have "Leave applications in memory while suspended" (aka LAIM) on or off, because this makes a huge difference to the actions being performed by the WCG, BOINC and o/s software. LAIM is on here, but suspend/resume worked fine with the task stopping within a second of suspending. The error occurred after the client had been stopped, the system rebooted, and the client starting again. Prior to stop/reboot/start, all jobs were taking a little under 10 minutes each. After the reboot, the task ran for almost 2 hours without finishing a job and then erred. [Edit 1 times, last edit by BobCat13 at Aug 27, 2011 2:08:27 PM] |
||
|
|