| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 50
|
|
| Author |
|
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 328 Status: Offline Project Badges:
|
OS Windows 11; Hardware Ryzen 7 5800X
I had several beta workunits running apparently OK and checkpointing normally. I set 'leave application in memory' off. Suspended the workunit; waited a minute or so; resumed workunit. It appeared to restart OK but failed with computation error 29539 within a few seconds. The STDERR shows 'out of memory'. I have 32Gb RAM and currently using 26%. Workunit: BETA30_9800028_0250 |
||
|
|
PowerFactor
Ace Cruncher Joined: Dec 9, 2016 Post Count: 4033 Status: Offline Project Badges:
|
I have 5 machines with Ubuntu 24.04 Linux. Here is my report:
----------------------------------------I have 137 Beta Units The following Work Units are valid: BETA_BETA30_9800008_0551_0 BETA_BETA30_9800008_0553_1 BETA_BETA30_9800008_0918_0 BETA_BETA30_9800021_0028_0 BETA_BETA30_9800021_0024_1 BETA_BETA30_9800021_0027_0 BETA_BETA30_9800021_0023_1 BETA_BETA30_9800021_0029_0 BETA_BETA30_9800021_0011_0 BETA_BETA30_9800021_0005_1 BETA_BETA30_9800021_0031_0 BETA_BETA30_9800021_0019_0 BETA_BETA30_9800021_0025_0 BETA_BETA30_9800021_0021_0 BETA_BETA30_9800021_0018_0 BETA_BETA30_9800021_0014_0 BETA_BETA30_9800021_0004_0 BETA_BETA30_9800021_0013_1 BETA_BETA30_9800022_0052_0 BETA_BETA30_9800022_0016_1 BETA_BETA30_9800022_0051_0 BETA_BETA30_9800022_0050_0 BETA_BETA30_9800025_0784_1 BETA_BETA30_9800025_0777_0 The following are pending validation: BETA_BETA30_9800021_0015_1 BETA_BETA30_9800025_0770_1 BETA_BETA30_9800025_0771_0 The following say "too late". I assumed they errored out: BETA_BETA30_9800008_0912_0 BETA_BETA30_9800008_0906_0 The rest are still in progress. [Edit 1 times, last edit by PowerFactor at Apr 26, 2025 11:43:45 PM] |
||
|
|
phytell
Cruncher Joined: Sep 8, 2014 Post Count: 39 Status: Offline |
I've had quite a few betas hang for well over a day (up to 40 hours per results) before being user aborted. Several of those were still at .5%.
Multiple systems, both Windows and Linux. Those betas that did complete ran quite quickly - less than an hour. |
||
|
|
f300
Cruncher Joined: Jan 14, 2014 Post Count: 2 Status: Offline Project Badges:
|
I tried Hibernating (Win10) while leaving BOINC open (and not using snooze). The MAM tasks appeared to carry on without errors after resuming from Hibernate. Only tried the once so far, don't know if it will work reliably.
|
||
|
|
ca05065
Senior Cruncher Joined: Dec 4, 2007 Post Count: 328 Status: Offline Project Badges:
|
OS Windows 11; Hardware Ryzen 7 5800X
I shut down my PC each night while 7 beta tasks were running. On startup 6 beta workunits failed with computation error (stderr shows out of memory). One workunit started from the begining. |
||
|
|
catchercradle
Senior Cruncher England Joined: Jan 16, 2009 Post Count: 167 Status: Offline Project Badges:
|
I have had one task labled, "Too late." Due date was yesterday so I am assuming that was the reason but after my wingman's task errored out no _2 or _3 tasks sent out for that work unit. You can see the work unit here My two remaining tasks that have been running for 35 hours and just reached 40%. It is touch and go whether they will finish on time. (This on a Ryzen9 that is finishing many tasks in half the time my wingmen take!
----------------------------------------On the positive side, I do have over ten tasks completed across three hosts on the same machine. Ubuntu Host OS, one Windows10 VM and one Ubuntu VM. [Edit 1 times, last edit by catchercradle at Apr 27, 2025 8:23:14 AM] |
||
|
|
geophi
Advanced Cruncher U.S. Joined: Sep 3, 2007 Post Count: 113 Status: Offline Project Badges:
|
I have had one task labled, "Too late." Due date was yesterday so I am assuming that was the reason but after my wingman's task errored out no _2 or _3 tasks sent out for that work unit. You can see the work unit here My two remaining tasks that have been running for 35 hours and just reached 40%. It is touch and go whether they will finish on time. (This on a Ryzen9 that is finishing many tasks in half the time my wingmen take! On the positive side, I do have over ten tasks completed across three hosts on the same machine. Ubuntu Host OS, one Windows10 VM and one Ubuntu VM. The Jurisica Lab Operational Status update yesterday said they would try to extend deadlines for these long-running tasks. They also covered the why _2 or _3 tasks are not being sent out for some work units. [Edit 1 times, last edit by geophi at Apr 27, 2025 5:36:48 PM] |
||
|
|
Tech57
Cruncher Joined: Mar 29, 2018 Post Count: 10 Status: Offline Project Badges:
|
I've had quite a few betas hang for well over a day (up to 40 hours per results) before being user aborted. Same issue here. Task hang at 3.355% and CPU core utilization is 100%. My hardware is a laptop running Windows 11 24H2 on an Intel i5-1340P processor. |
||
|
|
gb009761
Master Cruncher Scotland Joined: Apr 6, 2005 Post Count: 3010 Status: Offline Project Badges:
|
Earlier on today, I had to shut down my laptop (as I was going on holiday - taking my machine with me), and when I came to start it up again, the following Beta WU's all subseqnelty aborted - despite them having had successful (or, seemingly so) checkpoints...
----------------------------------------BETA_BETA30_9800028_0365_2 Francis-T16 Error 2025-04-27 00:17:57 UTC 2025-05-01 00:17:57 UTC 2025-04-27 18:48:46 UTC 7.23 / 11.53 294 / 0 BETA_BETA30_9800028_0576_2 Francis-T16 Error 2025-04-27 00:17:57 UTC 2025-05-01 00:17:57 UTC 2025-04-27 18:48:46 UTC 7.18 / 11.46 292.1 / 0 BETA_BETA30_9800030_0014_2 Francis-T16 Error 2025-04-27 00:17:57 UTC 2025-05-01 00:17:57 UTC 2025-04-27 18:48:46 UTC 7.25 / 11.56 294.7 / 0 BETA_BETA30_9800028_0690_2 Francis-T16 Error 2025-04-26 18:54:54 UTC 2025-04-30 18:54:54 UTC 2025-04-27 18:48:46 UTC 11.87 / 18.86 480.8 / 0 BETA_BETA30_9800028_0707_2 Francis-T16 Error 2025-04-26 18:54:54 UTC 2025-04-30 18:54:54 UTC 2025-04-27 18:48:46 UTC 10.36 / 16.46 419.8 / 0 BETA_BETA30_9800028_0710_2 Francis-T16 Error 2025-04-26 18:54:54 UTC 2025-04-30 18:54:54 UTC 2025-04-27 18:48:46 UTC 10.59 / 16.74 426.7 / 0 BETA_BETA30_9800028_0735_2 Francis-T16 Error 2025-04-26 18:54:54 UTC 2025-04-30 18:54:54 UTC 2025-04-27 18:48:46 UTC 10.61 / 16.76 427.3 / 0 BETA_BETA30_9800030_0928_2 Francis-T16 Error 2025-04-26 18:54:54 UTC 2025-04-30 18:54:54 UTC 2025-04-27 18:48:46 UTC 11.05 / 17.62 449.3 / 0 Needless to say, all my 'non Beta' WU's (MCM) picked up again from their last checkpoint. ![]() |
||
|
|
catchercradle
Senior Cruncher England Joined: Jan 16, 2009 Post Count: 167 Status: Offline Project Badges:
|
Thanks George.
The Jurisica Lab Operational Status update yesterday said they would try to extend deadlines for these long-running tasks. They also covered the why _2 or _3 tasks are not being sent out for some work units. I see looking at the task pages for my two long tasks the deadline has been extended to 05:02:2025 (American Format) I think just one day so 30th instead of 29th April would have been enough for mine but even 2nd May might not be enough for some slower machines. |
||
|
|
|