World Community Grid - View Thread - Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

World Community Grid Forums

Category: Official Messages

Forum: News

Thread: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

Quick Go »

No member browsing this thread

Thread Status: Active
Total posts in this thread: 159

[ ]

Author

This topic has been viewed 34112 times and has 158 replies

Falconet
Master Cruncher
Portugal
Joined: Mar 9, 2009
Post Count: 3315
Status: Offline
Project Badges:

14 day badge for Human Proteome Folding - Phase 2

14 day badge for Nutritious Rice for the World

180 day badge for Help Fight Childhood Cancer

90 day badge for Help Cure Muscular Dystrophy - Phase 2

90 day badge for Computing for Clean Water

90 day badge for Drug Search for Leishmaniasis

90 day badge for GO Fight Against Malaria

20 year badge for Mapping Cancer Markers

2 year badge for Uncovering Genome Mysteries

2 year badge for Outsmart Ebola Together

1 year badge for FightAIDS@Home - Phase 2

5 year badge for Microbiome Immunity Project

14 day badge for Africa Rainfall Project

5 year badge for OpenPandemics - COVID-19


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

Agreed. A very informative post.

----------------------------------------

- AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W
- AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W
- AMD Ryzen 7 7730U 8C/16T 3.0 GHz

[Nov 7, 2024 11:34:08 AM]

spRocket
Senior Cruncher
Joined: Mar 25, 2020
Post Count: 280
Status: Offline
Project Badges:

100 year badge for Mapping Cancer Markers

1 year badge for Microbiome Immunity Project

5 year badge for Africa Rainfall Project

20 year badge for OpenPandemics - COVID-19


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

Still seeing retries as of 6:30 AM CST, but I run fairly short queues, so the the backlogs are short and clear quickly.

ETA: I don't always get retries, and a fair number go through without problems.

----------------------------------------
[Edit 1 times, last edit by spRocket at Nov 7, 2024 12:35:02 PM]

[Nov 7, 2024 12:32:58 PM]

gj82854
Advanced Cruncher
Joined: Sep 26, 2022
Post Count: 122
Status: Offline
Project Badges:

10 year badge for Mapping Cancer Markers

10 year badge for Africa Rainfall Project


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

Not getting any work now due to "Task are committed to other platforms" message. That will definitely fix the download problem.

[Nov 7, 2024 1:23:51 PM]

adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2360
Status: Recently Active
Project Badges:

5 year badge for Human Proteome Folding - Phase 2

90 day badge for Nutritious Rice for the World

2 year badge for Help Fight Childhood Cancer

2 year badge for Help Cure Muscular Dystrophy - Phase 2

14 day badge for Discovering Dengue Drugs - Together - Phase 2

180 day badge for The Clean Energy Project - Phase 2

1 year badge for Computing for Clean Water

1 year badge for Drug Search for Leishmaniasis

1 year badge for GO Fight Against Malaria

45 day badge for Computing for Sustainable Water

1 year badge for Uncovering Genome Mysteries

20 year badge for Outsmart Ebola Together

2 year badge for FightAIDS@Home - Phase 2

20 year badge for Smash Childhood Cancer

50 year badge for OpenPandemics - COVID-19


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

Looking at the latest 22 results for one of my devices that has a maximum of 1 ARP1-task in its queue, one of the biggest problems was (quoting savas) "a failing drive on one of the download servers", oldest first:

<22> * ARP1_0014490_126_0  Fedora Linux  Error      2024-11-04T07:23:47  2024-11-05T03:10:13   14.39/15.69
<22>   ARP1_0014490_126_1  Linux Ubuntu  Error      2024-11-04T07:23:52  2024-11-04T23:22:00    0.00/0.00
<22>   ARP1_0014490_126_2  Linux Debian  Too Late   2024-11-04T07:23:55  2024-11-06T04:57:14   29.92/29.92
<22>   ARP1_0014490_126_3  Linuxmint     Error      2024-11-04T23:36:39  2024-11-04T23:41:51    0.00/0.00
<22>   ARP1_0014490_126_4  Linuxmint     Error      2024-11-04T23:50:28  2024-11-05T00:00:44    0.00/0.00
<22>   ARP1_0014490_126_5  Linux Ubuntu  Error      2024-11-05T00:09:45  2024-11-05T01:15:45    0.00/0.00
<22>   ARP1_0014490_126_6  Linux Debian  Error      2024-11-05T01:25:55  2024-11-05T01:32:51    0.00/0.00

<21>   ARP1_0000725_130_0  Linux Debian  Error      2024-11-05T03:07:26  2024-11-05T04:28:38    0.00/0.00
<21>   ARP1_0000725_130_1  Linux Ubuntu  Error      2024-11-05T03:03:34  2024-11-05T03:38:54    0.00/0.00
<21> * ARP1_0000725_130_2  Fedora Linux  Error      2024-11-05T03:10:13  2024-11-05T03:21:02    0.00/0.00
<21>   ARP1_0000725_130_3  Linux Ubuntu  S.Aborted  2024-11-05T03:28:20  2024-11-05T05:33:33    0.00/0.00
<21>   ARP1_0000725_130_4  CentOS Linux  In Progr.  2024-11-05T03:49:43  2024-11-11T21:13:04    0.00/0.00
<21>   ARP1_0000725_130_5  Linux Ubuntu  Error      2024-11-05T04:38:15  2024-11-05T05:04:19    0.00/0.00
<21>   ARP1_0000725_130_6  Linux Debian  Error      2024-11-05T05:13:34  2024-11-05T05:26:31    0.00/0.00

<20>   ARP1_0005147_128_0  Linuxmint     Error      2024-11-04T08:47:15  2024-11-05T20:48:09    0.00/0.00
<20>   ARP1_0005147_128_1  Linux Ubuntu  Error      2024-11-04T08:45:57  2024-11-05T03:11:17    0.00/0.00
<20>   ARP1_0005147_128_2  Linux Ubuntu  Error      2024-11-04T08:49:13  2024-11-05T20:49:29    0.00/0.00
<20> * ARP1_0005147_128_3  Fedora Linux  Error      2024-11-05T03:21:02  2024-11-05T03:29:08    0.00/0.00
<20>   ARP1_0005147_128_4                Other                                                  0.00/0.00
<20>   ARP1_0005147_128_5  MSWin 10      Error      2024-11-05T21:10:58  2024-11-07T09:18:08    0.00/0.00
<20>   ARP1_0005147_128_6  MSWin 10      Error      2024-11-05T21:10:52  2024-11-07T09:11:22    0.00/0.00

<19>   ARP1_0010391_139_0  Linux Endeav  S.Aborted  2024-11-04T09:36:40  2024-11-05T21:32:00    0.00/0.00
<19>   ARP1_0010391_139_1  Fedora Linux  Error      2024-11-04T09:46:33  2024-11-05T00:22:13    0.00/0.00
<19>   ARP1_0010391_139_2  Linux Ubuntu  Error      2024-11-05T00:33:17  2024-11-05T01:44:37    0.00/0.00
<19>   ARP1_0010391_139_3  Linux Debian  Error      2024-11-05T01:52:56  2024-11-05T02:45:46    0.00/0.00
<19>   ARP1_0010391_139_4  Linux Ubuntu  Error      2024-11-05T02:52:42  2024-11-05T03:33:40    0.00/0.00
<19> * ARP1_0010391_139_5  Fedora Linux  Error      2024-11-05T03:42:30  2024-11-05T03:46:56    0.00/0.00

<18>   ARP1_0027813_139_0  Linux Ubuntu  Error      2024-11-04T16:28:38  2024-11-05T03:36:40    0.00/0.00
<18>   ARP1_0027813_139_1  Linux Ubuntu  S.Aborted  2024-11-04T16:14:53  2024-11-06T02:27:21    0.00/0.00
<18>   ARP1_0027813_139_2  Linuxmint     Error      2024-11-05T03:41:04  2024-11-05T03:45:58    0.00/0.00
<18> * ARP1_0027813_139_3  Fedora Linux  Error      2024-11-05T03:52:33  2024-11-05T04:10:05    0.00/0.00
<18>   ARP1_0027813_139_4  Linux Ubuntu  Error      2024-11-05T04:16:11  2024-11-05T04:21:47    0.00/0.00
<18>   ARP1_0027813_139_5  Linux Debian  Error      2024-11-05T04:25:52  2024-11-05T06:25:08    0.00/0.00

<17>   ARP1_0010864_139_0  Linux Mageia  Error      2024-11-04T09:38:50  2024-11-05T00:42:42    0.00/0.00
<17>   ARP1_0010864_139_1  Fedora Linux  In Progr.  2024-11-04T09:30:06  2024-11-14T09:30:06    0.00/0.00
<17>   ARP1_0010864_139_2  Linuxmint     Error      2024-11-05T00:45:56  2024-11-05T00:53:50    0.00/0.00
<17>   ARP1_0010864_139_3  Linux Debian  Error      2024-11-05T00:57:01  2024-11-05T04:21:18    0.00/0.00
<17> * ARP1_0010864_139_4  Fedora Linux  Error      2024-11-05T04:25:17  2024-11-05T04:37:28    0.00/0.00
<17>   ARP1_0010864_139_5  Linux Ubuntu  In Progr.  2024-11-05T04:42:56  2024-11-12T04:42:56    0.00/0.00

<16> * ARP1_0010494_140_0  Fedora Linux  Error      2024-11-05T04:39:40  2024-11-05T04:53:41    0.00/0.00
<16>   ARP1_0010494_140_1  Linux Ubuntu  In Progr.  2024-11-05T04:52:57  2024-11-11T04:52:57    0.00/0.00
<16>   ARP1_0010494_140_2  Linux Fedora  Error      2024-11-05T04:55:13  2024-11-05T06:03:23    0.00/0.00
<16>   ARP1_0010494_140_3  Linux Ubuntu  In Progr.  2024-11-05T06:09:40  2024-11-12T06:09:40    0.00/0.00

<15>   ARP1_0029563_139_0  Linux Ubuntu  Error      2024-11-04T18:00:52  2024-11-04T23:49:24    0.00/0.00
<15>   ARP1_0029563_139_1  Linux         Error      2024-11-04T18:06:23  2024-11-07T12:35:20    0.00/0.00
<15>   ARP1_0029563_139_2  Linux Ubuntu  Error      2024-11-05T00:02:19  2024-11-05T04:48:37    0.00/0.00
<15> * ARP1_0029563_139_3  Fedora Linux  Error      2024-11-05T04:53:41  2024-11-05T04:59:16    0.00/0.00
<15>   ARP1_0029563_139_4  Linux Debian  In Progr.  2024-11-05T05:07:45  2024-11-12T05:07:45    0.00/0.00
<15>   ARP1_0029563_139_5                W.2B sent                                              0.00/0.00

<14>   ARP1_0002124_140_0  Linux Ubuntu  Error      2024-11-05T05:31:33  2024-11-05T23:30:07    0.00/0.00
<14> * ARP1_0002124_140_1  Fedora Linux  Error      2024-11-05T05:23:58  2024-11-05T05:43:38    0.00/0.00
<14>   ARP1_0002124_140_2  Linux Ubuntu  Error      2024-11-05T05:51:18  2024-11-05T06:16:31    0.00/0.00
<14>   ARP1_0002124_140_3  Linux Ubuntu  Error      2024-11-05T06:25:08  2024-11-05T07:53:26    0.00/0.00
<14>   ARP1_0002124_140_4  Linux Ubuntu  In Progr.  2024-11-05T08:11:00  2024-11-12T08:11:00    0.00/0.00
<14>   ARP1_0002124_140_5  Linux Ubuntu  In Progr.  2024-11-05T23:46:26  2024-11-12T23:46:26    0.00/0.00

<13> * ARP1_0011221_140_0  Fedora Linux  Error      2024-11-05T05:43:38  2024-11-05T05:59:19    0.00/0.00
<13>   ARP1_0011221_140_1  Linux Ubuntu  In Progr.  2024-11-05T05:44:32  2024-11-11T05:44:32    0.00/0.00
<13>   ARP1_0011221_140_2  Linux Ubuntu  Error      2024-11-05T06:03:52  2024-11-05T07:09:46    0.00/0.00
<13>   ARP1_0011221_140_3  Linux Debian  In Progr.  2024-11-05T07:32:21  2024-11-12T07:32:21    0.00/0.00

<12> * ARP1_0011271_140_0  Fedora Linux  Error      2024-11-05T06:04:25  2024-11-05T06:10:17    0.00/0.00
<12>   ARP1_0011271_140_1  Fedora Linux  Error      2024-11-05T05:55:58  2024-11-05T07:03:17    0.00/0.00
<12>   ARP1_0011271_140_2  Linux Ubuntu  In Progr.  2024-11-05T06:16:47  2024-11-11T06:16:47    0.00/0.00
<12>   ARP1_0011271_140_3  Linux Ubuntu  In Progr.  2024-11-05T07:24:10  2024-11-12T07:24:10    0.00/0.00

<11>   ARP1_0019377_139_0  Linuxmint     Error      2024-11-04T11:01:50  2024-11-05T03:41:14    0.00/0.00
<11>   ARP1_0019377_139_1  Linux         In Progr.  2024-11-04T11:19:15  2024-11-14T11:19:15    0.00/0.00
<11>   ARP1_0019377_139_2  Linux Debian  Error      2024-11-05T03:55:06  2024-11-05T06:04:14    0.00/0.00
<11> * ARP1_0019377_139_3  Fedora Linux  Error      2024-11-05T06:10:17  2024-11-05T06:17:17    0.00/0.00
<11>   ARP1_0019377_139_4  Linuxmint     U.Aborted  2024-11-05T06:24:06  2024-11-05T16:54:21    0.00/0.00
<11>   ARP1_0019377_139_5  Linux Ubuntu  In Progr.  2024-11-05T17:24:28  2024-11-12T17:24:28    0.00/0.00

<10>   ARP1_0014043_139_0  Linux Ubuntu  Error      2024-11-04T12:33:18  2024-11-05T06:12:43    0.00/0.00
<10>   ARP1_0014043_139_1  Linux         In Progr.  2024-11-04T11:07:08  2024-11-14T11:07:08    0.00/0.00
<10> * ARP1_0014043_139_2  Fedora Linux  Error      2024-11-05T06:17:17  2024-11-05T06:30:29    0.00/0.00
<10>   ARP1_0014043_139_3  Linux Debian  Error      2024-11-05T06:34:16  2024-11-05T07:43:38    0.00/0.00
<10>   ARP1_0014043_139_4  Linux Debian  In Progr.  2024-11-05T08:03:21  2024-11-12T08:03:21    0.00/0.00

 <9>   ARP1_0021869_139_0  Linux Ubuntu  In Progr.  2024-11-04T10:53:06  2024-11-14T10:53:06    0.00/0.00
 <9>   ARP1_0021869_139_1  Linuxmint     Error      2024-11-04T11:01:49  2024-11-05T03:41:14    0.00/0.00
 <9>   ARP1_0021869_139_2  Linux Debian  Error      2024-11-05T04:04:16  2024-11-05T05:11:59    0.00/0.00
 <9>   ARP1_0021869_139_3  Linux Ubuntu  Error      2024-11-05T05:15:35  2024-11-05T06:20:50    0.00/0.00
 <9> * ARP1_0021869_139_4  Fedora Linux  Error      2024-11-05T06:30:29  2024-11-05T06:36:25    0.00/0.00
 <9>   ARP1_0021869_139_5  Linux Ubuntu  In Progr.  2024-11-05T06:45:35  2024-11-12T06:45:35    0.00/0.00

 <8>   ARP1_0016926_138_0  Linux Gentoo  In Progr.  2024-11-04T07:40:08  2024-11-14T07:40:08    0.00/0.00
 <8>   ARP1_0016926_138_1  Linux Ubuntu  Error      2024-11-04T07:40:09  2024-11-05T06:05:03    0.00/0.00
 <8>   ARP1_0016926_138_2  Fedora Linux  Error      2024-11-05T06:11:35  2024-11-05T06:28:05    0.00/0.00
 <8> * ARP1_0016926_138_3  Fedora Linux  Error      2024-11-05T06:36:25  2024-11-05T06:43:34    0.00/0.00
 <8>   ARP1_0016926_138_4  Linux Ubuntu  In Progr.  2024-11-05T06:51:38  2024-11-12T06:51:38    0.00/0.00

 <7> * ARP1_0011395_140_0  Fedora Linux  P. Valid.  2024-11-05T06:43:34  2024-11-06T01:35:17   15.44/16.82
 <7>   ARP1_0011395_140_1  Linux Ubuntu  In Progr.  2024-11-05T06:45:35  2024-11-11T06:45:35    0.00/0.00

 <6>   ARP1_0015040_139_0  AlmaLinux     S.Aborted  2024-11-04T09:46:37  2024-11-07T09:46:09    0.00/0.00
 <6>   ARP1_0015040_139_1  Linux Ubuntu  Error      2024-11-04T10:29:13  2024-11-05T01:05:50    0.00/0.00
 <6>   ARP1_0015040_139_2  Linux Ubuntu  Error      2024-11-05T01:09:56  2024-11-05T10:40:35    0.00/0.00
 <6>   ARP1_0015040_139_3  Linux Ubuntu  Error      2024-11-05T11:06:41  2024-11-05T23:56:51    0.00/0.00
 <6>   ARP1_0015040_139_4  Linux Ubuntu  Error      2024-11-06T00:27:25  2024-11-06T01:17:31    0.00/0.00
 <6> * ARP1_0015040_139_5  Fedora Linux  Error      2024-11-06T01:46:41  2024-11-06T02:13:53    0.00/0.00

 <5>   ARP1_0019545_139_0  Linuxmint     Error      2024-11-04T08:35:33  2024-11-05T01:28:16    0.00/0.00
 <5>   ARP1_0019545_139_1  Linux openSU  In Progr.  2024-11-04T08:33:18  2024-11-14T08:33:18    0.00/0.00
 <5>   ARP1_0019545_139_2  Alpine Linux  Error      2024-11-05T01:36:42  2024-11-06T01:46:59    0.00/0.00
 <5> * ARP1_0019545_139_3  Fedora Linux  Error      2024-11-06T02:13:53  2024-11-06T02:23:43    0.00/0.00
 <5>   ARP1_0019545_139_4  Linux Ubuntu  In Progr.  2024-11-06T02:50:51  2024-11-13T02:50:51    0.00/0.00

 <4>   ARP1_0026391_140_0  Linux openSU  In Progr.  2024-11-06T00:57:47  2024-11-12T00:57:47    0.00/0.00
 <4>   ARP1_0026391_140_1  Linux Ubuntu  Error      2024-11-06T00:58:12  2024-11-06T02:02:58    0.00/0.00
 <4> * ARP1_0026391_140_2  Fedora Linux  Error      2024-11-06T02:23:43  2024-11-06T03:25:33    0.00/0.00
 <4>   ARP1_0026391_140_3  Fedora Linux  In Progr.  2024-11-06T04:00:56  2024-11-13T04:00:56    0.00/0.00

 <3>   ARP1_0031312_140_0  Linux Ubuntu  Error      2024-11-06T01:07:14  2024-11-06T01:25:44    0.00/0.00
 <3>   ARP1_0031312_140_1  Linux openSU  In Progr.  2024-11-06T00:57:46  2024-11-12T00:57:46    0.00/0.00
 <3>   ARP1_0031312_140_2  Linux Ubuntu  Error      2024-11-06T01:57:09  2024-11-06T02:56:17    0.00/0.00
 <3> * ARP1_0031312_140_3  Fedora Linux  Error      2024-11-06T03:25:33  2024-11-06T03:38:21    0.00/0.00
 <3>   ARP1_0031312_140_4  Linux Debian  In Progr.  2024-11-06T04:23:55  2024-11-13T04:23:55    0.00/0.00

 <2>   ARP1_0030472_140_0  Linux Ubuntu  In Progr.  2024-11-06T03:56:01  2024-11-12T03:56:01    0.00/0.00
 <2> * ARP1_0030472_140_1  Fedora Linux  P. Valid.  2024-11-06T03:38:21  2024-11-06T20:29:49   14.18/15.47

 <1> * ARP1_0007183_141_0  Fedora Linux  P. Valid.  2024-11-06T20:29:49  2024-11-07T12:08:50   13.92/15.14
 <1>   ARP1_0007183_141_1  Linux Debian  In Progr.  2024-11-06T20:49:56  2024-11-12T20:49:56    0.00/0.00

Adri

[Nov 7, 2024 1:48:19 PM]

TPCBF
Master Cruncher
USA
Joined: Jan 2, 2011
Post Count: 2175
Status: Offline
Project Badges:

2 year badge for Human Proteome Folding - Phase 2

10 year badge for Help Fight Childhood Cancer

5 year badge for The Clean Energy Project - Phase 2

2 year badge for Computing for Clean Water

2 year badge for Drug Search for Leishmaniasis

2 year badge for GO Fight Against Malaria

2 year badge for Computing for Sustainable Water

200 year badge for Mapping Cancer Markers

5 year badge for Uncovering Genome Mysteries

50 year badge for Outsmart Ebola Together

20 year badge for FightAIDS@Home - Phase 2

50 year badge for Smash Childhood Cancer

50 year badge for Microbiome Immunity Project

100 year badge for OpenPandemics - COVID-19


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

For both ARP1 and MCM1, it's not clear how many users collect more work than they have any chance of processing -- that's why I'd like to see the MCM1 default deadline cut back, in the hope that the unwitting might find out why they are having problems whilst those who deliberately maintain large caches might be encouraged to reduce the size a little :-)

This is an issue that I mentioned several times in the past, but that has always been brushed aside and I have been marked the scapegoat.

If you go though some of the threads regarding the ARP1 download issues in recent days, you will find several posts of people that clearly state that they have loaded up choke full of ARP1 WUs, even thought the FAQ clearly lists that ARP1 requires MUCH more resources than any other project, in all terms, like download size, upload size, drive space and RAM needed. But it seems too many folks just ignore this and being selfish, loading up with huge numbers, willfully removing the systems default restrictions of WUs active per host and thus only contribute to exaggerate the whole issue.
It was two years ago already established that the bottleneck here is the number of concurrent connections to the back end database servers. And yet another disk failing doesn't help either, but that's life.

As folks don't seem to be willing to restrict themselves in this situation to the default restrictions per host, the WCG team should look into introducing a way for a hard limit of concurrent WUs, that can't be modified by the "volunteer", at least until a better solution on the back end is found and implemented.

Ralf

[Nov 7, 2024 4:59:06 PM]

AgrFan
Senior Cruncher
USA
Joined: Apr 17, 2008
Post Count: 397
Status: Offline
Project Badges:

90 day badge for Discovering Dengue Drugs - Together

90 day badge for The Clean Energy Project

90 day badge for Influenza Antiviral Drug Search

1 year badge for Discovering Dengue Drugs - Together - Phase 2

10 year badge for The Clean Energy Project - Phase 2

5 year badge for Drug Search for Leishmaniasis

5 year badge for GO Fight Against Malaria

10 year badge for Uncovering Genome Mysteries

10 year badge for Outsmart Ebola Together

10 year badge for FightAIDS@Home - Phase 2

10 year badge for Smash Childhood Cancer

20 year badge for Microbiome Immunity Project


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

As folks don't seem to be willing to restrict themselves in this situation to the default restrictions per host, the WCG team should look into introducing a way for a hard limit of concurrent WUs, that can't be modified by the "volunteer", at least until a better solution on the back end is found and implemented.

Ralf

I seem to remember one WU per thread was the hard limit for the Clean Energy - Phase 2 project. CEP2 had one large file per WU and the application unzipped the files before it started to run. There were bandwidth restrictions also. Lots of technical information can be found in the CEP2 forum.

----------------------------------------

i5-10400 (Comet Lake, 6C/12T) @ 2.9 GHz
i5-7400 (Kaby Lake, 4C/4T) @ 3.0 GHz
i5-4590 (Haswell, 4C/4T) @ 3.3 GHz
i5-3330 (Ivy Bridge, 4C/4T) @ 3.0 GHz

----------------------------------------
[Edit 7 times, last edit by AgrFan at Nov 7, 2024 6:54:37 PM]

[Nov 7, 2024 6:39:30 PM]

ericinboston
Senior Cruncher
Joined: Jan 12, 2010
Post Count: 265
Status: Offline
Project Badges:

20 year badge for Help Fight Childhood Cancer

2 year badge for The Clean Energy Project - Phase 2

14 day badge for Computing for Clean Water

100 year badge for Smash Childhood Cancer


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

Can I ask for a little clarity here, please?

1)What is ARP1 and why is it affecting downloads for Mapping Cancer Makers (I assume this is what MCM1 means).

2)Although I am quite technical on many levels, I don't understand a lot of this WCG-specific technical post from savas and thus I don't have any expectations of when things will be back to normal for MCM WUs. Can someone please take it up a notch and maybe give a short, technical answer regarding this problem? For example, someone might say "Our systems run in the Cloud at SHARCNET. A few days ago the hard drive failed on a particular box which caused ______. We replaced the drive and WUs are being sent out as of 4:05PM ET Nov 6, 2024 in normal fashion. You may need to wait up to 48 hours to receive MCM WUs due to high demand." This level of detail/verbiage would be much appreciated for typical outages.

3)With all due respect, it's been 24 hours since savas posted the problem and implying the fix has been implemented (as far as I can tell). So why 24 hours later are my 10 machines not receiving WUs? I ask this again for clarity on expectations of when things will be back to normal for us volunteers. If a fix has not been implemented, can you please set our expectations of when it will be?

Thanks!

----------------------------------------

[Nov 7, 2024 7:25:37 PM]

Freewill
Advanced Cruncher
United States
Joined: Mar 28, 2006
Post Count: 50
Status: Offline
Project Badges:

45 day badge for Human Proteome Folding - Phase 2

180 day badge for Nutritious Rice for the World

14 day badge for Microbiome Immunity Project

1 year badge for Africa Rainfall Project


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

ARP jamming up itself and MCM is the same issue, as far as I can tell, that we had last time ARP was active more than a year ago. Was this anticipated? Did the IT team try to do something before the restart?

This recent update on efforts to address is greatly appreciated. I cannot however see it has improved the situation, at least for my PCs.

[Nov 7, 2024 7:57:39 PM]

imakuni
Advanced Cruncher
Joined: Jun 11, 2009
Post Count: 105
Status: Offline
Project Badges:

90 day badge for Help Fight Childhood Cancer

45 day badge for Help Cure Muscular Dystrophy - Phase 2

1 year badge for The Clean Energy Project - Phase 2

45 day badge for Computing for Clean Water

14 day badge for Drug Search for Leishmaniasis

45 day badge for GO Fight Against Malaria

1 year badge for Outsmart Ebola Together

180 day badge for Microbiome Immunity Project

1 year badge for OpenPandemics - COVID-19


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

1)What is ARP1 and why is it affecting downloads for Mapping Cancer Makers (I assume this is what MCM1 means)

Look through the list of projects and truncate the names to the first letter. Yes, MCM stands for "Mapping Cancer Makers (phase 1)", and ARP stands for "Africa Rainfall Project (Phase 1)".

2)Although I am quite technical on many levels, I don't understand a lot of this WCG-specific technical post from savas and thus I don't have any expectations of when things will be back to normal for MCM WUs.

MCM will get back on track when ARP stops being sent out.

In theory they could make it work, but in practice they have proven time and again to be incapable of doing so.

Can someone please take it up a notch and maybe give a short, technical answer regarding this problem? For example, someone might say "Our systems run in the Cloud at SHARCNET. A few days ago the hard drive failed on a particular box which caused ______. We replaced the drive and WUs are being sent out as of 4:05PM ET Nov 6, 2024 in normal fashion. You may need to wait up to 48 hours to receive MCM WUs due to high demand." This level of detail/verbiage would be much appreciated for typical outages

Here's a breakdown from what I gather.
-A drive was bad, the server is now on new hardware.
-If a computer requests work and has ARP and MCM selected, the server "randomly" sends either one. Now the odds of sending ARP are lower.
-When a computer requests a connection, the server tries to establish it for longer before giving up.
-Each computer can't transfer as many files at once. Say I could be transferring up to 10; now I can only do 5 at a time, and need to wait those to finish before I can start transferring the next one.
-The server must stock some units to send to people when they request work. In the future, they MIGHT have less of of them available for delivery at a given time; think of it like a supermarket having 10 crates of milk for sale rather than 100.
-More hardware is coming.
-Better software to handle communications is coming.
-If you fail to transfer a file, you can retry sooner (say, wait 10min rather than 1h).
-The deadline to complete any given piece of work has been extended. You now have about a week rather than a couple days.

3)With all due respect, it's been 24 hours since savas posted the problem and implying the fix has been implemented (as far as I can tell). So why 24 hours later are my 10 machines not receiving WUs? I ask this again for clarity on expectations of when things will be back to normal for us volunteers. If a fix has not been implemented, can you please set our expectations of when it will be?

Yeah, see, here's the thing: they definitely implemented changes, and we can clearly see that... but the effect is that people that don't micro manage their tasks still can't get any work at a reasonable pace (so no change there), whereas people that do are now stuck in the same boat as people that don't (which is to say, they can't get any work either).

TLDR, they're gaslighting everyone, including themselves.

----------------------------------------

Want to have an image of yourself like this on? Check this thread: https://secure.worldcommunitygrid.org/forums/wcg/viewthread_thread,29840

[Nov 7, 2024 8:22:55 PM]

Link64
Senior Cruncher
Joined: Feb 19, 2021
Post Count: 206
Status: Offline
Project Badges:

14 day badge for OpenPandemics - COVID-19


Re: Regarding ARP1 and MCM1 download issues since ARP1's launch on Monday Nov 4th, 2024

We have decreased the app weight of ARP1 relative to MCM1 in the feeder

You need to decrease it even further, so far not the slightest improvement is noticeable at our ends. Eventally stop the feeder for a while until most downloads complete, than start it again with lower weight of ARP.

----------------------------------------

[Nov 8, 2024 11:34:46 AM]

[ ]