Thread Status: Active
Total posts in this thread: 781
Posts: 781   Pages: 79   [ Previous Page | 14 15 16 17 18 19 20 21 22 23 | Next Page ]
This topic has been viewed 943777 times and has 780 replies
KoopaTroopa
Cruncher
Joined: May 22, 2016
Post Count: 8
Re: OpenPandemics - GPU Stress Test

I have this in my app_config.xml for a 3090 and an Intel i9-10850 and the GPU is still running just a little light. BOINC is writing an average of about 80 MB/s to my drive.

<app_config>
    <app>
        <name>opng</name>
        <gpu_versions>
            <gpu_usage>0.1</gpu_usage>
            <cpu_usage>1</cpu_usage>
        </gpu_versions>
    </app>
    <app_version>
        <app_name>opng</app_name>
        <plan_class>opencl_intel_gpu_102</plan_class>
        <ngpus>1</ngpus>
    </app_version>
    <report_results_immediately/>
</app_config>
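As a rough guide to what a config like this asks for: `<gpu_usage>` is the fraction of one GPU that each task claims, so 0.1 lets BOINC schedule up to ten opng tasks on a single GPU, and `<cpu_usage>1</cpu_usage>` budgets one CPU thread for each of them. A minimal sketch of that arithmetic follows; the helper name `tasks_per_gpu` is made up for illustration and is not part of BOINC.

```shell
#!/bin/sh
# Hypothetical helper (not a BOINC tool): how many tasks may share one GPU
# for a given <gpu_usage> value, since each task claims that fraction.
tasks_per_gpu() {
    awk -v usage="$1" 'BEGIN {
        if (usage + 0 <= 0) {
            print "gpu_usage must be positive" > "/dev/stderr"
            exit 1
        }
        # The small epsilon guards against results such as 9.9999999.
        printf "%d\n", int(1 / usage + 1e-9)
    }'
}

# The value from the config above.
tasks_per_gpu 0.1
```

With `<cpu_usage>1</cpu_usage>`, the same number is also the count of CPU threads those tasks reserve.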
[Apr 27, 2021 3:58:34 AM]
Blount
Veteran Cruncher
Joined: Aug 19, 2005
Post Count: 590
Re: OpenPandemics - GPU Stress Test

This is a new one for me:
4/26/2021 9:03:41 PM | World Community Grid | Backing off 00:06:12 on download of mip1.MIP1_00332548.1
4/26/2021 9:03:47 PM | World Community Grid | Started download of mip1.MIP1_00332548.1
4/26/2021 9:03:55 PM | World Community Grid | Finished download of mip1.MIP1_00332548.1
4/26/2021 9:03:55 PM | World Community Grid | [error] MD5 check failed for mip1.MIP1_00332548.1
4/26/2021 9:03:55 PM | World Community Grid | [error] expected 3f39e03f16125b74b01a887ce51accd9, got 471c4a5d903c973f712016d4b56061b8
4/26/2021 9:03:55 PM | World Community Grid | [error] Checksum or signature error for mip1.MIP1_00332548.1
4/26/2021 9:05:01 PM | World Community Grid | Computation for task MIP1_00332719_10452_0 finished
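The "expected ..., got ..." line in that log is the result of comparing the downloaded file's MD5 hash with the one the server announced. The same comparison can be repeated by hand, as in this sketch. It assumes GNU `md5sum` is installed; the function name `check_md5` is hypothetical, and the location of the file on disk is not shown in the log, so the usage line only illustrates the call.

```shell
#!/bin/sh
# Hypothetical helper: compare a file's MD5 hash with an expected value.
# Assumes GNU coreutils md5sum is available.
check_md5() {
    file=$1
    expected=$2
    # md5sum prints "<hash>  <filename>"; keep only the hash.
    actual=$(md5sum "$file" | awk '{print $1}')
    if [ "$actual" = "$expected" ]; then
        echo "OK: $file"
    else
        echo "MISMATCH for $file: expected $expected, got $actual"
        return 1
    fi
}

# Usage, run from the directory that holds the file:
#   check_md5 mip1.MIP1_00332548.1 3f39e03f16125b74b01a887ce51accd9
```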
[Apr 27, 2021 4:06:56 AM]
Grumpy Swede
Master Cruncher
Svíþjóð
Joined: Apr 10, 2020
Post Count: 2498
Re: OpenPandemics - GPU Stress Test

This is a new one for me:
4/26/2021 9:03:41 PM | World Community Grid | Backing off 00:06:12 on download of mip1.MIP1_00332548.1
4/26/2021 9:03:47 PM | World Community Grid | Started download of mip1.MIP1_00332548.1
4/26/2021 9:03:55 PM | World Community Grid | Finished download of mip1.MIP1_00332548.1
4/26/2021 9:03:55 PM | World Community Grid | [error] MD5 check failed for mip1.MIP1_00332548.1
4/26/2021 9:03:55 PM | World Community Grid | [error] expected 3f39e03f16125b74b01a887ce51accd9, got 471c4a5d903c973f712016d4b56061b8
4/26/2021 9:03:55 PM | World Community Grid | [error] Checksum or signature error for mip1.MIP1_00332548.1
4/26/2021 9:05:01 PM | World Community Grid | Computation for task MIP1_00332719_10452_0 finished

Please post that in the MIP sub forum. https://www.worldcommunitygrid.org/forums/wcg/listthreads?forum=760
This thread is about OpenPandemics - GPU Stress Test.
[Apr 27, 2021 4:11:22 AM]
spRocket
Senior Cruncher
Joined: Mar 25, 2020
Post Count: 280
Re: OpenPandemics - GPU Stress Test

Looks like there may be a little bit of trouble still. One of my Raspberry Pis (a Pi 400 that's been crunching for months) is having trouble downloading an OPN1 work unit (the only kind I've ever seen on a Pi). All attempts to force a transfer fail, yet uploads from my main cruncher are working fine. I guess I'll see tomorrow if that one ever makes it.
[Apr 27, 2021 4:17:36 AM]
Andrew80431
Cruncher
Joined: Nov 25, 2005
Post Count: 36
Re: OpenPandemics - GPU Stress Test

I got WUs from the start at around 20:09 UTC.
At 22:38 UTC all the WUs I had received so far had finished, and no new ones were being downloaded.
At 22:41 UTC I saw this in the log (times in the log are CEST, UTC+2):
4/27/2021 12:41:47 AM | World Community Grid | Sending scheduler request: Requested by user.
4/27/2021 12:41:47 AM | World Community Grid | Not requesting tasks: some download is stalled
4/27/2021 12:41:51 AM | World Community Grid | Scheduler request failed: Couldn't connect to server
4/27/2021 12:41:52 AM | | Project communication failed: attempting access to reference site
4/27/2021 12:41:54 AM | | Internet access OK - project servers may be temporarily down.
4/27/2021 12:46:46 AM | World Community Grid | update requested by user
4/27/2021 12:46:48 AM | World Community Grid | Sending scheduler request: Requested by user.
4/27/2021 12:46:48 AM | World Community Grid | Not requesting tasks: some download is stalled
4/27/2021 12:46:55 AM | World Community Grid | Scheduler request completed

I got some WUs at around 1:42 UTC, and the machine has been crunching again since 3:44 UTC.
But I still see:
4/27/2021 6:23:54 AM | World Community Grid | Not requesting tasks: some download is stalled
in the logs.
Clicking "Retry Now" on the Transfers tab seems to have fixed the stalled download. Let's see how it runs over the next day.
[Apr 27, 2021 4:31:49 AM]
Andrew80431
Cruncher
Joined: Nov 25, 2005
Post Count: 36
Re: OpenPandemics - GPU Stress Test

I had stalled downloads on my Linux machine as well.
Running:
boinccmd --get_file_transfers
showed this:
======== File transfers ========
1) -----------
name: 285f7cff99d04c96af1c6833cb6720e9.pdbqt
direction: download
sticky: no
xfer active: no
time_so_far: 7.059975
bytes_xferred: 107.000000
xfer_speed: 0.000000
So it was trying to download files, but none of the downloads was active.
You can check whether there are any active downloads with the two commands below.

Edited:
First, count the downloads that are not active:
cd /var/lib/boinc
boinccmd --get_file_transfers|grep -B2 "xfer active: no"|grep download|wc -l
Then count the downloads that are active:
boinccmd --get_file_transfers|grep -B2 "xfer active: yes"|grep download|wc -l
If the first number is greater than "0" and the second is "0", then it is quite possible that your downloads have stalled. (That is almost certainly the case if you have run out of OPNG WUs at this stage of the test.)
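The two counts can also be taken in a single pass over the transfer list. The following is only a sketch: the function names `count_downloads` and `downloads_stalled` are made up here, and the parsing assumes the field labels shown in the sample output above (`direction:` and `xfer active:`).

```shell
#!/bin/sh
# Hypothetical helper: reads `boinccmd --get_file_transfers` output on stdin
# and prints "<active> <inactive>" counts for downloads only.
count_downloads() {
    awk '
        /direction: download/ { is_download = 1 }
        /direction: upload/   { is_download = 0 }
        /xfer active: yes/ && is_download { active++ }
        /xfer active: no/  && is_download { inactive++ }
        END { printf "%d %d\n", active, inactive }
    '
}

# Succeeds (exit status 0) when there are pending downloads but none is active.
downloads_stalled() {
    set -- $(count_downloads)
    [ "$1" -eq 0 ] && [ "$2" -gt 0 ]
}

# Usage, run where boinccmd can authenticate (e.g. /var/lib/boinc):
#   boinccmd --get_file_transfers | count_downloads
#   boinccmd --get_file_transfers | downloads_stalled && echo "downloads look stalled"
```

Uploads are ignored on purpose, so a busy upload queue does not hide a stalled download queue.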

Running this will get your downloads going again:
cd /var/lib/boinc
for dl in $(boinccmd --get_file_transfers|awk '/name:/{print $2}'); do boinccmd --file_transfer "http://www.worldcommunitygrid.org/" "$dl" retry; done

----------------------------------------
[Edit 5 times, last edit by Andrew80431 at Apr 27, 2021 11:22:56 AM]
[Apr 27, 2021 4:47:41 AM]
mdxi
Advanced Cruncher
Joined: Dec 6, 2017
Post Count: 109
Re: OpenPandemics - GPU Stress Test

I had two machines with stuck WUs (my fault; a bad library got reinstalled). I cleared that up, and now all machines with GPUs have crunched at least one OPNG WU in the past 3 hours, so I'm on board.
[Apr 27, 2021 5:57:41 AM]
SD Surfer
Advanced Cruncher
Joined: Nov 22, 2005
Post Count: 56
Re: OpenPandemics - GPU Stress Test

4/26/2021 11:02:29 PM | World Community Grid | Temporarily failed upload of OPNG_0007328_00025_0_r1114215537_1: transient HTTP error
4/26/2021 11:02:29 PM | World Community Grid | Backing off 00:04:56 on upload of OPNG_0007328_00025_0_r1114215537_1

And then they eventually upload. S l o w l y.
----------------------------------------
1 x AMD Ryzen 3950x 16c/32t
Various Androids
[Apr 27, 2021 6:06:33 AM]
Andyman
Cruncher
Joined: Apr 9, 2021
Post Count: 17
Re: OpenPandemics - GPU Stress Test

4/26/2021 11:02:29 PM | World Community Grid | Temporarily failed upload of OPNG_0007328_00025_0_r1114215537_1: transient HTTP error
4/26/2021 11:02:29 PM | World Community Grid | Backing off 00:04:56 on upload of OPNG_0007328_00025_0_r1114215537_1

And then they eventually upload. S l o w l y.

Yeah, the system seems pretty strained at the moment: uploads are slow and stalling. Even the site is slow. But I guess that's the purpose of the test, to see how far it can be pushed.
[Apr 27, 2021 6:35:15 AM]
adriverhoef
Master Cruncher
The Netherlands
Joined: Apr 3, 2009
Post Count: 2346
Re: OpenPandemics - GPU Stress Test

I had stalled downloads on my Linux machine as well.
Running:
boinccmd --get_file_transfers
showed this:
======== File transfers ========
1) -----------
name: 285f7cff99d04c96af1c6833cb6720e9.pdbqt
direction: download
sticky: no
xfer active: no
time_so_far: 7.059975
bytes_xferred: 107.000000
xfer_speed: 0.000000
So it was trying to download files, but none of the downloads was active.
You can check whether or not there are any active downloads by running this:
cd /var/lib/boinc
boinccmd --get_file_transfers|grep "xfer active: yes"|wc -l
If the output is "0" then it is quite possible that your downloads have stalled. (That's most certainly the case when you ran out of OPNG WUs at this stage of the test.)

Running this, will get your downloads to continue:
cd /var/lib/boinc
for dl in $(boinccmd --get_file_transfers|awk '/name:/{print $2}'); do boinccmd --file_transfer "http://www.worldcommunitygrid.org/" $dl retry; done

Sounds about right, although perhaps you don't want to retry a file transfer that is already active.
One could try using wcgresults -x, then rinse and repeat.
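To leave active transfers alone, the retry loop can be limited to downloads that report "xfer active: no". This is a sketch under assumptions rather than a tested recipe: the function names are made up, the parsing relies on the field labels shown in the quoted output, and the project URL is the one used in the quoted loop.

```shell
#!/bin/sh
# Hypothetical helper: reads `boinccmd --get_file_transfers` output on stdin
# and prints the names of downloads that are not currently active.
inactive_downloads() {
    awk '
        $1 == "name:"         { name = $2 }
        /direction: download/ { is_download = 1 }
        /direction: upload/   { is_download = 0 }
        /xfer active: no/ && is_download { print name }
    '
}

# Retry only those downloads; active transfers and uploads are not touched.
retry_inactive_downloads() {
    project_url="http://www.worldcommunitygrid.org/"
    boinccmd --get_file_transfers | inactive_downloads |
    while read -r dl; do
        boinccmd --file_transfer "$project_url" "$dl" retry
    done
}

# Usage, run where boinccmd can authenticate (e.g. /var/lib/boinc):
#   retry_inactive_downloads
```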
----------------------------------------
[Edit 1 times, last edit by adriverhoef at Apr 27, 2021 8:09:40 AM]
[Apr 27, 2021 7:02:03 AM]