| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 15
|
|
| Author |
|
|
bieberj
Senior Cruncher United States Joined: Dec 2, 2004 Post Count: 406 Status: Offline Project Badges:
|
Here is a snapshot:
4/1/2008 9:21:35 AM||Resuming computation 4/1/2008 11:13:41 AM|World Community Grid|Sending scheduler request: To fetch work. Requesting 12 seconds of work, reporting 1 completed tasks 4/1/2008 11:13:46 AM|World Community Grid|Scheduler request succeeded: got 1 new tasks 4/1/2008 11:13:48 AM|World Community Grid|Started download of lq460-469_lq463.fasta.gz 4/1/2008 11:13:48 AM|World Community Grid|Started download of lq460-469_lq463.psipred.gz 4/1/2008 11:13:49 AM|World Community Grid|Finished download of lq460-469_lq463.fasta.gz 4/1/2008 11:13:49 AM|World Community Grid|Finished download of lq460-469_lq463.psipred.gz 4/1/2008 11:13:49 AM|World Community Grid|Started download of lq460-469_lq463.psipred_ss2.gz 4/1/2008 11:13:49 AM|World Community Grid|Started download of lq460-469_aalq46303_05.075_v1_3.gz 4/1/2008 11:13:50 AM|World Community Grid|Finished download of lq460-469_lq463.psipred_ss2.gz 4/1/2008 11:13:50 AM|World Community Grid|Started download of lq460-469_aalq46309_05.075_v1_3.gz 4/1/2008 11:13:51 AM|World Community Grid|Finished download of lq460-469_aalq46303_05.075_v1_3.gz 4/1/2008 11:13:53 AM|World Community Grid|Finished download of lq460-469_aalq46309_05.075_v1_3.gz 4/1/2008 12:15:03 PM|World Community Grid|Computation for task X0000044090607200412311457_0 finished 4/1/2008 12:15:03 PM|World Community Grid|Starting faah3390_ZINC03953869_xMut_md01880_01_0 4/1/2008 12:15:04 PM|World Community Grid|Starting task faah3390_ZINC03953869_xMut_md01880_01_0 using faah version 542 4/1/2008 12:15:06 PM|World Community Grid|Started upload of X0000044090607200412311457_0_0 4/1/2008 12:15:10 PM|World Community Grid|Giving up on upload of X0000044090607200412311457_0_0: file not found Is this something I should be concerned about? This job is currently pending validation waiting for its partner to finish and upload. JB |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
Applying reverse logic, if it is Pending Validation (?) for your result on the Result Status page, the file must have already uploaded. Because it was, BOINC eventually gives up trying again if there was a glitch to tell the client that in fact the server received the file properly.
----------------------------------------So is yours in PV status? If so, no concerns! ttyl
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Given the times, and the fact that the task has neither uploaded nor reported - I think that the other copy completed and is waiting for this one.
bieberj, your copy is highlighted in orange. The upload may try again - check the transfer tab. |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
I just got the same error on a result I just uploaded:
01/04/2008 17:35:58|World Community Grid|Computation for task faah3384_ZINC01744753_xMut_md01820_01_0 finished 01/04/2008 17:35:59|World Community Grid|Starting X0000044171106200412130852_0 01/04/2008 17:36:00|World Community Grid|Starting task X0000044171106200412130852_0 using hcc1 version 520 01/04/2008 17:36:01|World Community Grid|Started upload of faah3384_ZINC01744753_xMut_md01820_01_0_0 01/04/2008 17:36:01|World Community Grid|Started upload of faah3384_ZINC01744753_xMut_md01820_01_0_1 01/04/2008 17:36:04|World Community Grid|Giving up on upload of faah3384_ZINC01744753_xMut_md01820_01_0_1: file not found 01/04/2008 17:36:05|World Community Grid|Finished upload of faah3384_ZINC01744753_xMut_md01820_01_0_0 01/04/2008 19:12:30|World Community Grid|Sending scheduler request: Requested by user. Requesting 0 seconds of work, reporting 1 completed tasks Looking at the Results status, mine is in an error state and another work unit is awaiting to be sent out. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
I just checked my logs but all uploaded proper for DDDT & HCC jobs, after the servers came back from the maintenance cycle. Not had any FAAH completing, so cant tell if things work from down here.
----------------------------------------As for Didactylos observation of the time span, 2 seconds is indeed very short. My crawler takes 12 seconds to start an upload and receive confirmation for a HCC job.
WCG
Please help to make the Forums an enjoyable experience for All! |
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
BOINC should have been more patient then it was. I'm looking at why the client marked the upload as a permanent failure so quickly.
----------------------------------------This is 100% related to the maintenance we just performed. However, with an outage of only a little over an hour, BOINC should have patiently kept retrying to upload the result, waiting a little bit longer each time before retrying. There are 841 results that had this valid upload error due to this maintenance. We are still seeing a slow trickle of clients reporting the upload failure. [Edit 1 times, last edit by knreed at Apr 1, 2008 7:27:47 PM] |
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
It appears that in the future BOINC 6 client, the software will wait as I expected it would. Unfortunately, for the BOINC 5.10.45 clients and earlier, the file uploads were immediately marked as an error.
I apologize for the problems this has caused. |
||
|
|
Sekerob
Ace Cruncher Joined: Jul 24, 2005 Post Count: 20043 Status: Offline |
This is not boding well or not understood entirely by me. It implies the users should take their client off-line for any maintenance period to avoid loosing any results.... but how if the outage is involuntary? It is though in all the past, the clients would just keep whirring along or in more distant past had to be manually helped by e.g. hitting the Retry Now button.... so is this something that is specific to a particular client / server version or is it now suddenly all versions below 6 ?
----------------------------------------[Edit: Seems the culprit file_upload_handler was taking off-line for 1st time in years causing this problem to arise. See knreed post below]
WCG
----------------------------------------Please help to make the Forums an enjoyable experience for All! [Edit 2 times, last edit by Sekerob at Apr 2, 2008 7:07:47 AM] |
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Just checked and seems I have fallen foul of this issue too on several of my machines that tried to upload around the same time:
![]() All the clients show the same error in the messages: 01/04/2008 17:38:45|World Community Grid|Started upload of X0000044080914200412101233_1_0 01/04/2008 17:38:51|World Community Grid|Giving up on upload of X0000044080914200412101233_1_0: file not found WCG Boinc 5.10.30 |
||
|
|
knreed
Former World Community Grid Tech Joined: Nov 8, 2004 Post Count: 4504 Status: Offline Project Badges:
|
We haven't had to take the file_upload_handler offline in a couple of year (it has no database access - it just writes the file to the filesystem). It turns out that I was unaware of the proper way to take the file_upload_handler offline. I should have done what is described on this page: http://boinc.berkeley.edu/trac/wiki/StartTool
----------------------------------------So in the future we will use this technique which will allow the agents to behave as I had originally expected them to. [Edit 3 times, last edit by knreed at Apr 2, 2008 12:50:06 AM] |
||
|
|
|