Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 11
Posts: 11   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3316 times and has 10 replies Next Thread
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12594
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Result lost on last outage

I had a result uploaded just before the outage.

The server was out when reporting the completed task but 4 hours later it succeeded in reporting the task.

However, it still shows up as "In Progress" in my Results Status report.

Could someone please sort this out?

I attach a copy of my events log for the relevant period.

Mike

06/03/2012 19:11:50 | World Community Grid | Computation for task SN2S_AAB68721_0000033_0073_0 finished
06/03/2012 19:11:50 | World Community Grid | Starting task E206480_811_C.25.C21H12N2OS.02048312.4.set1d06_0 using cep2 version 640
06/03/2012 19:12:19 | World Community Grid | Started upload of SN2S_AAB68721_0000033_0073_0_0
06/03/2012 19:12:27 | World Community Grid | Finished upload of SN2S_AAB68721_0000033_0073_0_0
06/03/2012 19:12:29 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 19:12:29 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 19:12:47 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 19:14:38 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 19:14:38 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 19:14:42 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 19:16:48 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 19:16:48 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 19:16:51 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 19:24:42 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 19:24:42 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 19:24:46 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 19:34:36 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 19:34:36 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 19:34:40 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 19:52:16 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 19:52:16 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 19:52:20 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 20:37:13 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 20:37:13 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 20:37:17 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 21:52:27 | World Community Grid | Sending scheduler request: To report completed tasks.
06/03/2012 21:52:27 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 21:52:32 | World Community Grid | Scheduler request failed: HTTP internal server error
06/03/2012 23:10:08 | World Community Grid | update requested by user
06/03/2012 23:10:12 | World Community Grid | Sending scheduler request: Requested by user.
06/03/2012 23:10:12 | World Community Grid | Reporting 1 completed tasks, not requesting new tasks
06/03/2012 23:10:22 | World Community Grid | Scheduler request completed
[Mar 8, 2012 8:26:10 PM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

Mike,

I have added a post to the end of this thread: https://secure.worldcommunitygrid.org/forums/...32678_lastpage,yes#367931

that discusses the issue.
[Mar 8, 2012 9:49:52 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12594
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

Thanks, knreed.

I had assumed that it was a Schistosoma problem rather than the wider WCG.

I just needed that result for my Bronze!

Mike
[Mar 8, 2012 9:57:13 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12594
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

One further thought.

As the task had successfully uploaded, is that not sufficient to avoid re-sending the task? If it is then it would be a waste of crunching time to do it again.

Mike
[Mar 8, 2012 10:03:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

Hi Mike.Gibson,
The explanation knreed posted says that the result was lost on the server. So we will have to recrunch it.

Lawrence
[Mar 8, 2012 10:30:16 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12594
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

Hi, Lawrence.

Yes, I read that, but I was unsure what it actually meant.

There are 2 stages so far as I can see at the end of the task. The first is the Uploading and the second is the Reporting.

Does it mean that the Uploading has been lost or the Reporting or have both gone?

I have never seen an explanation as to what happens when. I was just wondering if the Uploading contained the information needed and was still there. If so, it should be retrievable. Likewise anyone else's "orphan" Uploads from that period.

Mike
[Mar 8, 2012 11:01:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

Hi Mike.Gibson,
I am a bit uncertain. A while back, knreed said that he had found 2 missing results when he first started investigating this. Most recently, he said that he would resend the work units with missing results. To me, that sounds as though there is no standardized automated way to retrieve them from where they were misfiled, but I don't know.

But I suspect that knreed is taking the safest course of action by resending them. There are enough other problems that have to be fixed quickly.

Lawrence
[Mar 9, 2012 12:23:17 AM]   Link   Report threatening or abusive post: please login first  Go to top 
knreed
Former World Community Grid Tech
Joined: Nov 8, 2004
Post Count: 4504
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

They have to be resent. It is possible that we could have written a script to navigate through all of the uploaded files on the server and matched uploaded files with results that were still in the database as 'in progress'. However, there would be no way to differentiate between results that were marked in progress due to this issue vs results that had simply been uploaded and simply not yet reported. Additionally, by the time the script as ready, tested and running, many results would have already been re-issued. As a result the choice was between possibly creating a bigger issue vs letting the system run its course and fix itself. Given that we had fixed the original issue and there were other issues outstanding, we choose the 2nd option.

We are not happy that this issue happened. We take great pride in running this system efficiently and correctly. Most of the time we are successful with this. Unfortunately, this time we did not meet yours or our own expectations. For that we are very sorry.
[Mar 9, 2012 1:54:33 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

Kevin, don't beat yourself to death......you and the other Techies are doing a great job......think of the billions of SUCCESSFUL WU's, do not dwell on the odd missing ones..... biggrin
[Mar 9, 2012 2:09:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
nanoprobe
Master Cruncher
Classified
Joined: Aug 29, 2008
Post Count: 2998
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Result lost on last outage

Kevin, don't beat yourself to death......you and the other Techies are doing a great job......think of the billions of SUCCESSFUL WU's, do not dwell on the odd missing ones..... biggrin

^^^^^What he said^^^^^ applause
----------------------------------------
In 1969 I took an oath to defend and protect the U S Constitution against all enemies, both foreign and Domestic. There was no expiration date.


[Mar 9, 2012 3:55:06 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 11   Pages: 2   [ 1 2 | Next Page ]
[ Jump to Last Post ]
Post new Thread