Thread Status: Active
Total posts in this thread: 6
This topic has been viewed 1386 times and has 5 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Early terminated Work Unit

After a couple of normal runs of CEP work units that were granted full credit, I got a WU that terminated early.

Result Name | Status | Sent Time | Time Due / Return Time | CPU Time (hours) | Claimed / Granted BOINC Credit | Note
E000044_ 418A_ 00055e00u_ 0-- | Valid | 12/19/08 14:00:39 | 12/21/08 00:56:35 | 7.34 | 49.0 / 49.0 | --> mine
E000044_ 418A_ 00055e00u_ 1-- | Valid | 12/19/08 13:58:17 | 12/21/08 14:38:28 | 37.63 | 491.1 / 49.0 | --> other's

The log file looks as follows.
<core_client_version>6.2.19</core_client_version>
<![CDATA[
<stderr_txt>
Calling initGraphics()
INFO: No state to restore. Start from the beginning.
[ERROR] Failed to open either source or destination files while copying wcgrestart.rst to ../../projects/www.worldcommunitygrid.org/E000044_418A_00055e00u_0_3. Error: 2
####Message = NORMAL STOP
called boinc_finish

</stderr_txt>
]]>
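
A side note on the "Error: 2" in the log above. The log does not say which error numbering it uses, so this is only an assumption: if it is a C-library errno value, 2 is ENOENT ("No such file or directory"), and on Windows system error code 2 likewise means the file was not found. A minimal Python sketch for looking up such a code (describe_error is a hypothetical helper name, not part of BOINC or the science application):

```python
import errno
import os


def describe_error(code: int) -> str:
    """Map a numeric errno value to its symbolic name and the OS message.

    Assumption: the code follows C-library errno numbering.
    """
    name = errno.errorcode.get(code, "UNKNOWN")
    return f"{name}: {os.strerror(code)}"


# Code 2 is taken from the "Error: 2" line in the stderr output above.
print(describe_error(2))
```

Either reading points to a source or destination path that could not be found, which fits the "Failed to open either source or destination files" message.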


While I feel guilty for causing the other cruncher to be given very low credit for his/her 37+ hours, I would like to share this information in case it helps solve the current bug.

From the observations above, we can conclude that:
1. there is a bug that truncates a file name
2. the mechanism for detecting early termination failed to mark this WU as an error
3. the mirror WU completed without error, which raises an interesting question: why did the file name truncation not happen in both copies of the WU?
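
To illustrate point 2, here is a rough sketch of the kind of sanity check that could flag a pair like the one above. This is not how the World Community Grid validator works (I have no knowledge of its internals); the Result class, the looks_early_terminated function and the threshold of 3.0 are all made up for illustration. Hosts differ in speed, so a CPU-time ratio alone can only be a hint, never proof.

```python
from dataclasses import dataclass


@dataclass
class Result:
    name: str
    cpu_hours: float
    claimed_credit: float


def looks_early_terminated(results, ratio_threshold=3.0):
    """Return results whose CPU time is far below the longest run in the quorum.

    Hypothetical heuristic: a result is flagged when the longest CPU time
    is at least ratio_threshold times its own CPU time.
    """
    if not results:
        return []
    longest = max(r.cpu_hours for r in results)
    return [
        r for r in results
        if r.cpu_hours > 0 and longest / r.cpu_hours >= ratio_threshold
    ]


# CPU times and claimed credit taken from the two results listed above.
quorum = [
    Result("E000044_418A_00055e00u_0", 7.34, 49.0),
    Result("E000044_418A_00055e00u_1", 37.63, 491.1),
]

for r in looks_early_terminated(quorum):
    print(f"{r.name}: {r.cpu_hours} h looks suspiciously short")
```

With the numbers above, the 7.34-hour result is roughly five times shorter than its 37.63-hour mirror, so only the first result is flagged.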

Please let me know if further information is required.

Regards, Irwan
[Dec 22, 2008 3:03:29 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Re: Early terminated Work Unit

Hi Irwan!
If you really want to feel guilty, it should only be for not having noticed this thread (among a few others), "Inconclusive work units", where many similar cases have been reported. :)

Your conclusions are a good summary of what we can see. I hope that the techs will soon find a solution to this situation.
Meanwhile, it seems that they have suspended distribution of this project, which suggests that they have enough information but no solution yet.

Thank you for reporting. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
[Dec 22, 2008 5:05:33 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Early terminated Work Unit

Hi Jean,
I should have checked for similar cases beforehand. :(
Anyway, it's good to know that the issues are in the techs' capable hands. :)

Cheers,
Irwan
[Dec 22, 2008 9:27:41 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Early terminated Work Unit

"Meanwhile, it seems that they have suspended distribution of this project, which suggests that they have enough information but no solution yet."


If work on this project has been suspended, shouldn't there be an announcement? That would be good news.

However, it seems I'm still getting new work today, as my results log shows:

E000033_ 412A_ 00043y00s_ 2-- | Garrideb | In Progress | 12/22/08 14:02:00 | 12/23/08 23:38:00 | 0.00 | 0.0 / 0.0
E000026_ 574A_ 000359010_ 3-- | Garrideb | In Progress | 12/22/08 12:03:39 | 12/24/08 02:56:31 | 0.00 | 0.0 / 0.0
E000044_ 585A_ 00055i00t_ 2-- | Garrideb | In Progress | 12/22/08 09:51:10 | 12/24/08 19:27:10 | 0.00 | 0.0 / 0.0
[Dec 22, 2008 2:45:50 PM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3716
Status: Offline
Re: Early terminated Work Unit

steveleg, what you have got is "repair WUs", i.e. new distributions of WUs already distributed. Distribution of really new work was stopped even before the current problem with the file system.

Cheers. Jean.
[Dec 23, 2008 5:09:49 AM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Early terminated Work Unit

Jean,

You are correct that members are seeing repair work units. However, we have not stopped sending out new work units, though we have put this project on a slow crawl. I tried to explain this the best I could in a previous post. Basically, spots in the queue are shared between repair jobs and new work units for CEP. The project as a whole is running at the lowest priority on World Community Grid until we are able to fix the work units and charmm with the researchers' help. They have some of the initial results right now.

-Uplinger
[Dec 23, 2008 3:26:04 PM]