Thread Status: Active
Total posts in this thread: 13
This topic has been viewed 2135 times and has 12 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
more errors

The last two CEP units I downloaded just don't advance. After 40 minutes of computing, they are still at 0% progress, and the remaining time is the same as at the beginning.

I aborted them and will try other ones...
[Apr 19, 2009 11:25:45 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: more errors

Hello Legrandpiou

I had the same thing; see my post, "Is this Normal".

My CEP went 1 hour before showing 0.50 % progress.

Didactylos told me to give it a day before starting to worry about it.

It eventually picked up steam and finished in about 18 hours, well short of the day Didactylos suggested I wait.
----------------------------------------
[Edit 2 times, last edit by Former Member at Apr 20, 2009 1:50:00 AM]
[Apr 20, 2009 1:49:04 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3715
Status: Offline
Re: more errors

All CEP WUs start with a rather long phase during which the percentage done shows 0 %.
This is normal and you just need to let the computation run.
It is not an error, only an inconvenience, and you should not worry.
If your computer is rebooted frequently, the more important point is that the first checkpoint for CEP WUs is taken only after this initialization phase (precisely at 1.6 % for the WUs currently distributed), so try not to stop your machine before that point has been reached.

Cheers. Jean.
----------------------------------------
Team--> Decrypthon -->Statistics/Join -->Thread
----------------------------------------
[Edit 1 times, last edit by JmBoullier at Apr 20, 2009 6:28:30 AM]
[Apr 20, 2009 6:27:15 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: more errors

Sorry, I reacted too fast (!).

After having read the post of djeto, I downloaded another unit of CEP. After 2 hours and 50 minutes, it was at 0.11% progress... but it had moved...

I read the forums regularly and had seen no post about this particular point. But now I'm up to date.

Thanks all for your help.
[Apr 20, 2009 9:04:19 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3715
Status: Offline
Re: more errors

Sorry, I reacted too fast (!).

After having read the post of djeto, I downloaded another unit of CEP. After 2 hours and 50 minutes, it was at 0.11% progress... but it had moved...

You are excused, this one was very long! :-)
Either this machine is very slow, or you have got a very long WU.
If it helps for this particular one, checkpoints are currently taken at each multiple of 1.6 % (1.6, 3.2, 4.8, etc.) for CEP WUs.
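
As a rough illustration of that checkpoint spacing, here is a minimal Python sketch. It assumes that checkpoints fall exactly on multiples of 1.6 % and that a restarted WU resumes from the last checkpoint it passed; the function names are made up for this example and are not part of BOINC or the CEP application.

```python
from fractions import Fraction

# Assumed checkpoint spacing, taken from the 1.6 % figure quoted above.
CHECKPOINT_STEP = Fraction("1.6")


def last_checkpoint(progress_percent: str) -> Fraction:
    """Return the highest multiple of CHECKPOINT_STEP not above the given progress."""
    progress = Fraction(progress_percent)
    completed_steps = progress // CHECKPOINT_STEP  # whole checkpoints already passed
    return completed_steps * CHECKPOINT_STEP


def work_lost_on_restart(progress_percent: str) -> Fraction:
    """Percentage points that would have to be recomputed after a restart."""
    return Fraction(progress_percent) - last_checkpoint(progress_percent)


if __name__ == "__main__":
    for progress in ("0.11", "1.5", "16.6"):
        print(progress, float(last_checkpoint(progress)),
              float(work_lost_on_restart(progress)))
```

Under these assumptions, a WU stopped at 16.6 % would resume at 16.0 %, while one stopped anywhere below 1.6 % would have no checkpoint to resume from.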

Cheers. Jean.
[Apr 20, 2009 9:47:27 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: more errors

Yeah, lots and lots of CEP computational errors. I didn't want to, but I aborted the rest of the work units I had. Does anyone know the reason for this?
[Apr 20, 2009 11:02:37 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: more errors

It seems to work but...

I had to restart my computer. The CEP unit was at 16.6%.

It restarted at 16%, as I expected, and five minutes later it was... restarting at 0%.

What a waste of time!
[Apr 20, 2009 11:32:30 AM]
JmBoullier
Former Community Advisor
Normandy - France
Joined: Jan 26, 2007
Post Count: 3715
Status: Offline
Re: more errors

I had to restart my computer. The CEP unit was at 16.6%.
It restarted at 16%, as I expected, and five minutes later it was... restarting at 0%.

The restart at 16 % is normal; the next one at 0 % looks very strange and unusual.
I hope you have let it continue (if not, I can understand :-) ) so you can let us know whether this return to 0 % was permanent or whether it came back to 16 % after a while.
I am thinking of different possible scenarios, namely:
1. Something has gone seriously wrong and this WU has restarted from the very beginning. In that case you will see another initialization phase of about two and a half hours.
2. Something in the initialization phase has to be redone (and while that is being done you see 0 % again), but then the computation resumes at 16 % as expected.
3. Something has gone wrong but the long initialization is not lost. In that case, after coming back to 0 %, you see the percentage increasing immediately without waiting two and a half hours.
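
The three scenarios above could be told apart from two observations, roughly as in the Python sketch below. Everything in it is an assumption made for illustration: the names, the 150-minute initialization estimate (from the two and a half hours mentioned above) and the "half of the initialization time" threshold are hypothetical, not values used by BOINC or CEP.

```python
from enum import Enum


class Scenario(Enum):
    FULL_RESTART = 1             # case 1: WU restarted from the very beginning
    INIT_REDONE_RESUMED = 2      # case 2: initialization redone, then resume at checkpoint
    INIT_KEPT_PROGRESS_LOST = 3  # case 3: progress lost, long initialization kept


def classify_restart(minutes_at_zero: float,
                     first_nonzero_percent: float,
                     checkpoint_percent: float,
                     init_minutes: float = 150.0,       # assumed length of initialization
                     long_wait_fraction: float = 0.5):  # hypothetical threshold
    """Guess which scenario occurred from what the progress display showed.

    minutes_at_zero: time spent at 0 % after the unexpected drop.
    first_nonzero_percent: first percentage displayed once progress moved again.
    checkpoint_percent: checkpoint the WU had resumed from before dropping to 0 %.
    """
    if first_nonzero_percent >= checkpoint_percent:
        return Scenario.INIT_REDONE_RESUMED
    if minutes_at_zero >= long_wait_fraction * init_minutes:
        return Scenario.FULL_RESTART
    return Scenario.INIT_KEPT_PROGRESS_LOST


if __name__ == "__main__":
    print(classify_restart(20, 16.0, 16.0))
    print(classify_restart(150, 0.1, 16.0))
    print(classify_restart(1, 0.2, 16.0))
```

On a fast machine the wait at 0 % in case 2 may be too short to notice, so the threshold would need adjusting per machine.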

In my opinion all three cases are errors, but I would like to know more about it, and with a slower machine it is easier to watch what happens.
I have already had CEP WUs restarting after stopping BOINC and I have never seen what you describe. But if what happens is case #2 (I hope so), it is possible that it goes unnoticed on a faster machine crunching in the background.

Thanks in advance for any additional information that you can provide. Jean.
[Apr 20, 2009 11:56:06 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: more errors


I hope you have let it continue


Yes I did, just to see what was going to happen...



1. Something has gone seriously wrong and this WU has restarted from the very beginning. In that case you will see another initialization phase of about two and a half hours.


That's what seems to be happening. After 40 minutes it shows 0.9% progress...



Thanks in advance for any additional information that you can provide. Jean.


That's it. I don't know what to think about all this...

Thanks for the help.
[Apr 20, 2009 12:39:37 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: more errors

I found these messages:

20/04/2009 13:23:51|World Community Grid|Computation for task E000510_200A_003d6w006_0 finished
20/04/2009 13:23:51|World Community Grid|Output file E000510_200A_003d6w006_0_2 for task E000510_200A_003d6w006_0 absent
20/04/2009 13:23:51|World Community Grid|Output file E000510_200A_003d6w006_0_3 for task E000510_200A_003d6w006_0 absent

So I think the unit may have disappeared for some reason at restart, and the one that showed 0% after the restart is a different one...
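
For anyone who wants to scan their own messages for the same symptom, here is a small Python sketch. It assumes the log lines use the pipe-separated "timestamp|project|message" layout shown above and that the wording of the "absent" message is exactly as quoted; the function name is made up for this example and is not a BOINC API.

```python
import re
from collections import defaultdict

# Message wording copied from the log lines quoted above.
ABSENT_RE = re.compile(r"Output file (\S+) for task (\S+) absent")


def absent_outputs(log_lines):
    """Map each task name to the output files reported as absent for it."""
    missing = defaultdict(list)
    for line in log_lines:
        parts = line.strip().split("|", 2)  # timestamp, project, message
        if len(parts) != 3:
            continue  # not in the assumed three-field layout
        _timestamp, _project, message = parts
        match = ABSENT_RE.fullmatch(message)
        if match:
            output_file, task = match.groups()
            missing[task].append(output_file)
    return dict(missing)


if __name__ == "__main__":
    sample = [
        "20/04/2009 13:23:51|World Community Grid|Computation for task E000510_200A_003d6w006_0 finished",
        "20/04/2009 13:23:51|World Community Grid|Output file E000510_200A_003d6w006_0_2 for task E000510_200A_003d6w006_0 absent",
        "20/04/2009 13:23:51|World Community Grid|Output file E000510_200A_003d6w006_0_3 for task E000510_200A_003d6w006_0 absent",
    ]
    print(absent_outputs(sample))
```

A task that appears in the result finished without producing all of its expected output files, which is consistent with the WU having ended in error rather than still being the one shown at 0%.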
[Apr 20, 2009 1:49:30 PM]