Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go ยป
No member browsing this thread
Thread Status: Active
Total posts in this thread: 44
Posts: 44   Pages: 5   [ 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 3004 times and has 43 replies Next Thread
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

Hello


As i told you on Facebook, i bought a Revodrive to play with CEP2 project on my SR2 24cores.

I'm currently facing some issues. I got a BSOD (while i never got one my system is not O/C and under watercooling)

So i reduced the numbers of core used to 12 instead of 24 the moment.

Can you help please ? i would like to share my logs with you but i got forum issue (too big logs from boinc???)
Then i'll let the queue to finish and i'll raise one by one the number of used core until i found the best result.

My second problem is how to find my Real CPU time and elapsed time on WU already uploaded to your server ?
----------------------------------------
[Edit 1 times, last edit by Former Member at Nov 25, 2011 3:57:48 PM]
[Nov 25, 2011 3:56:47 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

Hi,

You can put your log file into for instance dropbox or some other hoster and post the link here.

Yes, if you run 24 simultaneous I'd expect things to start *crunchy*. It's with CEP2 good practice to slow build the number and let the other cores do the lighter stuff such as HCC or HCMD2.

There's no way to see the Elapsed/CPU time differential after upload. Many use the BOINCTasks tool that can simultaneous track hundreds of clients on a LAN and runs on Windows natively or in a VM/Wine on Mac and Linux. That logs all results and gives both times.

--//--
[Nov 25, 2011 4:16:59 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

Great thanks for your tools. It is exactly what i was looking for.

Before running on Revordrive 3, i was doing 24 cores running on a 10K RPM velociraptor HD without any problem 6 months ago. I 've just restarted today. i espected to see just great improvement when using Revodrive because of better performance on writings compared to velociraptor.

i'll try to host my file .

Edit: Here it is. expecting it is ok for you

http://www.megaupload.com/?d=VPIHCL41
----------------------------------------
[Edit 1 times, last edit by Former Member at Nov 26, 2011 8:30:09 AM]
[Nov 25, 2011 8:03:38 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

i don't know if my link works but here is an example



warmachine-SR2

68 World Community Grid 26/11/2011 10:55:18 Sending scheduler request: To report completed tasks.
69 World Community Grid 26/11/2011 10:55:18 Reporting 1 completed tasks, not requesting new tasks
70 World Community Grid 26/11/2011 10:55:21 Scheduler request completed
71 26/11/2011 10:55:59 System clock was turned backwards; clearing timeouts
72 World Community Grid 26/11/2011 10:56:00 Task E204101_394_C.29.C26H18N2Si.00640665.2.set1d06_0 exited with zero status but no 'finished' file
73 World Community Grid 26/11/2011 10:56:00 If this happens repeatedly you may need to reset the project.
74 World Community Grid 26/11/2011 10:56:00 Task E204101_246_C.30.C22H10N2O2S2SeSi.00598433.3.set1d06_0 exited with zero status but no 'finished' file
75 World Community Grid 26/11/2011 10:56:00 If this happens repeatedly you may need to reset the project.
76 World Community Grid 26/11/2011 10:56:00 Task E204101_130_C.29.C28H18S.00604686.2.set1d06_0 exited with zero status but no 'finished' file
77 World Community Grid 26/11/2011 10:56:00 If this happens repeatedly you may need to reset the project.
78 World Community Grid 26/11/2011 10:56:00 Task E204101_074_C.29.C28H18S.00587100.2.set1d06_0 exited with zero status but no 'finished' file
79 World Community Grid 26/11/2011 10:56:00 If this happens repeatedly you may need to reset the project.
80 World Community Grid 26/11/2011 10:56:00 Task E204101_069_C.30.C22H10N2O2S2SeSi.00431349.0.set1d06_0 exited with zero status but no 'finished' file
81 World Community Grid 26/11/2011 10:56:00 If this happens repeatedly you may need to reset the project.



and i found the following

http://setiathome.berkeley.edu/forum_thread.php?id=65060

i have Origin running. I shut it down and i started again all cores... let's monitor.


Edit: It did not help. i still facethe issue i have reduced the number of cores to 16
----------------------------------------
[Edit 1 times, last edit by Former Member at Nov 26, 2011 12:31:52 PM]
[Nov 26, 2011 11:29:40 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

Please post a copy of one Result log that has a Error status: My Grid > Result Status.

SysClock reversal is one thing, BOINC sciences don't like backward adjustments of longer than 30 seconds, but that has to happen many times before a task is told to take a hike. Presently thinking of heartbeat failure... system too busy so BOINC does not get the chance to check pulse of all the concurrent tasks running. Possible root cause... storage I/O bottleneck.

--//--
[Nov 26, 2011 3:00:35 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

There's no way to see the Elapsed/CPU time differential after upload.

You can use job_log_project.url.txt for this...
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
[Nov 26, 2011 3:07:17 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

Was waiting on that one... hook, line ... and good luck in digging through and converting the seconds to human legible run times.

--//--
[Nov 26, 2011 3:30:53 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Ingleside
Veteran Cruncher
Norway
Joined: Nov 19, 2005
Post Count: 974
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

Was waiting on that one... hook, line ... and good luck in digging through and converting the seconds to human legible run times.

This is easily done with example Excel, the biggest problem is that many of WCG's sub-projects hasn't human-readable task-names and I've got better things to do than remembering that the various cryptic letters V*, os*, q?*, M* and C* stands for.

For the sub-projects that does have understandable names, or I'm remembering X is HCC and E is CEP2, it was easy for me to calculate info like this, for data on the computer mostly crunching 24/7, used only data since 01.08.2011:

HCC: min efficiency, 94.85%, max efficiency, 100.09%, average 99.89%, std.dev. 0.315, 28 of 2228 tasks has more than 100% efficiency; min: 1.22 hours cpu-time, max: 1.58 hours, average: 1.39 hours, st.dev 0.089 hours.

GFAM: 98.99%, 99.90%, 99.69% average, 0.189 st.dev, 49 tasks; min 2.52 h, 8.64 h, 5.44h average, 1.15 h.
C4CW: 97.36%, 99.83%, 99.55% average, 0.292, 753 tasks; 3.19 h, 3.27 h, 3.22 h average, 0.012 h.
CMD2: 96.74%, 100.06%, 99.75% average, 0.506, 7 of 150 over 100%; 0.146 h, 11.50 h, 4.24 h average, 2.57 h.
DSFL: 99.59%, 100.05%, 99.75% average, 0.0942, 1 of 24 over 100%; 3.70 h, 5,07 h, 4.38 h average, 0.331 h.
CEP2: 95.62%, 98.67%, 97.39% average, 0.689, 170 tasks; 2.85 h, 12.00 h, 6.56 h average, 2.016 h; 1 task hit 12-hour-cutoff.
FAAH: 97.95%, 100.03%, 99.78% average, 0.234, 1 of 549 tasks over 100%; 3.60h, 9.73h, 5.93 h average, 0.904 h.

All tasks: 94.85%, 101.54%, 99.72% average, 0.569 st.dev, 50 of 4731 tasks over 100%. Min: 0.146 hours cpu-time, max 12.000 hours cpu-time, average 3.25 hours, st.dev 2.24 hours.


Now, getting this info could be made easier if example BoincTask had this capability, but it's also easy to use other tools to get the relevant info. I've probably used longer time typing-out this post than getting the relevant info.

Oh, and for the i7-920, the similar results for CEP2 is:
min 79.91%, 99.14%, 92.38% average, 4.58 st.dev; 4.78 h, 12.000 h, 10.25 h average, 57 of 99 tasks hit the 12-hour cutoff-limit.

No idea how many hours on average is wasted for every time CEP2 cuts-off in the middle of a step, but a 57 % cut-off-rate is definitely too high.
----------------------------------------


"I make so many mistakes. But then just think of all the mistakes I don't make, although I might."
----------------------------------------
[Edit 2 times, last edit by Ingleside at Nov 26, 2011 5:15:46 PM]
[Nov 26, 2011 5:12:26 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

How many workunits of clean energy project 2 are you running at once?

If more then one consider reducing the number of workunits done at one time.
[Nov 26, 2011 5:24:22 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: BOINC 6.12.34 seven 64 bit REVODRIVE 3 many error on tasks help please

Did not ask you to waste your time... you chose to proof the ''No way'' was contradict-able! applause

Not really on topic, my *None* HT Q6600 filtering with the WCGDAWS tool of pirogue has had exactly 1 hitting the 12 hour mark in the last 60 days, doing about 6 per day [50% of cores]. From a tech note some week or so ago, there's going to something that will allow these to run up to 24 hours, opt-in was me interpretation and from the statistics, the average run time has actually dropped since the implementation of ZR from 8 hours to 6 hours **, so largely, it's self-inflicted, that high cutoff %. Certainly 57% is not exactly representative, but if it bothers, the None HT crunching by restricting BOINC to the physical cores would possible remove the bulk of your waste, might even increase your number of results... you're good with numbers ;>)

--//--

** Having my thoughts of why that ZR may have added to the mean time drop.
[Nov 26, 2011 5:33:31 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Posts: 44   Pages: 5   [ 1 2 3 4 5 | Next Page ]
[ Jump to Last Post ]
Post new Thread