World Community Grid Forums
Thread Status: Active Total posts in this thread: 550
mmonnin
Advanced Cruncher Joined: Jul 20, 2016 Post Count: 148 Status: Offline
I have been getting quite a few with FAH2 and HSTB selected. I can only get 1 task per thread of FAH2 so BM is always asking for HSTB work. 44 across 5 PCs.
Mumak
Senior Cruncher Joined: Dec 7, 2012 Post Count: 477 Status: Offline
I have noticed a higher (than usual = 0) error rate in recent HSTB tasks. The cause is "SIGSEGV: segmentation violation", most probably during the writing of a checkpoint.
Anyone else perhaps too?
BladeD
Ace Cruncher USA Joined: Nov 17, 2004 Post Count: 28976 Status: Offline
> I have been getting quite a few with FAH2 and HSTB selected. I can only get 1 task per thread of FAH2 so BM is always asking for HSTB work. 44 across 5 PCs.

Yep, not getting any FAH2, because I'm getting so many HSTB WUs!
[Edit 2 times, last edit by BladeD at Feb 6, 2019 1:45:17 AM]
Jean-David Beyer
Senior Cruncher USA Joined: Oct 2, 2007 Post Count: 339 Status: Offline
> I have noticed a higher (than usual = 0) error rate in recent HSTB tasks. Cause is "SIGSEGV: segmentation violation" most probably during writing of checkpoint. Anyone else perhaps too?

Not me. I have been getting more tasks than usual, though.
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7701 Status: Offline
> I have noticed a higher (than usual = 0) error rate in recent HSTB tasks. Cause is "SIGSEGV: segmentation violation" most probably during writing of checkpoint. Anyone else perhaps too?

I have one system I had to abort workunits on because they would cause the system to reboot. This system had the problem a while back and then crunched workunits for a couple of months with no problem. It even crunched all of the long units in the batch right before the little pause. It is an 8-core system, so I tested by suspending all the units except one HST unit. It ran alone for about 10 minutes and then the system rebooted. So I know the HST units are causing the problem.

Since no one else seems to have this problem, I believe I have some unknown hardware issue. The system does run all of the other projects without a problem. It could be at the point of writing a checkpoint, but I really don't know. I also don't know if it is related to your problem. I have excluded HST from this system for the time being. OS is Linux.

Cheers
Sgt. Joe
*Minnesota Crunchers*
Aurum
Master Cruncher The Great Basin Joined: Dec 24, 2017 Post Count: 2387 Status: Offline
It sure is nice to see HST ramping up the pace.
I'm in awe of these folks analyzing all these projects. Think about what it takes to deal with ZIKA. Every other day ZIKA amasses another million completed WUs. They hired a computer scientist to deal with it. HST could send out 100 times more WUs a day than now and we'd still get the work done. Pour it on!!!
Jack H
Cruncher Belgium Joined: Jan 16, 2006 Post Count: 11 Status: Offline
Progress of the project: 94%
D_S_Spence
Advanced Cruncher Canada Joined: Jan 5, 2017 Post Count: 107 Status: Offline
I haven't received any HSTB WUs since January 28. They run a little over 8 hours on my main box, so I need 9 of them to get my bronze.
Before the little burst of WUs in January, I think I was snagging approx. 1 WU per month. If it returns to that pattern, and it takes 9 months to get 9 WUs, will I get the bronze badge before the project ends? Or do I have to start running scripts and things to help get more WUs?
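The arithmetic behind that question can be sketched roughly as follows. The 8-hour runtime and the count of 9 WUs come from the post above; the 3 WUs per month rate is a purely hypothetical comparison point, and the function name is made up for illustration.

```python
import math

HOURS_PER_WU = 8   # "a little over 8 hours" per the post, rounded down
WUS_NEEDED = 9     # work units still needed for bronze, per the post


def months_to_collect(wus_needed: int, wus_per_month: float) -> int:
    """Whole months needed to receive wus_needed work units at a steady rate."""
    if wus_per_month <= 0:
        raise ValueError("wus_per_month must be positive")
    return math.ceil(wus_needed / wus_per_month)


remaining_hours = WUS_NEEDED * HOURS_PER_WU
print(f"Runtime still needed: about {remaining_hours} hours")
print(f"At 1 WU/month: {months_to_collect(WUS_NEEDED, 1)} months")
# Hypothetical faster rate, only to show how sensitive the wait is to supply.
print(f"At 3 WUs/month: {months_to_collect(WUS_NEEDED, 3)} months")
```

So the wait depends almost entirely on how many HSTB WUs the server hands out, not on how fast the machine crunches them.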
Aurum
Master Cruncher The Great Basin Joined: Dec 24, 2017 Post Count: 2387 Status: Offline
What storage queue are you using??? You might want to increase it some.
D_S_Spence
Advanced Cruncher Canada Joined: Jan 5, 2017 Post Count: 107 Status: Offline
@Aurum420
A few days ago I increased my cache from 2 to 6 extra days of work. I also changed "Connect to network about every" from 0.2 days to 0.05 days. And I've limited FAH2 to 1 WU, MCM to 5 WUs, MIP to 1 WU, and ZIKA to 1 WU, so there's no way to get to 6 days in the cache without HSTB, because SCC is dormant now. I could maybe stop accepting work from Rosetta for a while to maximize my chances. My project weights work out so that a Rosetta WU is running on one of my four cores most of the time.
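For anyone who would rather set the same two cache values locally than through the website profile, here is a minimal sketch that builds the contents of a BOINC `global_prefs_override.xml` file. It assumes that "Connect to network about every" corresponds to `work_buf_min_days` and that the extra days of cache correspond to `work_buf_additional_days`; the helper name `build_override` is made up for illustration. The per-project WU limits mentioned above are set in the World Community Grid device profile and are not covered here.

```python
import xml.etree.ElementTree as ET


def build_override(connect_every_days: float, extra_days: float) -> str:
    """Return override XML text for the two work-buffer settings.

    Assumption: these element names match the two cache settings
    described in the post; check them against your client version.
    """
    root = ET.Element("global_preferences")
    ET.SubElement(root, "work_buf_min_days").text = f"{connect_every_days:g}"
    ET.SubElement(root, "work_buf_additional_days").text = f"{extra_days:g}"
    return ET.tostring(root, encoding="unicode")


# Values taken from the post: connect every 0.05 days, 6 extra days of work.
override_xml = build_override(0.05, 6)
print(override_xml)
```

The resulting text would go into `global_prefs_override.xml` in the BOINC data directory, after which the client needs to re-read its local preferences. A larger buffer only makes the client ask for more work; it cannot make the server send HSTB WUs that are not available.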