Thread Status: Locked
Total posts in this thread: 192
Posts: 192   Pages: 20
This topic has been viewed 106661 times and has 191 replies
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

WU BETA_ erag_ a093_ ps0000_ 2-- errored after 32 hours. The result log is huge. Here are the relevant excerpts. I would be glad to post any additional information if requested.

Excerpts from Result Log

Result Name: BETA_ erag_ a093_ ps0000_ 2--

<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
.
.
.
Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x008E46C6 read attempt to address 0xB1D3C534
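
For anyone puzzled by the two forms of the exit code in the log above: the negative decimal and the hex value are the same 32-bit number, shown signed and unsigned. A minimal sketch of the conversion follows; the helper name `to_unsigned32` is made up for illustration and is not part of BOINC.

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as its unsigned 32-bit value."""
    return code & 0xFFFFFFFF


# Exit code exactly as reported in the result log.
exit_code = -1073741819

# Prints 0xc0000005, the Windows access-violation status named in the log.
print(hex(to_unsigned32(exit_code)))
```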
[Oct 8, 2009 1:48:19 PM]
uplinger
Former World Community Grid Tech
Joined: May 23, 2005
Post Count: 3952
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

joneill, we are not going to send out replacements for the errored work units on this beta. Some may have been sent out, but this has been stopped due to the high error rate. We will have another beta once we figure out these errors.

-Uplinger
[Oct 8, 2009 2:30:19 PM]
David_L6
Senior Cruncher
USA
Joined: Aug 24, 2006
Post Count: 296
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

I received 9 Betas this time. 1 is Valid, 2 are Pending Validation, 5 Errored, and 1 Too Late. I don't understand that "Too Late" result. It was returned ~3.5 hours from when it was received. When first returned it was Pending Validation.
[Oct 8, 2009 3:09:37 PM]
X-Files 27
Senior Cruncher
Canada
Joined: May 21, 2007
Post Count: 391
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

Just wondering: how would these results reach quorum if there are no repair WUs, assuming there is 1 successful WU?
[Oct 8, 2009 4:18:38 PM]
X-Files 27
Senior Cruncher
Canada
Joined: May 21, 2007
Post Count: 391
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

Because these type "A" WUs are so demanding (CPU/RAM/HD/network), they have to be sent to a dedicated crunching rig, which could be identified by its rDCF.

Let's say one was sent to a rig that crunches <8 hrs per day because its owner also uses it for work or something else. We all know that this WU will checkpoint 50 times. Depending on the speed of the rig, it may checkpoint every 1 to 2 hrs. If the user reboots or shuts down the machine, up to about 2 hrs will be lost. To reach completion (avg. 50 hrs) at 8 hrs per day, it needs 6.25 days, plus 2 days for weekends, or 8.25 days in total. That gives us 1.75 days of leeway, assuming the deadline is 10 days. Is that enough for all these reboots, shutdowns and whatnot?
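
The arithmetic in this post can be written out as a quick sketch. It uses only the post's own assumptions (50 hours of work, 8 hours of crunching per day, 2 weekend days, a 10-day deadline, about 2 hours lost per reboot); the function names are hypothetical, and the "affordable reboots" figure is just a reading of those same numbers, not anything measured.

```python
def deadline_leeway_days(total_hours, hours_per_day, weekend_days, deadline_days):
    """Days left over after crunching total_hours at hours_per_day plus idle weekend days."""
    crunch_days = total_hours / hours_per_day      # 50 / 8 = 6.25
    elapsed_days = crunch_days + weekend_days      # 6.25 + 2 = 8.25
    return deadline_days - elapsed_days


def affordable_reboots(leeway_days, hours_per_day, hours_lost_per_reboot):
    """How many worst-case reboots fit in the leeway, counting crunching hours only."""
    return int(leeway_days * hours_per_day // hours_lost_per_reboot)


leeway = deadline_leeway_days(total_hours=50, hours_per_day=8,
                              weekend_days=2, deadline_days=10)
print(leeway)                            # 1.75
print(affordable_reboots(leeway, 8, 2))  # 7
```

Under these assumptions the leeway covers about 14 crunching hours, so roughly seven worst-case restarts before the deadline is at risk.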
[Oct 8, 2009 4:54:37 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

I think you're confusing rDCF with the Active_frac. The first is an indication of performance relative to the benchmark; the second is the fraction of time, measured over a sliding period, that the computer is on and allowed to crunch. The servers are told this on each contact so they can determine whether the client can foreseeably finish the work. A message in the client log would state something like:

17/07/2008 17:33:35|World Community Grid|Message from server: (won't finish in time) Computer on 99.6% of time, BOINC on 98.7% of that

You'll find out pretty quickly whether, in your scenario, the unit can be finished in time. For now it's speculation: 60 hours in 10 days requires 6 hours a day. With the <checkpoint_debug> flag on, the timing of switching off the computer can be managed reasonably well, so that the least time is lost when the computer wakes up.
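
As a rough illustration of the idea only: the two percentages in the server message can be multiplied to estimate how many crunching hours a machine offers before the deadline. This is a back-of-the-envelope sketch with made-up function names, not the BOINC scheduler's actual logic, which takes more into account than these two fractions.

```python
def available_hours(deadline_days, on_frac, active_frac):
    """Estimated crunching hours before the deadline.

    on_frac:     fraction of time the computer is on
    active_frac: fraction of that time BOINC is allowed to crunch
    """
    return deadline_days * 24 * on_frac * active_frac


def can_finish(required_hours, deadline_days, on_frac, active_frac):
    """True if the estimated available hours cover the required hours."""
    return available_hours(deadline_days, on_frac, active_frac) >= required_hours


def required_hours_per_day(required_hours, deadline_days):
    """Average daily crunching needed to meet the deadline."""
    return required_hours / deadline_days


print(required_hours_per_day(60, 10))    # 6.0, the "6 hours a day" above
print(can_finish(60, 10, 0.996, 0.987))  # fractions taken from the sample message
```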
----------------------------------------
WCG Global & Research > Make Proposal Help: Start Here!
Please help to make the Forums an enjoyable experience for All!
----------------------------------------
[Edit 1 times, last edit by Sekerob at Oct 8, 2009 5:50:32 PM]
[Oct 8, 2009 5:46:46 PM]
Sekerob
Ace Cruncher
Joined: Jul 24, 2005
Post Count: 20043
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

PS, remember there are not too many A types to go around in the pool, the ratio being 2 A for 4 B for 10,000 C (quorum 2)
[Oct 8, 2009 5:54:14 PM]
PecosRiverM
Veteran Cruncher
The Great State of Texas
Joined: Apr 27, 2007
Post Count: 1053
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

Well, after 32 hrs it looks like I'm in good company.

Result Name: BETA_ erag_ a060_ ps0000_ 1--



<core_client_version>6.2.28</core_client_version>
<![CDATA[
<message>
- exit code -1073741819 (0xc0000005)
</message>
<stderr_txt>
.
.
.
.
Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x008E46C6 read attempt to address 0x52922F80

Engaging BOINC Windows Runtime Debugger


More available if you need it.
I was feeling left out, seeing as everyone else was getting errored out.
[Oct 8, 2009 11:25:01 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

I've had 1 WU error out so far, and nobody else seems to have reported the same error, even though it appears a wingman had the same one (and the other wingman had the more-common code 29)...


Result Name: BETA_ erag_ a053_ ps0000_ 2--
<core_client_version>6.2.15</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
INFO: No state to restore. Start from the beginning.
wcgStepsDone = 100 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.000400
wcgStepsDone = 200 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.000800
wcgStepsDone = 300 wcgSteps1 = 5000 wcgCyclesDone = 0 wcgCycles = 50 pctComplete = 0.001200

[about 400 lines deleted]

wcgStepsDone = 1400 wcgSteps1 = 5000 wcgCyclesDone = 8 wcgCycles = 50 pctComplete = 0.165600
wcgStepsDone = 1500 wcgSteps1 = 5000 wcgCyclesDone = 8 wcgCycles = 50 pctComplete = 0.166000
wcgStepsDone = 1600 wcgSteps1 = 5000 wcgCyclesDone = 8 wcgCycles = 50 pctComplete = 0.166400
wcgStepsDone = 1700 wcgSteps1 = 5000 wcgCyclesDone = 8 wcgCycles = 50 pctComplete = 0.166800
wcgStepsDone = 1800 wcgSteps1 = 5000 wcgCyclesDone = 8 wcgCycles = 50 pctComplete = 0.167200
SIGSEGV: segmentation violation
Stack trace (13 frames):
[0x86cd8af]
[0x8737250]
[0xb7ff3400]
[0x83a07ff]
[0x82e6663]
[0x820f211]
[0x81eca3b]
[0x81f2737]
[0x86a798f]
[0x86ab016]
[0x804e5cc]
[0x873934a]
[0x8048131]

Exiting...

</stderr_txt>
]]>
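
The pctComplete values in the stderr output above are consistent with a simple formula: completed cycles count as full blocks of wcgSteps1 steps, and the steps done in the current cycle are added on top. The sketch below is inferred from the logged numbers only; it is not taken from the application's source, and the function name is hypothetical.

```python
def pct_complete(steps_done, steps_per_cycle, cycles_done, total_cycles):
    """Fraction complete, reconstructed from the logged wcg* counters."""
    done = cycles_done * steps_per_cycle + steps_done
    total = total_cycles * steps_per_cycle
    return done / total


# First logged line: wcgStepsDone = 100, wcgCyclesDone = 0 -> pctComplete = 0.000400
print(f"{pct_complete(100, 5000, 0, 50):.6f}")

# Last line before the SIGSEGV: wcgStepsDone = 1800, wcgCyclesDone = 8 -> 0.167200
print(f"{pct_complete(1800, 5000, 8, 50):.6f}")
```

By this reading the task crashed partway through its ninth cycle of 50, at about 16.7% complete.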


From StdOutDae

06-Oct-2009 17:02:21 [World Community Grid] Starting BETA_erag_a053_ps0000_2
06-Oct-2009 17:02:21 [World Community Grid] Starting task BETA_erag_a053_ps0000_2 using beta10 version 607
06-Oct-2009 18:45:04 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
06-Oct-2009 19:51:39 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
06-Oct-2009 20:57:44 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
06-Oct-2009 22:04:53 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
06-Oct-2009 23:10:07 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
07-Oct-2009 00:15:33 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
07-Oct-2009 01:20:17 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
07-Oct-2009 02:25:42 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
07-Oct-2009 02:56:25 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
07-Oct-2009 03:09:06 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed
07-Oct-2009 03:22:25 [World Community Grid] Computation for task BETA_erag_a053_ps0000_2 finished
07-Oct-2009 03:22:27 [World Community Grid] Output file BETA_erag_a053_ps0000_2_4 for task BETA_erag_a053_ps0000_2 absent
07-Oct-2009 03:22:27 [World Community Grid] Output file BETA_erag_a053_ps0000_2_5 for task BETA_erag_a053_ps0000_2 absent
07-Oct-2009 03:22:28 [World Community Grid] Started upload of BETA_erag_a053_ps0000_2_0
07-Oct-2009 03:22:28 [World Community Grid] Started upload of BETA_erag_a053_ps0000_2_1
07-Oct-2009 03:22:30 [World Community Grid] Finished upload of BETA_erag_a053_ps0000_2_0
07-Oct-2009 03:22:30 [World Community Grid] Started upload of BETA_erag_a053_ps0000_2_2
07-Oct-2009 03:22:33 [World Community Grid] Finished upload of BETA_erag_a053_ps0000_2_1
07-Oct-2009 03:22:33 [World Community Grid] Started upload of BETA_erag_a053_ps0000_2_3
07-Oct-2009 03:22:37 [World Community Grid] Finished upload of BETA_erag_a053_ps0000_2_3
07-Oct-2009 03:22:37 [World Community Grid] Started upload of BETA_erag_a053_ps0000_2_6
07-Oct-2009 03:22:40 [World Community Grid] Finished upload of BETA_erag_a053_ps0000_2_6
07-Oct-2009 03:35:19 [World Community Grid] Finished upload of BETA_erag_a053_ps0000_2_2
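
To see how far apart the checkpoints in a log like the one above are, the [checkpoint_debug] lines can be parsed and the timestamps subtracted. This is a minimal sketch that assumes the line layout shown in this post and English month abbreviations; the regular expression and function name are illustrative, not part of BOINC.

```python
import re
from datetime import datetime

# Matches lines such as:
# 06-Oct-2009 18:45:04 [World Community Grid] [checkpoint_debug] result NAME checkpointed
CHECKPOINT_RE = re.compile(
    r"^(\d{2}-[A-Za-z]{3}-\d{4} \d{2}:\d{2}:\d{2}) \[[^\]]*\] "
    r"\[checkpoint_debug\] result (\S+) checkpointed$"
)


def checkpoint_intervals(lines):
    """Return the seconds between consecutive checkpoint lines, in log order."""
    times = []
    for line in lines:
        match = CHECKPOINT_RE.match(line.strip())
        if match:
            times.append(datetime.strptime(match.group(1), "%d-%b-%Y %H:%M:%S"))
    return [(later - earlier).total_seconds()
            for earlier, later in zip(times, times[1:])]


# A few lines copied from the log above.
sample = [
    "06-Oct-2009 17:02:21 [World Community Grid] Starting BETA_erag_a053_ps0000_2",
    "06-Oct-2009 18:45:04 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed",
    "06-Oct-2009 19:51:39 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed",
    "06-Oct-2009 20:57:44 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed",
    "06-Oct-2009 22:04:53 [World Community Grid] [checkpoint_debug] result BETA_erag_a053_ps0000_2 checkpointed",
]

for seconds in checkpoint_intervals(sample):
    print(f"{seconds / 60:.1f} minutes")
```

On these sample lines the gaps come out at a little over an hour each, which fits the 1 to 2 hour checkpoint spacing mentioned earlier in the thread.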


I wish I could add anything from the Messages tab in BOINC but it mysteriously stopped writing to that tab for about 6.5 hours during which the error occurred.
Here's the top section from that, though...

Thu 08 Oct 2009 08:01:07 PM EDT||Starting BOINC client version 6.2.15 for i686-pc-linux-gnu
Thu 08 Oct 2009 08:01:07 PM EDT||log flags: task, file_xfer, sched_ops, checkpoint_debug
Thu 08 Oct 2009 08:01:07 PM EDT||Libraries: libcurl/7.18.0 OpenSSL/0.9.8g zlib/1.2.3 c-ares/1.5.1
Thu 08 Oct 2009 08:01:07 PM EDT||Data directory: /home/RM/BOINC
Thu 08 Oct 2009 08:01:07 PM EDT||Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [Family 6 Model 15 Stepping 11]
Thu 08 Oct 2009 08:01:07 PM EDT||Processor features: fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc arch_perfmon pebs bts pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm lahf_lm tpr_shadow
Thu 08 Oct 2009 08:01:07 PM EDT||OS: Linux: 2.6.30.8-64.fc11.i586
Thu 08 Oct 2009 08:01:07 PM EDT||Memory: 3.20 GB physical, 5.16 GB virtual
Thu 08 Oct 2009 08:01:07 PM EDT||Disk: 68.24 GB total, 64.59 GB free
Thu 08 Oct 2009 08:01:07 PM EDT||Local time is UTC -4 hours
Thu 08 Oct 2009 08:01:07 PM EDT||No coprocessors
Thu 08 Oct 2009 08:01:07 PM EDT|World Community Grid|URL: http://www.worldcommunitygrid.org/; Computer ID: 1056461; location: work; project prefs: work
Thu 08 Oct 2009 08:01:07 PM EDT||General prefs: from World Community Grid (last modified 06-Oct-2009 18:12:11)
Thu 08 Oct 2009 08:01:07 PM EDT||Computer location: work
Thu 08 Oct 2009 08:01:07 PM EDT||General prefs: using separate prefs for work
Thu 08 Oct 2009 08:01:07 PM EDT||Reading preferences override file
Thu 08 Oct 2009 08:01:07 PM EDT||Preferences limit memory usage when active to 3211.49MB
Thu 08 Oct 2009 08:01:07 PM EDT||Preferences limit memory usage when idle to 3277.04MB
Thu 08 Oct 2009 08:01:07 PM EDT||Preferences limit disk usage to 9.31GB


I did have the section that showed the gap, but I was almost ready to click 'Reply to the post' when I hit the 'Sleep' key by accident and ultimately found out there's no way to wake Fedora back up without rebooting (I probably have something configured wrong, though). So I just got done assembling all these data again.

Hope that helps.

This is now the 3rd [re]assembly and attempt to post it... 2nd attempt resulted in

World Community Grid Forums » Error !!!
The error message is:
Error executing SQL in PostDAOImplJDBC.create.

Probably from the Results log, so I removed about 400 lines from it.

Hmmm... the preview shows some of my typing disappearing off the right edge of the screen. Are the code tags breaking the formatting outside their range?
[Oct 9, 2009 12:50:00 AM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: Discovering Dengue Drugs - Together Phase 2 BETA

Just finished with quite a result log:
BETA_ erag_ a019_ ps0000_ 2-- | starbase4.command | Pending Validation | 10/6/09 20:04:31 | 10/9/09 15:10:44 | 63.45 | 747.0 / 0.0
Neither wingman has replied as of this post.

Wish I had had a couple more like them.
[Oct 9, 2009 3:16:54 PM]