World Community Grid - View Thread - Mapping Cancer Markers

World Community Grid Forums

Category: Active Research

Forum: Mapping Cancer Markers Forum

Thread: Mapping Cancer Markers - Problems Thread

Quick Go »

No member browsing this thread

Thread Status: Active
Total posts in this thread: 264

[ ]

Author

This topic has been viewed 79466 times and has 263 replies

armstrdj
Former World Community Grid Tech
Joined: Oct 21, 2004
Post Count: 695
Status: Offline
Project Badges:

5 year badge for Human Proteome Folding - Phase 2

14 day badge for Help Cure Muscular Dystrophy

90 day badge for Discovering Dengue Drugs - Together

90 day badge for Nutritious Rice for the World

90 day badge for The Clean Energy Project

2 year badge for Help Fight Childhood Cancer

90 day badge for Influenza Antiviral Drug Search

2 year badge for Help Cure Muscular Dystrophy - Phase 2

2 year badge for Discovering Dengue Drugs - Together - Phase 2

2 year badge for The Clean Energy Project - Phase 2

2 year badge for Computing for Clean Water

2 year badge for Drug Search for Leishmaniasis

2 year badge for GO Fight Against Malaria

2 year badge for Computing for Sustainable Water

10 year badge for Mapping Cancer Markers

2 year badge for Uncovering Genome Mysteries

2 year badge for Outsmart Ebola Together

2 year badge for FightAIDS@Home - Phase 2

2 year badge for Microbiome Immunity Project

2 year badge for Africa Rainfall Project

2 year badge for OpenPandemics - COVID-19


Re: Mapping Cancer Markers - Problems Thread

TimAndHedy,

Look at this thread to see info to supply for a stuck workunit http://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,35845 if you still have it running.

Thanks,
armstrdj

[Nov 21, 2013 4:13:12 PM]

armstrdj
Former World Community Grid Tech
Joined: Oct 21, 2004
Post Count: 695
Status: Offline
Project Badges:


Re: Mapping Cancer Markers - Problems Thread

breathesgelatin,

Check the website for the results which are erroring out and post the result log from one of them. If you have not done this before navigate to My Contribution->Result Status and from there you can filter based on result status = error. Click on the status and that will show the result log.

Thanks,
armstrdj

[Nov 21, 2013 4:17:23 PM]

BobCat13
Senior Cruncher
Joined: Oct 29, 2005
Post Count: 295
Status: Offline
Project Badges:

180 day badge for Human Proteome Folding - Phase 2

180 day badge for Nutritious Rice for the World

45 day badge for The Clean Energy Project

90 day badge for Discovering Dengue Drugs - Together - Phase 2

90 day badge for The Clean Energy Project - Phase 2

1 year badge for Drug Search for Leishmaniasis

1 year badge for GO Fight Against Malaria

45 day badge for Computing for Sustainable Water

20 year badge for Mapping Cancer Markers

1 year badge for Africa Rainfall Project

5 year badge for OpenPandemics - COVID-19


Re: Mapping Cancer Markers - Problems Thread

Linux Mint 15 64-bit

Result Name: MCM1_ 0000176_ 2595_ 0--

<core_client_version>7.2.28</core_client_version>
<![CDATA[
<message>
process exited with code 193 (0xc1, -63)
</message>
<stderr_txt>
Commandline = ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.26_x86_64-pc-linux-gnu -SettingsFile MCM1_0000176_2595.txt -DatabaseFile dataset-17_72_SDG_v1.txt
Initializing
wcg_learn_limit = 500000
Running
[14:03:04]: Computing pass 0
*** glibc detected *** ../../projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.26_x86_64-pc-linux-gnu: munmap_chunk(): invalid pointer: 0x000000000474bee0 ***
======= Backtrace: =========
[0x5434c2]
[0x483ccb]
[0x483c37]
[0x482b73]
[0x42fbe9]
[0x44294d]
[0x442fc3]
[0x443080]
[0x42585c]
[0x51712b]
[0x400449]
======= Memory map: ========
00400000-00648000 r-xp 00000000 08:03 1310877 /boinc/data/projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.26_x86_64-pc-linux-gnu
00848000-0084b000 rw-p 00248000 08:03 1310877 /boinc/data/projects/www.worldcommunitygrid.org/wcgrid_mcm1_7.26_x86_64-pc-linux-gnu
0084b000-00883000 rw-p 00000000 00:00 0
0114e000-047ad000 rw-p 00000000 00:00 0 [heap]
7f00710ae000-7f00710af000 rw-p 00000000 00:00 0
7f00710af000-7f00710b0000 rw-s 00000000 08:03 1310984 /boinc/data/slots/3/boinc_mcm1_3
7f00710b0000-7f00710b1000 ---p 00000000 00:00 0
7f00710b1000-7f00710b8000 rw-p 00000000 00:00 0 [stack:12021]
7f00710b8000-7f00710ba000 rw-s 00000000 08:03 1310937 /boinc/data/slots/3/boinc_mmap_file
7fff2dd93000-7fff2ddb4000 rw-p 00000000 00:00 0 [stack]
7fff2ddfe000-7fff2de00000 r-xp 00000000 00:00 0 [vdso]
ffffffffff600000-ffffffffff601000 r-xp 00000000 00:00 0 [vsyscall]
SIGABRT: abort called
Stack trace (16 frames):
[0x4987cd]
[0x47fc80]
[0x47fb4b]
[0x51fc75]
[0x53d9a7]
[0x5434c2]
[0x483ccb]
[0x483c37]
[0x482b73]
[0x42fbe9]
[0x44294d]
[0x442fc3]
[0x443080]
[0x42585c]
[0x51712b]
[0x400449]

Exiting...

</stderr_txt>
]]>

[Nov 21, 2013 10:45:47 PM]

TimAndHedy
Senior Cruncher
Joined: Jan 27, 2009
Post Count: 267
Status: Offline
Project Badges:

10 year badge for Help Fight Childhood Cancer

5 year badge for The Clean Energy Project - Phase 2

90 day badge for Drug Search for Leishmaniasis

14 day badge for GO Fight Against Malaria

100 year badge for Mapping Cancer Markers

5 year badge for Outsmart Ebola Together

10 year badge for Microbiome Immunity Project

10 year badge for OpenPandemics - COVID-19


Re: Mapping Cancer Markers - Problems Thread

TimAndHedy,

Look at this thread to see info to supply for a stuck workunit http://www.worldcommunitygrid.org/forums/wcg/viewthread_thread,35845 if you still have it running.

Thanks,
armstrdj

I rebooted and it stayed at 100% complete but restarted the processing time back to 0.

I aborted the unit. It would be nice to have this purged from the system.

It is wasting a lot of processing time, in my case something like 53 hours, who knows on the systems that had it for 10 days.

----------------------------------------
[Edit 1 times, last edit by TimAndHedy at Nov 22, 2013 4:26:27 AM]

[Nov 22, 2013 4:24:23 AM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: Mapping Cancer Markers - Problems Thread

Hi TimAndHedy,
Yes, I would also like an occasional explanation of these problems. They are most common at the start of a project. As the project scientists discover what causes these long-running errors, the numbers are usually reduced, but without any real mention on the board.

Lawrence

[Nov 22, 2013 5:00:47 AM]

NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
Status: Offline
Project Badges:

2 year badge for Human Proteome Folding - Phase 2

180 day badge for Discovering Dengue Drugs - Together

2 year badge for Nutritious Rice for the World

180 day badge for Discovering Dengue Drugs - Together - Phase 2

1 year badge for Computing for Sustainable Water

5 year badge for Uncovering Genome Mysteries

10 year badge for Outsmart Ebola Together

10 year badge for FightAIDS@Home - Phase 2

20 year badge for Smash Childhood Cancer

20 year badge for Microbiome Immunity Project

5 year badge for Africa Rainfall Project


Re: Mapping Cancer Markers - Problems Thread

There is no "Computing pass 0" entry in the results log. What did the WU do for almost 6 hours?

Cheers coffee

----------------------------------------

[Nov 22, 2013 5:04:23 AM]

NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
Status: Offline
Project Badges:


Re: Mapping Cancer Markers - Problems Thread

I am very suspisious that the WUs are running far longer than is being reported, perhaps as much a 10 times longer. It is hard to be sure since the start time is not logged in stderr.txt. I've got some jobs that I think ran for over 48 hours, but have only logged a few. Is the CPU time being truncated? I've got 8 cores running 24x7, but I am not seeing 8 days of work beeing reported.

007:01:33:09

008:16:35:48

008:17:02:00

005:23:28:57

004:16:03:17

008:23:41:41

008:17:19:52

006:10:40:09

006:20:12:14

008:04:21:37

006:00:55:42

005:02:35:24

006:06:58:28

It looks like 20% of my cpu time is being "lost" or wasted.

Cheers coffee

[Edit - highlight time]

----------------------------------------

----------------------------------------
[Edit 1 times, last edit by NixChix at Nov 22, 2013 5:36:40 PM]

[Nov 22, 2013 5:25:28 AM]

NixChix
Veteran Cruncher
United States
Joined: Apr 29, 2007
Post Count: 1187
Status: Offline
Project Badges:


Re: Mapping Cancer Markers - Problems Thread

Are any of you that *know* that you have had jobs running for more than 24 hours seeing that reflected in the server results?

Also, would someone who has had one please post your results log. I would like to compare to some of mine that I think have been running long.

Cheers coffee

----------------------------------------

[Nov 22, 2013 5:30:12 AM]

Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline


Re: Mapping Cancer Markers - Problems Thread

I haven't seen this described before. I have two WUs running where the estimated time to completion is increasing significantly. The previous four WUs on this machine completed normally.

Properties of task MCM1_0000215_5002_0
Application Mapping Cancer Markers 7.26
Workunit name MCM1_0000215_5002
State Running
Received Fri 22 Nov 2013 10:53:43 AM CST
Report deadline Mon 02 Dec 2013 10:53:41 AM CST
Estimated computation size 39067 GFLOPS
CPU time at last checkpoint 05:50:22
CPU time 06:00:02
Elapsed time 06:00:22
Estimated time remaining 10:04:43 <---- This was 06:46:34 when it started.
Fraction done 27.560 %
Virtual memory size 82.72 MB
Working set size 41.02 MB
Directory slots/1
Process ID 10709

The other job report is similar. I have two more queued up.

So far I have suspended both jobs and restarted them. The estimated time to completion is still going up.

[Nov 22, 2013 11:12:50 PM]

Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7846
Status: Offline
Project Badges:

2 year badge for Discovering Dengue Drugs - Together

14 day badge for The Clean Energy Project

45 day badge for Discovering Dengue Drugs - Together - Phase 2

5 year badge for Drug Search for Leishmaniasis

5 year badge for GO Fight Against Malaria

200 year badge for Mapping Cancer Markers

20 year badge for Outsmart Ebola Together

100 year badge for Smash Childhood Cancer

100 year badge for OpenPandemics - COVID-19


Re: Mapping Cancer Markers - Problems Thread

There is something I have not seen before:
MCM1_ 0000122_ 9493_ 4-- 726 Too Late 11/22/13 10:29:37 11/23/13 10:46:42 5.22 111.7 / 0.0
MCM1_ 0000122_ 9493_ 3-- 726 Too Late 11/21/13 02:09:56 11/22/13 10:29:09 18.94 109.9 / 0.0
MCM1_ 0000122_ 9493_ 2-- 726 Too Late 11/20/13 05:37:02 11/21/13 00:43:48 4.48 85.3 / 0.0 <Mine
MCM1_ 0000122_ 9493_ 1-- - Detached 11/20/13 03:47:02 11/20/13 05:36:35 0.00 0.0 / 0.0
MCM1_ 0000122_ 9493_ 0-- 726 Too Late 11/20/13 02:34:50 11/20/13 08:25:39 3.04 111.7 / 0.0

All of the items except the detached item are marked "Too Late." Will this unit be reissued or is it just a total dud ?
I actually have two of these units so this is not the only one like this.

Cheers

----------------------------------------

Sgt. Joe
*Minnesota Crunchers*

----------------------------------------
[Edit 1 times, last edit by Sgt.Joe at Nov 23, 2013 5:03:07 PM]

[Nov 23, 2013 4:59:57 PM]

[ ]