Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 3
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1082 times and has 2 replies Next Thread
jay_Orlando
Senior Cruncher
USA
Joined: Jan 4, 2006
Post Count: 189
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
confused Where do I start debug of segfault in cep2?

Greetings,
I'm running linux and Boinc 7.0.27
I noticed a segfault in my system log (kern.log.1) and syslog :
[369168.751849] wcgrid_cep2_qch[22385]: segfault at 5d0b7000 ip 000000000d8edea4 sp 00000000ff9c1194 error 4 in wcgrid_cep2_qchem_6.40_i686-pc-linux-gnu[8048000+6297000]

I did not see anything unusual in the BOINC event log.

I don't think I'm overloading with too many tasks.
I'm running on 7 of 8 cores (87.5%) anly am configured to run only one CEP2 wu at a time.

Question: Where do I start debugging?

FYI - env. stuff:

Wed 19 Jun 2013 12:40:44 AM EDT | | Starting BOINC client version 7.0.27 for x86_64-pc-linux-gnu
Wed 19 Jun 2013 12:40:44 AM EDT | | log flags: file_xfer, sched_ops, task
Wed 19 Jun 2013 12:40:44 AM EDT | | Libraries: libcurl/7.29.0 OpenSSL/1.0.1c zlib/1.2.7 libidn/1.25 librtmp/2.3
Wed 19 Jun 2013 12:40:44 AM EDT | | Data directory: /var/lib/boinc-client
Wed 19 Jun 2013 12:40:44 AM EDT | | Processor: 8 AuthenticAMD AMD FX(tm)-8150 Eight-Core Processor [Family 21 Model 1 Stepping 2]
Wed 19 Jun 2013 12:40:44 AM EDT | | Processor: 2.00 MB cache
Wed 19 Jun 2013 12:40:44 AM EDT | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc extd_apicid aperfmperf pni pclmulqdq monitor ssse3 cx16 sse4_1 sse4_2 popcnt aes xsave avx lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt lwp fma4 nodeid_msr topoext perfctr_core arat cpb hw_pstate npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold
Wed 19 Jun 2013 12:40:44 AM EDT | | OS: Linux: 3.8.0-13-generic
Wed 19 Jun 2013 12:40:44 AM EDT | | Memory: 7.70 GB physical, 8.04 GB virtual
Wed 19 Jun 2013 12:40:44 AM EDT | | Disk: 18.33 GB total, 15.32 GB free
Wed 19 Jun 2013 12:40:44 AM EDT | | Local time is UTC -4 hours
Wed 19 Jun 2013 12:40:44 AM EDT | | ATI GPU 0: Capeverde (CAL version 1.4.1741, 2048MB, 1726MB available, 2048 GFLOPS peak)
Wed 19 Jun 2013 12:40:44 AM EDT | | OpenCL: ATI GPU 0: Capeverde (driver version 1084.4 (VM), device version OpenCL 1.2 AMD-APP (1084.4), 2048MB, 1726MB available)
Wed 19 Jun 2013 12:40:44 AM EDT | | Config: use all coprocessors
Wed 19 Jun 2013 12:40:44 AM EDT | | Config: GUI RPC allowed from:
Wed 19 Jun 2013 12:40:44 AM EDT | | A new version of BOINC is available. <a href=http://boinc.berkeley.edu/download.php>Download it.</a>

and

me:/var/log$ ulimit -a
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
scheduling priority (-e) 0
file size (blocks, -f) unlimited
pending signals (-i) 62449
max locked memory (kbytes, -l) 64
max memory size (kbytes, -m) unlimited
open files (-n) 1024
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
real-time priority (-r) 0
stack size (kbytes, -s) 8192
cpu time (seconds, -t) unlimited
max user processes (-u) 62449
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited


I'm trying to link a result reported with the wu that was in progress.
I think this is the one it finished 4 hours after the segfault.:
Link to WCG results

Or, should I just ignore??

T H A N K Y O U!!
Jay E.
----------------------------------------

[Jun 23, 2013 1:24:20 PM]   Link   Report threatening or abusive post: please login first  Go to top 
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Reply to this Post  Reply with Quote 
Re: Where do I start debug of segfault in cep2?

Simple #1 rule with CEP2 in general: If valid, ignore!
Simple #2 rule with CEP2: if having an RC = xxxxxx in the result log, and task -not- invalid, ignore!
[Jun 23, 2013 2:29:51 PM]   Link   Report threatening or abusive post: please login first  Go to top 
jay_Orlando
Senior Cruncher
USA
Joined: Jan 4, 2006
Post Count: 189
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
smile Re: Where do I start debug of segfault in cep2?

Rob,

Thanks for the simple rules !!

I am now a happy camper.

Jay
----------------------------------------

[Jun 23, 2013 3:30:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread