| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 9
|
|
| Author |
|
|
NorthernRaider
Cruncher Canada Joined: Dec 10, 2008 Post Count: 12 Status: Offline Project Badges:
|
I got the same errors on a whole bunch of workunits but this time from Smash Childhood Cancer. Here is from 3 last WU's the Process got signal 11 - Segmentation Violation. Likely means that the WU's are bad
----------------------------------------Result Name: SCC1_ 0003838_ FoxO1-A_ 21638_ 0-- <core_client_version>7.14.2</core_client_version> <![CDATA[ <message> process got signal 11</message> <stderr_txt> SIGSEGV: segmentation violation </stderr_txt> ]]> Result Name: SCC1_ 0003804_ Prdm-C_ 87258_ 0-- <core_client_version>7.14.2</core_client_version> <![CDATA[ <message> process got signal 11</message> <stderr_txt> SIGSEGV: segmentation violation </stderr_txt> ]]> Result Name: SCC1_ 0003773_ Prdm-B_ 10089_ 0-- <core_client_version>7.14.2</core_client_version> <![CDATA[ <message> process got signal 11</message> <stderr_txt> SIGSEGV: segmentation violation </stderr_txt> ]]> I have 20 GB Disk available to BOINC where only about 3 GB is being used. Machine has 32 GB of Memory and only 20% is being used of it, and has 8 CPU where only 6 are being used at the moment. You could look up the WU's in the logs, and I shall dig up some more to see if it is a System Problem at my end. UPDATE : I had 5 SCC1 WU's in the queue. They all started and then quit with the Computation Error, I bet it is all the SIGSEGV Computation Error, that shows in the WCG Log. CONFIRMED !! after Transmit. They were transmitted back to WCG, and here is the part of the log that shows no Output file in the BOINC Log :: Mon 13 Apr 2020 07:26:32 PM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Mon 13 Apr 2020 07:26:33 PM EDT | World Community Grid | Output file SCC1_0003829_FoxO1-A_78908_0_r1617702272_0 for task SCC1_0003829_FoxO1-A_78908_0 absent Mon 13 Apr 2020 07:26:33 PM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Mon 13 Apr 2020 07:26:34 PM EDT | World Community Grid | Started upload of MCM1_0162034_6606_1_r1609257074_0 Mon 13 Apr 2020 07:26:34 PM EDT | World Community Grid | Output file SCC1_0003778_Prdm-B_84465_1_r521594503_0 for task SCC1_0003778_Prdm-B_84465_1 absent Mon 13 Apr 2020 07:26:34 PM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Mon 13 Apr 2020 07:26:35 PM EDT | World Community Grid | Output file SCC1_0003772_Prdm-B_54836_1_r1181842435_0 for task SCC1_0003772_Prdm-B_54836_1 absent Mon 13 Apr 2020 07:26:35 PM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Mon 13 Apr 2020 07:26:36 PM EDT | World Community Grid | Finished upload of MCM1_0162034_6606_1_r1609257074_0 Mon 13 Apr 2020 07:26:36 PM EDT | World Community Grid | Output file SCC1_0003804_Prdm-C_88955_0_r1259533660_0 for task SCC1_0003804_Prdm-C_88955_0 absent Mon 13 Apr 2020 07:26:36 PM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Mon 13 Apr 2020 07:26:37 PM EDT | World Community Grid | Output file SCC1_0003772_Prdm-B_54842_1_r609797721_0 for task SCC1_0003772_Prdm-B_54842_1 absent Mon 13 Apr 2020 07:26:37 PM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Mon 13 Apr 2020 07:26:42 PM EDT | World Community Grid | [mem_usage] ARP1_0016230_008_0: WS 742.51MB, smoothed 742.51MB, swap 815.76MB, 0.00 page faults/sec, user CPU 53219.750, kernel CPU 39.320 Mon 13 Apr 2020 07:26:42 PM EDT | World Community Grid | [mem_usage] ARP1_0010974_008_1: WS 742.05MB, smoothed 742.05MB, swap 815.76MB, 0.00 page faults/sec, user CPU 51869.840, kernel CPU 37.340 Mon 13 Apr 2020 07:26:42 PM EDT | World Community Grid | [mem_usage] ARP1_0034711_008_0: WS 742.06MB, smoothed 742.06MB, swap 815.76MB, 0.00 page faults/sec, user CPU 35291.110, kernel CPU 19.330 Mon 13 Apr 2020 07:26:42 PM EDT | World Community Grid | [mem_usage] MCM1_0162036_1741_1: WS 85.52MB, smoothed 85.52MB, swap 86.35MB, 0.00 page faults/sec, user CPU 7277.610, kernel CPU 4765.910 Mon 13 Apr 2020 07:26:42 PM EDT | World Community Grid | [mem_usage] MCM1_0161971_0040_1: WS 73.93MB, smoothed 73.93MB, swap 74.70MB, 0.00 page faults/sec, user CPU 3344.580, kernel CPU 2112.880 Mon 13 Apr 2020 07:26:42 PM EDT | World Community Grid | [mem_usage] MIP1_00289572_6543_0: WS 66.85MB, smoothed 33.42MB, swap 133.16MB, 0.00 page faults/sec, user CPU 2.940, kernel CPU 0.080 Mon 13 Apr 2020 07:26:42 PM EDT | | [mem_usage] BOINC totals: WS 2452.92MB, smoothed 2419.49MB, swap 2741.49MB, 0.00 page faults/sec Mon 13 Apr 2020 07:26:42 PM EDT | | [mem_usage] All others: WS 4311.16MB, swap 271351.43MB, user 5738.470s, kernel 3786.640s Let me know if you have an idea on how to solve this. ![]() ![]() |
||
|
|
Sgt.Joe
Ace Cruncher USA Joined: Jul 4, 2006 Post Count: 7846 Status: Offline Project Badges:
|
If your SCC work units crashed, check the status of the wingman. If successive wingmen complete the unit with no problem, then it is some type of system failure on your system. If none of the wingmen complete the units, then the units are bad work units.
----------------------------------------See here Cheers
Sgt. Joe
*Minnesota Crunchers* |
||
|
|
alanb1951
Veteran Cruncher Joined: Jan 20, 2006 Post Count: 1317 Status: Offline Project Badges:
|
Sgt. Joe is right about checking on your wingmen... If it turns out that the wingmen are not having problems, then here's a possibility, but it only applies if you are running an up-to-date Linux with a recent kernel. (I wish there was a way of finding out what people are running on if they don't say when they post, but... So this may be irrelevant to you, in which case sorry.)
For quite a long time now, Linux kernels have deprecated the old method of doing certain common system calls (date/time in particular); more recently, some system maintainers have disabled the mechanism by default, and I believe that now some have removed it completely. Anyway, the end result is that old applications may SIGSEGV now even if they didn't on an older kernel... There's more about this in a post I made in another thread here called "Some errors"; it gives more details and refers to more information on possible resolutions. I also asked if a new build was in the pipeline and one of the techs replied in the affirmative. Hope this helps - Al. |
||
|
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3315 Status: Offline Project Badges:
|
I had this error on 1 SCC task yesterday (at about 90%). Ubuntu 20.04, 5.4 kernel.
----------------------------------------Wingman did okay, no more errors since. The other 5 SCC and 2 FAH2 tasks did not error out. ![]() - AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W - AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W - AMD Ryzen 7 7730U 8C/16T 3.0 GHz |
||
|
|
KerSamson
Master Cruncher Switzerland Joined: Jan 29, 2007 Post Count: 1684 Status: Offline Project Badges:
|
Since you run already Ubuntu 20.04 (you updated your machine), I would recommend you to run the machine dry. Afterwards you should detach WCG. After a reboot, you can re-attach WCG and it should run better.
----------------------------------------I did experience similar trouble several years long with one particular machine: just occasionally errored or invalid WUs. Last year, because of more troubles after a full machine hardware upgrade and Linux update, I came to the idea to detach and re-attach WCG. In the meantime, the machine runs perfectly. I have no idea for the rational, but it worked for me. Cheers, Yves |
||
|
|
Falconet
Master Cruncher Portugal Joined: Mar 9, 2009 Post Count: 3315 Status: Offline Project Badges:
|
It was a clean install, just 2 weeks old.
----------------------------------------About 1000 WCG tasks (mostly SCC) since then and this was the only SCC error, I think. If it happens again, I'll try what you suggested. ![]() - AMD Ryzen 5 1600AF 6C/12T 3.2 GHz - 85W - AMD Ryzen 5 2500U 4C/8T 2.0 GHz - 28W - AMD Ryzen 7 7730U 8C/16T 3.0 GHz |
||
|
|
NorthernRaider
Cruncher Canada Joined: Dec 10, 2008 Post Count: 12 Status: Offline Project Badges:
|
Update on the original Post , with answers on checking things out and provide further information.
----------------------------------------1. Wingmen complete the WU , so it is something that is local to me. 2. Ran complete passes of MemTest86+ with NO failures anywhere. 3. Ran WCG / Boinc empty , and Cleared and Stopped Client and restarted the system, taking on a minimum of 10 tasks, of which one is SCC1 4 Reduced running tasks to 6 Active only (75% CPU) 5. each and every started SCC1 task fails with the same SigSevV error Signal 11 6. Enabled complete debugging of client in the log. 7. Suspended all tasks except the SCC1 for debug logging. 8. Captured the log of which part is attached here showing the task failure. I am including here the start section for Boinc , includes hardware and software info. Then I am including a full debug of the starting of task SCC1_0003882_FoxO1-B_75698_0 , and the sections regarding this task when it was attempting to run. I can not see from this what exactly is happening in the task , and hope that one of you can have a look and give some advise. Wed 22 Apr 2020 08:45:28 PM EDT | | Starting BOINC client version 7.14.2 for x86_64-pc-linux-gnu Wed 22 Apr 2020 08:45:28 PM EDT | | log flags: file_xfer, task, checkpoint_debug, coproc_debug Wed 22 Apr 2020 08:45:28 PM EDT | | Libraries: libcurl/7.64.0 OpenSSL/1.1.1d zlib/1.2.11 libidn2/2.0.5 libpsl/0.20.2 (+libidn2/2.0.5) libssh2/1.8.0 nghttp2/1.36.0 librtmp/2.3 Wed 22 Apr 2020 08:45:28 PM EDT | | Data directory: /var/lib/boinc-client Wed 22 Apr 2020 08:45:28 PM EDT | | [coproc] launching child process at /usr/bin/boinc Wed 22 Apr 2020 08:45:28 PM EDT | | [coproc] with data directory /var/lib/boinc-client Wed 22 Apr 2020 08:45:35 PM EDT | | CUDA: NVIDIA GPU 0: GeForce GTX 960 (driver version 418.74, CUDA version 10.1, compute capability 5.2, 2002MB, 1952MB available, 2644 GFLOPS peak) Wed 22 Apr 2020 08:45:35 PM EDT | | [coproc] NVIDIA library reports 1 GPU Wed 22 Apr 2020 08:45:35 PM EDT | | [coproc] ATI: libaticalrt.so: cannot open shared object file: No such file or directory Wed 22 Apr 2020 08:45:35 PM EDT | | [coproc] OpenCL library present but no OpenCL-capable devices found Wed 22 Apr 2020 08:45:36 PM EDT | | [libc detection] gathered: 2.28, Debian GLIBC 2.28-10 Wed 22 Apr 2020 08:45:36 PM EDT | | Host name: jabbah Wed 22 Apr 2020 08:45:36 PM EDT | | Processor: 8 AuthenticAMD AMD FX(tm)-8350 Eight-Core Processor [Family 21 Model 2 Stepping 0] Wed 22 Apr 2020 08:45:36 PM EDT | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq monitor ssse3 fma cx16 sse4_1 sse4_2 popcnt aes xsave avx f16c lahf_lm cmp_legacy svm extapic cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw ibs xop skinit wdt fma4 tce nodeid_msr tbm topoext perfctr_core perfctr_nb cpb hw_pstate ssbd ibpb vmmcall bmi1 arat npt lbrv svm_lock nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold Wed 22 Apr 2020 08:45:36 PM EDT | | OS: Linux Debian: Debian GNU/Linux 10 (buster) [4.19.0-8-amd64|libc 2.28 (Debian GLIBC 2.28-10)] Wed 22 Apr 2020 08:45:36 PM EDT | | Memory: 31.32 GB physical, 64.62 GB virtual Wed 22 Apr 2020 08:45:36 PM EDT | | Disk: 58.67 GB total, 49.59 GB free Wed 22 Apr 2020 08:45:36 PM EDT | | Local time is UTC -4 hours Wed 22 Apr 2020 08:45:36 PM EDT | | Config: GUI RPCs allowed from: Wed 22 Apr 2020 08:45:37 PM EDT | World Community Grid | URL http://www.worldcommunitygrid.org/; Computer ID 4044432; resource share 100 Wed 22 Apr 2020 08:45:37 PM EDT | World Community Grid | General prefs: from World Community Grid (last modified 17-Apr-2020 09:30:18) Wed 22 Apr 2020 08:45:37 PM EDT | World Community Grid | Host location: none Wed 22 Apr 2020 08:45:37 PM EDT | World Community Grid | General prefs: using your defaults Wed 22 Apr 2020 08:45:37 PM EDT | | Preferences: Wed 22 Apr 2020 08:45:37 PM EDT | | max memory usage when active: 24050.60 MB Wed 22 Apr 2020 08:45:37 PM EDT | | max memory usage when idle: 28860.72 MB Wed 22 Apr 2020 08:45:37 PM EDT | | max disk usage: 20.00 GB Wed 22 Apr 2020 08:45:37 PM EDT | | max CPUs used: 6 Wed 22 Apr 2020 08:45:37 PM EDT | | (to change preferences, visit a project web site or select Preferences in the Manager) Wed 22 Apr 2020 08:45:37 PM EDT | | Setting up project and slot directories Wed 22 Apr 2020 08:45:37 PM EDT | | Checking active tasks Wed 22 Apr 2020 08:45:37 PM EDT | | Using account manager BOINCstatsBAM! Wed 22 Apr 2020 08:45:37 PM EDT | | Setting up GUI RPC socket Wed 22 Apr 2020 08:45:37 PM EDT | | gui_rpc_auth.cfg is empty - no GUI RPC password protection Wed 22 Apr 2020 08:45:37 PM EDT | | Checking presence of 143 project files Wed 22 Apr 2020 09:10:58 PM EDT | | Contacting account manager at https://bam.boincstats.com/ Wed 22 Apr 2020 09:11:01 PM EDT | | Account manager: BAM! User: 236703, NorthernRaider Wed 22 Apr 2020 09:11:01 PM EDT | | Account manager: BAM! Host: 945911 Wed 22 Apr 2020 09:11:01 PM EDT | | Account manager: Number of BAM! connections for this host: 207 Wed 22 Apr 2020 09:11:01 PM EDT | | Account manager contact succeeded Wed 22 Apr 2020 10:11:01 PM EDT | | Contacting account manager at https://bam.boincstats.com/ Wed 22 Apr 2020 10:11:04 PM EDT | | Account manager: BAM! User: 236703, NorthernRaider Wed 22 Apr 2020 10:11:04 PM EDT | | Account manager: BAM! Host: 945911 Wed 22 Apr 2020 10:11:04 PM EDT | | Account manager: Number of BAM! connections for this host: 208 Wed 22 Apr 2020 10:11:04 PM EDT | | Account manager contact succeeded Wed 22 Apr 2020 11:11:05 PM EDT | | Contacting account manager at https://bam.boincstats.com/ Wed 22 Apr 2020 11:11:08 PM EDT | | Account manager: BAM! User: 236703, NorthernRaider Wed 22 Apr 2020 11:11:08 PM EDT | | Account manager: BAM! Host: 945911 Wed 22 Apr 2020 11:11:08 PM EDT | | Account manager: Number of BAM! connections for this host: 209 Wed 22 Apr 2020 11:11:08 PM EDT | | Account manager contact succeeded Wed 22 Apr 2020 11:25:26 PM EDT | World Community Grid | project resumed by user ... .. SECTION OF THE LOG Concerning the SCC1 Task and Failure ... Thu 23 Apr 2020 05:35:52 AM EDT | | [rrsim_detail] rpbest: ARP1_0031183_009_0 (finish delay 2781.55) Thu 23 Apr 2020 05:35:52 AM EDT | World Community Grid | [rr_sim] 3712.98: ARP1_0031183_009_0 finishes (1.00 CPU) (10998.22G/2.96G) Thu 23 Apr 2020 05:35:52 AM EDT | | [rrsim_detail] rpbest: SCC1_0003882_FoxO1-B_75698_0 (finish delay 1931.68) Thu 23 Apr 2020 05:35:52 AM EDT | World Community Grid | [rr_sim] 5644.65: SCC1_0003882_FoxO1-B_75698_0 finishes (1.00 CPU) (16775.99G/3.42G) Thu 23 Apr 2020 05:35:52 AM EDT | | [rrsim_detail] rpbest: MCM1_0162402_2567_0 (finish delay 1412.43) Thu 23 Apr 2020 05:35:52 AM EDT | World Community Grid | [rr_sim] 7057.08: MCM1_0162402_2567_0 finishes (1.00 CPU) (18020.13G/2.55G) ... Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | task MCM1_0162447_2127_0 suspended by user Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] Request work fetch: task suspended by user Thu 23 Apr 2020 05:35:54 AM EDT | | [cpu_sched_debug] Request CPU reschedule: task suspended, resumed or aborted by user Thu 23 Apr 2020 05:35:54 AM EDT | | [statefile] set dirty: Result RPC Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC reply: '<boinc_gui_rpc_reply><success/></boinc_gui_rpc_reply>' Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC Command = '<boinc_gui_rpc_request><get_cc_status/></boinc_gui_rpc_request>' Thu 23 Apr 2020 05:35:54 AM EDT | | [network_status] status: don't need connection Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC reply: '<boinc_gui_rpc_reply><cc_status> <network_status>2</network_status> <ams_password_error>0</ams_password_error> <task_s' Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC Command = '<boinc_gui_rpc_request><get_messages> <seqno>474</seqno> <translatable/></get_messages></boinc_gui_rpc_request>' Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC reply: '<boinc_gui_rpc_reply><msgs><msg> <project></project> <pri>1</pri> <seqno>475</seqno> <body><![CDATA[[gui_rpc] GUI RPC rep' Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC Command = '<boinc_gui_rpc_request><get_results><active_only>0</active_only></get_results></boinc_gui_rpc_request>' Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC reply: '<boinc_gui_rpc_reply><results><result> <name>ARP1_0031183_009_0</name> <wu_name>ARP1_0031183_009</wu_name> <platfo' Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC Command = '<boinc_gui_rpc_request><get_messages> <seqno>554</seqno> <translatable/></get_messages></boinc_gui_rpc_request>' Thu 23 Apr 2020 05:35:54 AM EDT | | [gui_rpc] GUI RPC reply: '<boinc_gui_rpc_reply><msgs><msg> <project></project> <pri>1</pri> <seqno>555</seqno> <body><![CDATA[[gui_rpc] GUI RPC rep' Thu 23 Apr 2020 05:35:54 AM EDT | | [idle_detection] DISPLAY ':0' not found or insufficient access. Thu 23 Apr 2020 05:35:54 AM EDT | | [idle_detection] XSS idle detection succeeded on DISPLAY ':1'. Thu 23 Apr 2020 05:35:54 AM EDT | | [idle_detection] idle threshold: 180 Thu 23 Apr 2020 05:35:54 AM EDT | | [idle_detection] idle_time: 0 Thu 23 Apr 2020 05:35:54 AM EDT | | [idle_detection] DISPLAY ':1' is active. Thu 23 Apr 2020 05:35:54 AM EDT | | [suspend] net_susp: no; file_xfer_susp: no; reason: unknown reason Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [heartbeat] Heartbeat sent to task ARP1_0031183_009_0 ... Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [heartbeat] Heartbeat sent to task MCM1_0162393_9412_1 h] shortfall 255378.68 nidle 5.00 saturated 0.00 busy 0.00 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [work_fetch] share 0.000 Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] --- state for NVIDIA GPU --- Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] shortfall 43380.00 nidle 1.00 saturated 0.00 busy 0.00 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [work_fetch] share 0.000 no applications ... Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [prio] recent est credit: 0.06G in 2.04 sec, 707.568022 + 0.002660 ->707.570682 Thu 23 Apr 2020 05:35:54 AM EDT | | [cpu_sched_debug] schedule_cpus(): start Thu 23 Apr 2020 05:35:54 AM EDT | | [rr_sim] start: work_buf min 180 additional 43200 total 43380 on_frac 0.926 active_frac 1.000 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [prio] -1.000000 rsf 1.000000 rt 707.570682 rs 707.570682 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [rr_sim_detail] 0.00: starting SCC1_0003882_FoxO1-B_75698_0 (1.00 CPU) (16775.99G/3.42G) Thu 23 Apr 2020 05:35:54 AM EDT | | [rrsim_detail] rpbest: SCC1_0003882_FoxO1-B_75698_0 (finish delay 4901.32) Thu 23 Apr 2020 05:35:54 AM EDT | | [rrsim_detail] time-slice step of 3600.00 sec Thu 23 Apr 2020 05:35:54 AM EDT | | [rrsim_detail] rpbest: SCC1_0003882_FoxO1-B_75698_0 (finish delay 1301.32) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [rr_sim] 4901.32: SCC1_0003882_FoxO1-B_75698_0 finishes (1.00 CPU) (16775.99G/3.42G) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [prio] -1.000000 rsf 1.000000 rt 707.570682 rs 707.570682 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [cpu_sched_debug] add to run list: SCC1_0003882_FoxO1-B_75698_0 (CPU, FIFO) (prio -1.000000) Thu 23 Apr 2020 05:35:54 AM EDT | | [cpu_sched_debug] enforce_run_list(): start Thu 23 Apr 2020 05:35:54 AM EDT | | [cpu_sched_debug] preliminary job list: Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [cpu_sched_debug] 0: SCC1_0003882_FoxO1-B_75698_0 (MD: no; UTS: no) Thu 23 Apr 2020 05:35:54 AM EDT | | [cpu_sched_debug] final job list: Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [cpu_sched_debug] 0: SCC1_0003882_FoxO1-B_75698_0 (MD: no; UTS: no) Thu 23 Apr 2020 05:35:54 AM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [cpu_sched_debug] scheduling SCC1_0003882_FoxO1-B_75698_0 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] assigning slot 6 to SCC1_0003882_FoxO1-B_75698_0 Thu 23 Apr 2020 05:35:54 AM EDT | | [cpu_sched_debug] using 1.00 out of 6 CPUs Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] Request work fetch: CPUs idle Thu 23 Apr 2020 05:35:54 AM EDT | | [slot] removed file slots/6/init_data.xml Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/wcgrid_scc1_vina_7.08_x86_64-pc-linux-gnu (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/wcgrid_scc1_vina_7.08_x86_64-pc-linux-gnu to slots/6/wcgrid_scc1_vina_7.08_x86_64-pc-linux-gnu Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/wcgrid_scc1_gfx_7.08_x86_64-pc-linux-gnu (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/wcgrid_scc1_gfx_7.08_x86_64-pc-linux-gnu to slots/6/graphics_app Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image01_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image01_7.08.tga to slots/6/Courier-Bold.txf Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image02_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image02_7.08.tga to slots/6/Courier.txf Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image03_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image03_7.08.tga to slots/6/boinc_logo_sra1.tga Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image04_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image04_7.08.tga to slots/6/boy_cancer.tga Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image05_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image05_7.08.tga to slots/6/IBM_logo_white.tga Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image06_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image06_7.08.tga to slots/6/medical_center.tga Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image07_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image07_7.08.tga to slots/6/scc1_background.tga Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image08_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image08_7.08.tga to slots/6/scc1_legend.tga Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1_image09_7.08.tga (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1_image09_7.08.tga to slots/6/wcg_logo.tga Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/scc1.FoxO1-B.pdbqt (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/scc1.FoxO1-B.pdbqt to slots/6/FoxO1-B.pdbqt Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/288a245059cd893f6edc9570adf22421.job (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/288a245059cd893f6edc9570adf22421.job to slots/6/SCC1_0003882_FoxO1-B_75698.job Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/8507730610ddeb034b0519e6865a5565.zip (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/8507730610ddeb034b0519e6865a5565.zip to slots/6/SCC1_0003882_FoxO1-B_75698.zip Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/fe0b7526ab9cf19e6d210e7985ef916a.pdbqt (input) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/fe0b7526ab9cf19e6d210e7985ef916a.pdbqt to slots/6/FoxO1-B_flex.pdbqt Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | setup_file: projects/www.worldcommunitygrid.org/SCC1_0003882_FoxO1-B_75698_0_r1038022793_0 (output) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [slot] linked ../../projects/www.worldcommunitygrid.org/SCC1_0003882_FoxO1-B_75698_0_r1038022793_0 to slots/6/result.out Thu 23 Apr 2020 05:35:54 AM EDT | | [slot] removed file slots/6/boinc_temporary_exit Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [task] ACTIVE_TASK::start(): forked process: pid 3410 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [task] task_state=EXECUTING for SCC1_0003882_FoxO1-B_75698_0 from start Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | Starting task SCC1_0003882_FoxO1-B_75698_0 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [cpu_sched] Starting task SCC1_0003882_FoxO1-B_75698_0 using scc1 version 708 in slot 6 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [css] running SCC1_0003882_FoxO1-B_75698_0 ( ) Thu 23 Apr 2020 05:35:54 AM EDT | | [statefile] set dirty: enforce_cpu_schedule Thu 23 Apr 2020 05:35:54 AM EDT | | [cpu_sched_debug] enforce_run_list: end Thu 23 Apr 2020 05:35:54 AM EDT | | [poll] CLIENT_STATE::poll_slow_events(): schedule_cpus Thu 23 Apr 2020 05:35:54 AM EDT | | [rr_sim] start: work_buf min 180 additional 43200 total 43380 on_frac 0.926 active_frac 1.000 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [prio] -1.000000 rsf 1.000000 rt 707.570682 rs 707.570682 Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [rr_sim_detail] 0.00: starting SCC1_0003882_FoxO1-B_75698_0 (1.00 CPU) (16775.99G/3.42G) Thu 23 Apr 2020 05:35:54 AM EDT | | [rrsim_detail] rpbest: SCC1_0003882_FoxO1-B_75698_0 (finish delay 4901.32) Thu 23 Apr 2020 05:35:54 AM EDT | | [rrsim_detail] time-slice step of 3600.00 sec Thu 23 Apr 2020 05:35:54 AM EDT | | [rrsim_detail] rpbest: SCC1_0003882_FoxO1-B_75698_0 (finish delay 1301.32) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [rr_sim] 4901.32: SCC1_0003882_FoxO1-B_75698_0 finishes (1.00 CPU) (16775.99G/3.42G) Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [prio] -1.000000 rsf 1.000000 rt 707.570682 rs 707.570682 Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] ------- start work fetch state ------- Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] target work buffer: 180.00 + 43200.00 sec Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] --- project states --- Thu 23 Apr 2020 05:35:54 AM EDT | World Community Grid | [work_fetch] REC 707.571 prio -1.006 can't request work: some task is suspended via Manager Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetch] --- state for CPU --- Thu 23 Apr 2020 05:35:54 AM EDT | | [work_fetcGUI RPC Command = '<boinc_gui_rpc_request><get_results><active_only>0</active_only></get_results></boinc_gui_rpc_request>' Thu 23 Apr 2020 05:35:55 AM EDT | | [gui_rpc] GUI RPC reply: '<boinc_gui_rpc_reply><results><result> <name>ARP1_0031183_009_0</name> <wu_name>ARP1_0031183_009</wu_name> <platfo' Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] DISPLAY ':0' not found or insufficient access. Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] XSS idle detection succeeded on DISPLAY ':1'. Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] idle threshold: 180 Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] idle_time: 0 Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] DISPLAY ':1' is active. Thu 23 Apr 2020 05:35:55 AM EDT | | [suspend] net_susp: no; file_xfer_susp: no; reason: unknown reason Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [task] Process for SCC1_0003882_FoxO1-B_75698_0 exited, status 11, task state 1 Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [task] process got signal 11 Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [task] task_state=WAS_SIGNALED for SCC1_0003882_FoxO1-B_75698_0 from handle_exited_app Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [sched_op] Deferring communication for 00:01:21 Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [sched_op] Reason: Unrecoverable error for task SCC1_0003882_FoxO1-B_75698_0 Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [task] result state=COMPUTE_ERROR for SCC1_0003882_FoxO1-B_75698_0 from CS::report_result_error Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] cleaning out slots/6: handle_exited_app() Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/wcg_logo.tga Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/SCC1_0003882_FoxO1-B_75698.zip Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/boy_cancer.tga Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/boinc_mmap_file Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/result.out Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/medical_center.tga Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/boinc_lockfile Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/boinc_logo_sra1.tga Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/scc1_legend.tga Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/stderr.txt Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/init_data.xml Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/FoxO1-B.pdbqt Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/scc1_background.tga Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/graphics_app Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/wcgrid_scc1_vina_7.08_x86_64-pc-linux-gnu Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/Courier.txf Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/SCC1_0003882_FoxO1-B_75698.job Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/stdout.txt Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/FoxO1-B_flex.pdbqt Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/Courier-Bold.txf Thu 23 Apr 2020 05:35:55 AM EDT | | [slot] removed file slots/6/IBM_logo_white.tga Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] Request CPU reschedule: application exited Thu 23 Apr 2020 05:35:55 AM EDT | | [work_fetch] Request work fetch: application exited Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [heartbeat] Heartbeat sent to task ARP1_0031183_009_0 ... Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [heartbeat] Heartbeat sent to task MCM1_0162393_9412_1 Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [heartbeat] Heartbeat sent to task MCM1_0162394_9783_1 Thu 23 Apr 2020 05:35:55 AM EDT | | [statefile] set dirty: ACTIVE_TASK_SET::poll Thu 23 Apr 2020 05:35:55 AM EDT | | [poll] CLIENT_STATE::poll_slow_events(): active_tasks Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | Computation for task SCC1_0003882_FoxO1-B_75698_0 finished Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | Output file SCC1_0003882_FoxO1-B_75698_0_r1038022793_0 for task SCC1_0003882_FoxO1-B_75698_0 absent Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [task] result state=COMPUTE_ERROR for SCC1_0003882_FoxO1-B_75698_0 from CS::app_finished Thu 23 Apr 2020 05:35:55 AM EDT | World Community Grid | [prio] recent est credit: 0.01G in 1.04 sec, 707.570682 + 0.000058 ->707.570741 Thu 23 Apr 2020 05:35:55 AM EDT | | [statefile] set dirty: handle_finished_apps Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] Request CPU reschedule: handle_finished_apps Thu 23 Apr 2020 05:35:55 AM EDT | | [poll] CLIENT_STATE::poll_slow_events(): handle_finished_apps Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] schedule_cpus(): start Thu 23 Apr 2020 05:35:55 AM EDT | | [rr_sim] start: work_buf min 180 additional 43200 total 43380 on_frac 0.926 active_frac 1.000 Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] enforce_run_list(): start Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] preliminary job list: Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] final job list: Thu 23 Apr 2020 05:35:55 AM EDT | | [mem_usage] enforce: available RAM 24050.60MB swap 49626.40MB Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] using 0.00 out of 6 CPUs Thu 23 Apr 2020 05:35:55 AM EDT | | [work_fetch] Request work fetch: CPUs idle Thu 23 Apr 2020 05:35:55 AM EDT | | [cpu_sched_debug] enforce_run_list: end Thu 23 Apr 2020 05:35:55 AM EDT | | [statefile] Writing state file Thu 23 Apr 2020 05:35:55 AM EDT | | [statefile] Done writing state file Thu 23 Apr 2020 05:35:55 AM EDT | | [poll] CLIENT_STATE::do_something(): End poll: 2 tasks active Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] DISPLAY ':0' not found or insufficient access. Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] XSS idle detection succeeded on DISPLAY ':1'. Thu 23 Apr 2020 05:35:55 AM EDT | | [idle_detection] idle threshold: 180 Thanks a bunch all ! TJ van Muyden ![]() ![]() |
||
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
@NorthernRaider, I also run Debian 10 (Buster) and got the same exact errors when running SCC1 or HSTB tasks.
----------------------------------------See these threads: 1. Build Science Apps With Newer glibc or at lea... Virtual Syscalls [Linux] in the Suggestions and Feedback forum. 2. Linux Upgrade: Best Practices to Preserve Device ID? in the BOINC Agent Support forum. The workaround is: 1. Stop the BOINC daemon: sudo /etc/init.d/boinc-client stop 2. Edit grub: sudo nano /etc/default/grub 3. Change the line that has GRUB_CMDLINE_LINUX_DEFAULT="quiet" to GRUB_CMDLINE_LINUX_DEFAULT="quiet vsyscall=emulate" 4. Save the file and exit nano (or your favorite text editor). 5. sudo update-grub 6. Reboot: sudo reboot That temporarily puts virtual syscalls in emulation mode. Until WCG techs recompile the SCC1 and HSTB applications with the latest version of GCC for Linux, this is what we will have to do.
[Edit 3 times, last edit by hchc at May 3, 2020 10:50:47 AM] |
||
|
|
NorthernRaider
Cruncher Canada Joined: Dec 10, 2008 Post Count: 12 Status: Offline Project Badges:
|
Thanks very much HCHC !
----------------------------------------That did the job for now , and things are processing for SCC1. Have not been reading the forums , but now I did to update myself a bit. So just waiting now for the SCC1 project to redo the compilation , and then obviously take out that kernel parameter. Geez, I thought that Debian stable was slow in its updates, the projects are certainly behind... As software developer at IBM I always was the adopter (and test guinea pig) of new libraries and releases for the team, and compiling with old libraries for me is a no-no. Yeah it can cause issues with those who don't update their OS, which is also not recommended by me. I am sure glad that since 1993 I gave the boot to Windows. Thanks again for the help !! ![]() ![]() |
||
|
|
|