| Index | Recent Threads | Unanswered Threads | Who's Active | Guidelines | Search |
| World Community Grid Forums
|
| No member browsing this thread |
|
Thread Status: Active Total posts in this thread: 14
|
|
| Author |
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
GitHub Issue #3149 on BOINC/boinc
----------------------------------------I created an issue on the BOINC GitHub repo since I wasn't sure if this bug is related to WCG/MIP or to BOINC in general. There are screenshots in the issue above. Describe the bug When watching the graphics of a task via the "Show Graphics" button, when the task reaches 100% completion, a domino effect of errors of subsequent tasks will occur until the graphics window is closed. I am unsure if this is a BOINC issue or a project specific issue. I am able to re-create this in World Community Grid's Microbiome Immunity Project application. Steps To Reproduce
Expected behavior I probably expect that when a work unit reaches 100% completion for the graphics window to automatically close, requiring the user to select a new work unit to watch if so desired. System Information
Additional context BOINC is configured to run as a service. I don't know if this is relevant to this bug.
[Edit 3 times, last edit by hchc at May 16, 2019 6:56:12 AM] |
||
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
Result Log
----------------------------------------Result Name: MIP1_ 00189279_ 0839_ 0-- <core_client_version>7.14.2</core_client_version> <![CDATA[ <message> (unknown error) - exit code -1073741819 (0xc0000005)</message> <stderr_txt> [2019- 5-16 2:21: 4:] :: BOINC:: Initializing ... ok. [2019- 5-16 2:21: 4:] :: BOINC :: boinc_init() INFO: result number = 0 BOINC:: Setting up shared resources ... ok. failed to create shared mem segment: minirosetta Size: 25057688 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x012E8F10 write attempt to address 0x017D7EC9 Engaging BOINC Windows Runtime Debugger... ******************** BOINC Windows Runtime Debugger Version 7.7.0 Dump Timestamp : 05/16/19 02:21:05 Install Directory : Data Directory : C:\ProgramData\BOINC Project Symstore : LoadLibraryA( C:\ProgramData\BOINC\dbghelp.dll ): GetLastError = 126 Loaded Library : dbghelp.dll LoadLibraryA( C:\ProgramData\BOINC\symsrv.dll ): GetLastError = 126 LoadLibraryA( symsrv.dll ): GetLastError = 126 LoadLibraryA( C:\ProgramData\BOINC\srcsrv.dll ): GetLastError = 126 LoadLibraryA( srcsrv.dll ): GetLastError = 126 LoadLibraryA( C:\ProgramData\BOINC\version.dll ): GetLastError = 126 Loaded Library : version.dll Debugger Engine : 4.0.5.0 Symbol Search Path: C:\ProgramData\BOINC\slots\1;C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org ModLoad: 0000000000860000 000000000342a000 C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org\wcgrid_mip1_rosetta_7.16_windows_intelx86 (-exported- Symbols Loaded) Linked PDB Filename : ModLoad: 0000000076f20000 000000000019c000 C:\Windows\SYSTEM32\ntdll.dll (6.2.17763.475) (-exported- Symbols Loaded) Linked PDB Filename : wntdll.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000074b90000 00000000000e0000 C:\Windows\System32\KERNEL32.DLL (6.2.17763.475) (-exported- Symbols Loaded) Linked PDB Filename : wkernel32.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000075a80000 00000000001fa000 C:\Windows\System32\KERNELBASE.dll (6.2.17763.475) (-exported- Symbols Loaded) Linked PDB Filename : wkernelbase.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000075e20000 000000000005f000 C:\Windows\System32\WS2_32.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : ws2_32.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 00000000745b0000 00000000000bf000 C:\Windows\System32\RPCRT4.dll (6.2.17763.379) (-exported- Symbols Loaded) Linked PDB Filename : wrpcrt4.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000074590000 0000000000020000 C:\Windows\System32\SspiCli.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : wsspicli.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000074580000 000000000000a000 C:\Windows\System32\CRYPTBASE.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : cryptbase.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000076ba0000 0000000000062000 C:\Windows\System32\bcryptPrimitives.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : bcryptprimitives.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 00000000747f0000 0000000000079000 C:\Windows\System32\sechost.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : sechost.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000076d70000 0000000000199000 C:\Windows\System32\USER32.dll (6.2.17763.168) (-exported- Symbols Loaded) Linked PDB Filename : wuser32.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000075e80000 0000000000017000 C:\Windows\System32\win32u.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : wwin32u.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000075ea0000 0000000000023000 C:\Windows\System32\GDI32.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : wgdi32.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000074d90000 0000000000167000 C:\Windows\System32\gdi32full.dll (6.2.17763.475) (-exported- Symbols Loaded) Linked PDB Filename : wgdi32full.pdb File Version : 10.0.17763.475 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.475 ModLoad: 00000000761e0000 0000000000080000 C:\Windows\System32\msvcp_win.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : msvcp_win.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000075950000 0000000000122000 C:\Windows\System32\ucrtbase.dll (6.2.17763.404) (-exported- Symbols Loaded) Linked PDB Filename : ucrtbase.pdb File Version : 10.0.17763.404 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.404 ModLoad: 0000000074670000 000000000007e000 C:\Windows\System32\ADVAPI32.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : advapi32.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000076cb0000 00000000000c0000 C:\Windows\System32\msvcrt.dll (7.0.17763.475) (-exported- Symbols Loaded) Linked PDB Filename : msvcrt.pdb File Version : 7.0.17763.475 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 7.0.17763.475 ModLoad: 00000000702e0000 0000000000029000 C:\Windows\SYSTEM32\ntmarta.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : ntmarta.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000073960000 000000000018f000 C:\Windows\SYSTEM32\dbghelp.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : dbghelp.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 ModLoad: 0000000073f70000 0000000000008000 C:\Windows\SYSTEM32\version.dll (6.2.17763.1) (-exported- Symbols Loaded) Linked PDB Filename : version.pdb File Version : 10.0.17763.1 (WinBuild.160101.0800) Company Name : Microsoft Corporation Product Name : Microsoft® Windows® Operating System Product Version : 10.0.17763.1 *** Dump of the Process Statistics: *** - I/O Operations Counters - Read: 5, Write: 0, Other 442 - I/O Transfers Counters - Read: 0, Write: 113, Other 0 - Paged Pool Usage - QuotaPagedPoolUsage: 159000, QuotaPeakPagedPoolUsage: 159000 QuotaNonPagedPoolUsage: 8624, QuotaPeakNonPagedPoolUsage: 8624 - Virtual Memory Usage - VirtualSize: 153366528, PeakVirtualSize: 153366528 - Pagefile Usage - PagefileUsage: 45871104, PeakPagefileUsage: 45871104 - Working Set Size - WorkingSetSize: 27680768, PeakWorkingSetSize: 27684864, PageFaultCount: 6895 *** Dump of thread ID 5252 (state: Waiting): *** - Information - Status: Wait Reason: UserRequest, , Kernel Time: 156250.000000, User Time: 468750.000000, Wait Time: 5805242.000000 - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x012E8F10 write attempt to address 0x017D7EC9 - Registers - eax=00000000 ebx=04f4f960 ecx=810676a6 edx=00000000 esi=00000000 edi=00d5219e eip=012e8f10 esp=049ed0e4 ebp=049ffb6c cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00010206 - Callstack - ChildEBP RetAddr Args to Child 049ffb6c 00d4ddfb 81075872 00d5219e 00d5219e 00000000 wcgrid_mip1_rosetta_7!cppdb::backend::statements_cache::active+0x0 049ffe6c 00d52121 0000000e 04f4f960 04f4b440 810758aa wcgrid_mip1_rosetta_7!cppdb::atomic_counter::get+0x0 049ffeb4 74bb0419 03ff4000 74bb0400 049fff20 76f8662d wcgrid_mip1_rosetta_7!cppdb::atomic_counter::get+0x0 049ffec4 76f8662d 03ff4000 610958d6 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 049fff20 76f865fd ffffffff 76fa51cd 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 049fff30 00000000 00d5219e 03ff4000 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 *** Dump of thread ID 5104 (state: Waiting): *** - Information - Status: Wait Reason: EventPairLow, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 5805233.000000 - Registers - eax=00000000 ebx=04f4af20 ecx=00000000 edx=00000000 esi=04f4ad60 edi=04f49328 eip=76f9216c esp=05f1f708 ebp=05f1f8c4 cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000202 - Callstack - ChildEBP RetAddr Args to Child 05f1f8c4 74bb0419 04f49328 74bb0400 05f1f930 76f8662d ntdll!NtWaitForWorkViaWorkerFactory+0x0 05f1f8d4 76f8662d 04f49328 60675ec6 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 05f1f930 76f865fd ffffffff 76fa51cd 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 05f1f940 00000000 76f6e230 04f49328 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 *** Dump of thread ID 6792 (state: Waiting): *** - Information - Status: Wait Reason: EventPairLow, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 5805233.000000 - Registers - eax=00000000 ebx=04f4b820 ecx=00000000 edx=00000000 esi=04f4b660 edi=04f49328 eip=76f9216c esp=0691fc0c ebp=0691fdc8 cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000206 - Callstack - ChildEBP RetAddr Args to Child 0691fdc8 74bb0419 04f49328 74bb0400 0691fe34 76f8662d ntdll!NtWaitForWorkViaWorkerFactory+0x0 0691fdd8 76f8662d 04f49328 630759c2 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 0691fe34 76f865fd ffffffff 76fa51cd 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 0691fe44 00000000 76f6e230 04f49328 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 *** Dump of thread ID 10876 (state: Waiting): *** - Information - Status: Wait Reason: EventPairLow, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 5805240.000000 - Registers - eax=00000000 ebx=04f4fd40 ecx=00000000 edx=00000000 esi=04f4fb80 edi=04f49328 eip=76f9216c esp=0731f880 ebp=0731fa3c cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000212 - Callstack - ChildEBP RetAddr Args to Child 0731fa3c 74bb0419 04f49328 74bb0400 0731faa8 76f8662d ntdll!NtWaitForWorkViaWorkerFactory+0x0 0731fa4c 76f8662d 04f49328 62a75d5e 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 0731faa8 76f865fd ffffffff 76fa51cd 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 0731fab8 00000000 76f6e230 04f49328 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 *** Dump of thread ID 2976 (state: Waiting): *** - Information - Status: Wait Reason: ExecutionDelay, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 5805238.000000 - Registers - eax=01928ca0 ebx=08bffce8 ecx=00000000 edx=00000000 esi=00000000 edi=08bffce8 eip=76f907ec esp=08bffca8 ebp=08bffd0c cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000202 - Callstack - ChildEBP RetAddr Args to Child 08bffd0c 75b8dbdf 00000064 00000000 08bfff48 01928ccb ntdll!ZwDelayExecution+0x0 08bffd1c 01928ccb 00000064 01928ca0 01928ca0 00000000 KERNELBASE!Sleep+0x0 08bfff48 74bb0419 00000000 74bb0400 08bfffb4 76f8662d wcgrid_mip1_rosetta_7!cppdb::backend::statements_cache::active+0x0 08bfff58 76f8662d 00000000 6d295842 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 08bfffb4 76f865fd ffffffff 76fa51cd 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 08bfffc4 00000000 01928ca0 00000000 00000000 00000000 ntdll!RtlGetAppContainerNamedObjectPath+0x0 *** Debug Message Dump **** *** Foreground Window Data *** Window Name : Window Class : Window Process ID: 0 Window Thread ID : 0 Exiting... </stderr_txt> ]]>
|
||
|
|
Former Member
Cruncher Joined: May 22, 2018 Post Count: 0 Status: Offline |
Little doubt it's MIP. Have to suspend all tasks, running or ready to, before booting and only allow resume manually, one by one, well after boot is complete. Else the MIP tasks will crash in 30 seconds, something BOINC is programmed to do if there is no communication between client and task app.
BOINCTasks has a feature to automatically suspend tasks when reaching a checkpoint, when planning a boot for instance, the safest moment to interrupt them. No other science I'm aware of needs this caretaking. |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
We are testing it to see if we can recreate the issue.
Thanks, -Uplinger |
||
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
Thanks Uplinger.
----------------------------------------I tried re-creating the issue on the same machine with OpenZika 7.08 tasks, and it didn't happen. When the task reached 100%, the next one started. After about 10 seconds or so, the graphics window gracefully closed itself. So far it does seem to just be MIP 7.16 tasks. I might update and close the issue on GitHub as this may not be a BOINC bug.
[Edit 1 times, last edit by hchc at May 19, 2019 4:20:26 AM] |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
Greetings,
We tested for this over the weekend. We were not able to recreate the issue. We tested it with two projects, including MIP1. Does this machine go into screen saver mode ever? If so, are you leaving the window open after it goes into screen saver mode? In our test we didn't have a workunit nearing completion, so we opened the graphics window and let it stay open while walking away, fully expecting it to throw errors until we came back, this was not the case. I will see if we can catch one without the walk away method. Thanks, -Uplinger |
||
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
This Win10 machine has the "Blank" Screensaver set to turn on after 5 minutes, but I've never bothered watching the WCG graphics and then walking away, so I can't troubleshoot one way or another with what you tried over the weekend.
----------------------------------------I pretty much look at an MIP1 work unit that is close to completion then click "Show Graphics," then when it hits 100% and starts the next task, it reliably starts throwing errors for MIP1 tasks. I'll try to test on another W10 machine where BOINC isn't running as a service.
[Edit 1 times, last edit by hchc at May 21, 2019 10:54:01 AM] |
||
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
Further testing:
----------------------------------------1. Windows 7 Pro Service Pack 1 BOINC 7.14.2, not installed as a service MIP 7.16 Issue did NOT occur. Graphics window gracefully closed about 10 seconds after the task reached 100%. 2. Windows 10 Pro 1803 BOINC 7.14.2, not installed as a service MIP 7.16 Issue did NOT occur. Graphics window gracefully closed about 10 seconds after the task reached 100%. 3. Windows 10 Pro 1809 (the original system that I talked about in the bug report) BOINC 7.14.2, installed as a service MIP 7.16 Issue DID recur reliably. Domino effect of all subsequent MIP units error'd out. I'm wondering if this is related to BOINC running as a service under the "boinc_master" account and something to do with the way MIP 7.16 shares files among a batch of work units. Edit: Let me know if I can provide any more testing or logs or anything.
[Edit 1 times, last edit by hchc at May 23, 2019 2:02:56 AM] |
||
|
|
uplinger
Former World Community Grid Tech Joined: May 23, 2005 Post Count: 3952 Status: Offline Project Badges:
|
Greetings,
We have still yet to be able to recreate the issue. We are going to reinstall BOINC as a service to see if that is what is causing it. But I would not imagine that would be the case, I'm not ruling it out though since you are encountering it only on your service install machine. I am curious, have you checked to see if that machine has any updates to the graphics drivers on the machine? Thanks, -Uplinger |
||
|
|
hchc
Veteran Cruncher USA Joined: Aug 15, 2006 Post Count: 865 Status: Offline Project Badges:
|
Greetings, We have still yet to be able to recreate the issue. We are going to reinstall BOINC as a service to see if that is what is causing it. But I would not imagine that would be the case, I'm not ruling it out though since you are encountering it only on your service install machine. I am curious, have you checked to see if that machine has any updates to the graphics drivers on the machine? Thanks, -Uplinger Yep, it has the latest Intel drivers for the Integrated GPU. It's an i5-4590 (Haswell). That same machine as no issues with Zika, Mapping Cancer Markers, etc. It's only on a string of MIP tasks and only if the graphics window is open when a task completes. I'm not using 3rd party Antivirus/Antimalware, just the built-in Windows Defender. Also of note, BOINC is installed as a service, but I also log in as a Standard User, not as Administrator. I think some of the error logs said something about write permissions? (Or is it RAM?) BOINC:: Setting up shared resources ... ok. failed to create shared mem segment: minirosetta Size: 25057688 Unhandled Exception Detected... - Unhandled Exception Record - Reason: Access Violation (0xc0000005) at address 0x012E8F10 write attempt to address 0x017D7EC9 Engaging BOINC Windows Runtime Debugger... I'll try logging in as Administrator to see if the errors happen. And honestly, I pretty much never look at the graphics for a project (only new projects), so this issue is fairly minor to me. I can live with not staring at a MIP work unit finish at least until the project completes. It does stink that the errors happen, since it just kick the tasks back to another user to complete. Are there any logs or debugging I need to turn on in BOINC to capture?
[Edit 2 times, last edit by hchc at May 25, 2019 9:26:25 AM] |
||
|
|
|