Index  | Recent Threads  | Unanswered Threads  | Who's Active  | Guidelines  | Search
 

Quick Go »
No member browsing this thread
Thread Status: Active
Total posts in this thread: 8
[ Jump to Last Post ]
Post new Thread
Author
Previous Thread This topic has been viewed 1071 times and has 7 replies Next Thread
jimmyouyang123
Cruncher
Joined: Aug 29, 2016
Post Count: 4
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Computation error for 90 percent of the task

Anyone know what might cause it. The PC is new so it shouldn't be a problem. Everytime the job start and 2 seconds later it failed immediately.
[Jun 30, 2023 5:48:06 AM]   Link   Report threatening or abusive post: please login first  Go to top 
bfmorse
Senior Cruncher
US
Joined: Jul 26, 2009
Post Count: 294
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation error for 90 percent of the task

Please post appropriate lines from your Event Log - it will allow us to give you better advice.
[Jun 30, 2023 5:54:11 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Mike.Gibson
Ace Cruncher
England
Joined: Aug 23, 2007
Post Count: 12132
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation error for 90 percent of the task

And/or the error log from results page.

Mike
[Jun 30, 2023 2:46:29 PM]   Link   Report threatening or abusive post: please login first  Go to top 
jimmyouyang123
Cruncher
Joined: Aug 29, 2016
Post Count: 4
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation error for 90 percent of the task

And/or the error log from results page.

Mike



Is there actual log for the WCG job? If yes where is it stored?
[Jul 2, 2023 4:05:52 AM]   Link   Report threatening or abusive post: please login first  Go to top 
jimmyouyang123
Cruncher
Joined: Aug 29, 2016
Post Count: 4
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation error for 90 percent of the task

I tried to post image screenshot but I guess the image don't show? But basically all job ran for 2-3 seconds and then failed
[Jul 2, 2023 4:10:00 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Aperture_Science_Innovators
Advanced Cruncher
United States
Joined: Jul 6, 2009
Post Count: 139
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation error for 90 percent of the task

And/or the error log from results page.

Mike



Is there actual log for the WCG job? If yes where is it stored?

The easiest approach is to go to your Results Status page, and then click on the Error link that shows up under the Status column.

I don't have any Errored WUs to show an example of, but if I click on one of my Valid ones, I get this:


<core_client_version>7.22.2</core_client_version>
<![CDATA[
<stderr_txt>
INFO: result number = 1
INFO: No state to restore. Start from the beginning.
[07:12:46] Number of tasks = 1
[07:12:46] Running task 0,CPU time at start of task 0 was 0.000000
[07:12:46] ./cmpd-3292626.pdbqt size = 27 4 ../../projects/www.worldcommunitygrid.org/scc1.MyoD1-A.pdbqt size = 1313 0
[09:10:47] Finished task #0 cpu time used 6285.000000
09:10:47 (8644): called boinc_finish(0)

</stderr_txt>
]]>


That's the kind if information Mike was asking for.
----------------------------------------

[Jul 2, 2023 1:31:01 PM]   Link   Report threatening or abusive post: please login first  Go to top 
jimmyouyang123
Cruncher
Joined: Aug 29, 2016
Post Count: 4
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation error for 90 percent of the task

<core_client_version>7.20.2</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 3765269347 (0xe06d7363)</message>
<stderr_txt>
Commandline = projects/www.worldcommunitygrid.org/wcgrid_mcm1_map_7.61_windows_x86_64 -SettingsFile MCM1_0201122_0831.txt -DatabaseFile dataset-sarc1.txt
Settings File
DateOfDesign = 20200218
Designer = Krembil/cubes
WorkOrderID = 0201122_0831
DatasetID = sarc1
RSeed = 407940832
StartingGeneSignatureAlgorithm = randomFixedLengthSearch
RunPermutationAlgorithm = 0
FitnessFn = 0
NumberOfGenesInStartingSignature = 20
NumberOfGenesInSignatureMin = 20
NumberOfGenesInSignatureMax = 20
SearchAlgorithmNumberToCreate = 12071
MinFitness = 0.497
VMethod = NFCV
NFolds = 20
SvmArgs = "-v 0 -t 0 -c 1000"
SvmLearnLimit = 250000



[22:45:01] Initializing


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x00007FF970F9CF19

Engaging BOINC Windows Runtime Debugger...



********************


BOINC Windows Runtime Debugger Version 7.5.0


Dump Timestamp : 06/29/23 22:45:03
Install Directory : C:\Program Files\BOINC\
Data Directory : C:\ProgramData\BOINC
Project Symstore :
LoadLibraryA( C:\Program Files\BOINC\\dbghelp.dll ): GetLastError = 126
Loaded Library : dbghelp.dll
LoadLibraryA( C:\Program Files\BOINC\\symsrv.dll ): GetLastError = 126
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( C:\Program Files\BOINC\\srcsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126
LoadLibraryA( C:\Program Files\BOINC\\version.dll ): GetLastError = 126
Loaded Library : version.dll
Debugger Engine : 4.0.5.0
Symbol Search Path: C:\ProgramData\BOINC\slots\4;C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org


ModLoad: 00000000a9dc0000 00000000001ab000 C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org\wcgrid_mcm1_map_7.61_windows_x86_64 (-exported- Symbols Loaded)
Linked PDB Filename : C:\Projects\workspace\scienceApps\MCM1\x64\Release\wcgrid_mcm1_prod_64.pdb

ModLoad: 0000000073590000 00000000001f8000 C:\Windows\SYSTEM32\ntdll.dll (6.2.19041.3086) (-exported- Symbols Loaded)
Linked PDB Filename : ntdll.pdb
File Version : 10.0.19041.2788 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2788

ModLoad: 00000000730a0000 00000000000bf000 C:\Windows\System32\KERNEL32.DLL (6.2.19041.3031) (-exported- Symbols Loaded)
Linked PDB Filename : kernel32.pdb
File Version : 10.0.19041.2788 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2788

ModLoad: 0000000070f70000 00000000002f6000 C:\Windows\System32\KERNELBASE.dll (6.2.19041.3086) (-exported- Symbols Loaded)
Linked PDB Filename : kernelbase.pdb
File Version : 10.0.19041.2788 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2788

ModLoad: 0000000071c20000 00000000000af000 C:\Windows\System32\ADVAPI32.dll (6.2.19041.2913) (-exported- Symbols Loaded)
Linked PDB Filename : advapi32.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 0000000072f50000 000000000009e000 C:\Windows\System32\msvcrt.dll (7.0.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : msvcrt.pdb
File Version : 7.0.19041.546 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 7.0.19041.546

ModLoad: 000000006aab0000 000000000000a000 C:\Windows\SYSTEM32\VERSION.dll (6.2.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : version.pdb
File Version : 10.0.19041.546 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.546

ModLoad: 00000000724a0000 000000000009c000 C:\Windows\System32\sechost.dll (6.2.19041.2913) (-exported- Symbols Loaded)
Linked PDB Filename : sechost.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 0000000073160000 0000000000126000 C:\Windows\System32\RPCRT4.dll (6.2.19041.2965) (-exported- Symbols Loaded)
Linked PDB Filename : rpcrt4.pdb
File Version : 10.0.19041.2788 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2788

ModLoad: 0000000071cd0000 0000000000744000 C:\Windows\System32\SHELL32.dll (6.2.19041.3031) (-exported- Symbols Loaded)
Linked PDB Filename : shell32.pdb
File Version : 10.0.19041.964 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.964

ModLoad: 0000000070d90000 000000000009d000 C:\Windows\System32\msvcp_win.dll (6.2.19041.789) (-exported- Symbols Loaded)
Linked PDB Filename : msvcp_win.pdb
File Version : 10.0.19041.789 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.789

ModLoad: 0000000071360000 0000000000100000 C:\Windows\System32\ucrtbase.dll (6.2.19041.789) (-exported- Symbols Loaded)
Linked PDB Filename : ucrtbase.pdb
File Version : 10.0.19041.789 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.789

ModLoad: 0000000073290000 000000000019d000 C:\Windows\System32\USER32.dll (6.2.19041.2788) (-exported- Symbols Loaded)
Linked PDB Filename : user32.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 0000000071270000 0000000000022000 C:\Windows\System32\win32u.dll (6.2.19041.3086) (-exported- Symbols Loaded)
Linked PDB Filename : win32u.pdb
File Version : 10.0.19041.3086 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.3086

ModLoad: 0000000072830000 000000000002c000 C:\Windows\System32\GDI32.dll (6.2.19041.2913) (-exported- Symbols Loaded)
Linked PDB Filename : gdi32.pdb
File Version : 10.0.19041.2913 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2913

ModLoad: 0000000070c70000 0000000000115000 C:\Windows\System32\gdi32full.dll (6.2.19041.2913) (-exported- Symbols Loaded)
Linked PDB Filename : gdi32full.pdb
File Version : 10.0.19041.2913 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2913

ModLoad: 0000000072ec0000 0000000000030000 C:\Windows\System32\IMM32.DLL (6.2.19041.2673) (-exported- Symbols Loaded)
Linked PDB Filename : imm32.pdb
File Version : 10.0.19041.2673 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2673

ModLoad: 000000006fdf0000 0000000000033000 C:\Windows\SYSTEM32\ntmarta.dll (6.2.19041.546) (-exported- Symbols Loaded)
Linked PDB Filename : ntmarta.pdb
File Version : 10.0.19041.1 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.1

ModLoad: 000000005d930000 00000000001e4000 C:\Windows\SYSTEM32\dbghelp.dll (6.2.19041.867) (-exported- Symbols Loaded)
Linked PDB Filename : dbghelp.pdb
File Version : 10.0.19041.867 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.867

ModLoad: 0000000070e30000 0000000000082000 C:\Windows\System32\bcryptPrimitives.dll (6.2.19041.2486) (-exported- Symbols Loaded)
Linked PDB Filename : bcryptprimitives.pdb
File Version : 10.0.19041.2486 (WinBuild.160101.0800)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 10.0.19041.2486



*** Dump of the Process Statistics: ***

- I/O Operations Counters -
Read: 13071, Write: 841, Other 87

- I/O Transfers Counters -
Read: 106979912, Write: 873, Other 8904

- Paged Pool Usage -
QuotaPagedPoolUsage: 106512, QuotaPeakPagedPoolUsage: 106512
QuotaNonPagedPoolUsage: 8696, QuotaPeakNonPagedPoolUsage: 8696

- Virtual Memory Usage -
VirtualSize: 143712256, PeakVirtualSize: 240951296

- Pagefile Usage -
PagefileUsage: 143712256, PeakPagefileUsage: 357412864

- Working Set Size -
WorkingSetSize: 147083264, PeakWorkingSetSize: 328269824, PageFaultCount: 182717

*** Dump of thread ID 3076 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x00007FF970F9CF19

- Registers -
rax=000000000a1d0000 rbx=00000000006fee60 rcx=0000000002690000 rdx=000000000ad47000 rsi=0000000000005b74 rdi=00000000a9f13900
r8=0000000000042010 r9=0000000000004201 r10=0000000002690150 r11=000000000ad47000 r12=0000000000000000 r13=0000000000005b7a
r14=0000000000000000 r15=0000000002698a48 rip=0000000070f9cf19 rsp=00000000006feab0 rbp=0000000002698820
cs=0033 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000206

- Callstack -
ChildEBP RetAddr Args to Child
006feb80 a9e65692 006fee60 a9de295b 006fee60 006fee60 KERNELBASE!RaiseException+0x0
006febf0 a9dfb061 a9dc0000 006fed50 00000000 00000000 wcgrid_mcm1_map_7!DeleteMulticlassModel+0x0
006ff0d0 a9de3efb 006ff180 006ff120 02697df0 02697e38 wcgrid_mcm1_map_7!boost::serialization::singleton<boost::archive::detail::iserializer<boost::archive::binary_iarchive,std::vector<bool,std::allocator<bool> > > >::get_const_instance+0x0
006ff2d0 a9de3a66 02697df0 02697df0 006ff490 006ff560 wcgrid_mcm1_map_7!boost::serialization::singleton<boost::archive::detail::iserializer<boost::archive::binary_iarchive,std::vector<bool,std::allocator<bool> > > >::get_const_instance+0x0
006ff410 a9e0518a 02690780 00000005 00000005 02697df0 wcgrid_mcm1_map_7!boost::serialization::singleton<boost::archive::detail::iserializer<boost::archive::binary_iarchive,std::vector<bool,std::allocator<bool> > > >::get_const_instance+0x0
006ff7b0 a9e638bb 00000000 00000000 00000000 00000000 wcgrid_mcm1_map_7!boost::serialization::singleton<boost::archive::detail::iserializer<boost::archive::binary_iarchive,std::vector<bool,std::allocator<bool> > > >::get_const_instance+0x0
006ff7f0 730b7614 00000000 00000000 00000000 00000000 wcgrid_mcm1_map_7!DeleteMulticlassModel+0x0
006ff820 735e26f1 00000000 00000000 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0
006ff8a0 00000000 00000000 00000000 00000000 00000000 ntdll!RtlUserThreadStart+0x0

*** Dump of thread ID 32761 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 6.000000, User Time: 0.000000, Wait Time: 48967960.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000

- Callstack -
ChildEBP RetAddr Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !bool,std::allocator<bool> > > >::get_const_instance+0x0

*** Dump of thread ID 31042326 (state: Unknown): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 17179869184.000000, User Time: 21474836480.000000, Wait Time: 0.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000 ss=0000 ds=0000 es=0000 fs=0000 gs=0000 efl=00000000

- Callstack -
ChildEBP RetAddr Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !bool,std::allocator<bool> > > >::get_const_instance+0x0


*** Debug Message Dump ****


*** Foreground Window Data ***
Window Name :
Window Class :
Window Process ID: 0
Window Thread ID : 0

Exiting...

</stderr_txt>
]]>


seem like OOM error.
[Jul 3, 2023 12:31:26 AM]   Link   Report threatening or abusive post: please login first  Go to top 
Sgt.Joe
Ace Cruncher
USA
Joined: Jul 4, 2006
Post Count: 7578
Status: Offline
Project Badges:
Reply to this Post  Reply with Quote 
Re: Computation error for 90 percent of the task

Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x00007FF970F9CF19

There is your reason: Not enough memory
Cheers
----------------------------------------
Sgt. Joe
*Minnesota Crunchers*
[Jul 3, 2023 3:12:22 AM]   Link   Report threatening or abusive post: please login first  Go to top 
[ Jump to Last Post ]
Post new Thread