Total posts in this thread: 152
This topic has been viewed 32640 times and has 151 replies
Jim1348
Veteran Cruncher
USA
Joined: Jul 13, 2009
Post Count: 1066
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

The first one is running OK on a single core of an E8400 Core2Duo (Win7 64-bit). From the percentage completed, it appears that it will take about 4 hours to complete, about the same as the last round. However, the "Elapsed" and "Remaining" time indicators are all wrong, showing initially that it would complete in less than 5 minutes. The Remaining time counted down to 2 seconds, and now is going upward. But it seems to be running fine, so I will let it complete.

7.21 Beta Test BETA_BETA_9999982_0149_0 00:13:28 (00:12:31) 92.90 5.005 00:00:38 11/6/2013 1:12:43 PM Running [1] 00:03:16
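A rough way to sanity-check the "about 4 hours" figure, given that the client's Remaining estimate is unreliable here, is to extrapolate linearly from elapsed time and percent done. This is only a sketch: it assumes that 00:13:28 in the row above is the elapsed time and 5.005 is the percent complete, and that progress is roughly linear. The function names are made up for illustration.

```python
def parse_hms(text: str) -> int:
    """Convert an HH:MM:SS string to a number of seconds."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return hours * 3600 + minutes * 60 + seconds


def estimate_total_seconds(elapsed_seconds: float, progress_percent: float) -> float:
    """Linear extrapolation: total = elapsed / fraction done."""
    if progress_percent <= 0:
        raise ValueError("progress must be greater than zero to extrapolate")
    return elapsed_seconds * 100.0 / progress_percent


# Values read from the task row above (assumed column meanings).
elapsed = parse_hms("00:13:28")
total = estimate_total_seconds(elapsed, 5.005)
print(f"estimated total run time: {total / 3600:.1f} h")
```

With those two numbers the extrapolation lands at roughly four and a half hours, in line with the estimate from the percentage.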
[Nov 4, 2013 6:58:28 PM]
genhos
Veteran Cruncher
UK
Joined: Apr 26, 2009
Post Count: 1103
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

@herna - this is on Win7 64bit Home Premium.
----------------------------------------
[Nov 4, 2013 6:59:54 PM]
Mumak
Senior Cruncher
Joined: Dec 7, 2012
Post Count: 477
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

Got some errors like:

<message>
Maximum elapsed time exceeded
</message>

11/4/2013 7:55:18 PM | World Community Grid | Aborting task BETA_BETA_9999981_0537_1: exceeded elapsed time limit 3498.66 (1749948.51G/500.18G)

Another:

11/4/2013 7:49:23 PM | World Community Grid | Aborting task BETA_BETA_9999981_0252_0: exceeded elapsed time limit 3016.45 (1749948.51G/647.76G)
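For anyone collecting these abort lines, here is a small sketch that pulls out the task name, the printed limit, and the two "G" figures in parentheses. Reading the pair as a work bound divided by an estimated speed is an assumption based on the numbers, not something the log states. For the first line above the quotient agrees with the printed limit to within rounding; the second line does not divide out the same way, so treat the interpretation with caution. The field names are invented for illustration.

```python
import re

# Matches the client message quoted above; group names are hypothetical labels.
ABORT_RE = re.compile(
    r"Aborting task (?P<task>\S+): exceeded elapsed time limit "
    r"(?P<limit>[\d.]+) \((?P<bound>[\d.]+)G/(?P<speed>[\d.]+)G\)"
)


def parse_abort(line: str):
    """Return the fields of an 'exceeded elapsed time limit' line, or None."""
    match = ABORT_RE.search(line)
    if match is None:
        return None
    bound = float(match["bound"])
    speed = float(match["speed"])
    return {
        "task": match["task"],
        "limit_s": float(match["limit"]),
        "bound_g": bound,
        "speed_g": speed,
        # Assumed relationship: bound / speed, in seconds.
        "ratio_s": bound / speed,
    }


lines = [
    "11/4/2013 7:55:18 PM | World Community Grid | Aborting task BETA_BETA_9999981_0537_1: exceeded elapsed time limit 3498.66 (1749948.51G/500.18G)",
    "11/4/2013 7:49:23 PM | World Community Grid | Aborting task BETA_BETA_9999981_0252_0: exceeded elapsed time limit 3016.45 (1749948.51G/647.76G)",
]
for line in lines:
    info = parse_abort(line)
    print(info["task"], info["limit_s"], round(info["ratio_s"], 2))
```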
----------------------------------------
[Edit 2 times, last edit by Mumak at Nov 4, 2013 7:05:56 PM]
[Nov 4, 2013 7:02:41 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

I received tasks:
BETA_ BETA_ 9999982_ 0997_ 0-- 0.59 CPU hours
log: 4.11.2013 19:53:28 | World Community Grid | Aborting task BETA_BETA_9999982_0997_0: exceeded elapsed time limit 2203.99 (1749948.51G/793.99G)

BETA_ BETA_ 9999982_ 0996_ 1-- 0.59 CPU hours
log: 4.11.2013 19:53:28 | World Community Grid | Aborting task BETA_BETA_9999982_0996_1: exceeded elapsed time limit 2203.99 (1749948.51G/793.99G)

BETA_ BETA_ 9999982_ 0608_ 0-- 0.59 CPU hours
log: 4.11.2013 19:44:21 | World Community Grid | Aborting task BETA_BETA_9999982_0608_0: exceeded elapsed time limit 2203.99 (1749948.51G/793.99G)

BETA_ BETA_ 9999982_ 0832_ 1-- 0.59 CPU hours
log: 4.11.2013 19:44:21 | World Community Grid | Aborting task BETA_BETA_9999982_0832_1: exceeded elapsed time limit 2203.99 (1749948.51G/793.99G)

BETA_ BETA_ 9999986_ 0480a_ 1-- 0.29 CPU hours
log: 4.11.2013 21:20:46 | World Community Grid | Aborting task BETA_BETA_9999986_0480a_1: exceeded elapsed time limit 3492.38 (1749948.51G/501.08G)

BETA_ BETA_ 9999987_ 0459a_ 0-- 0.78 CPU hours
I tried to pause all tasks, but the BETAs kept running. After a PC reboot and a restart of BOINC, this error occurred:
http://pastebin.com/8vb6nmqs

computer:
4.11.2013 9:24:18 |  | Starting BOINC client version 7.0.64 for windows_x86_64
4.11.2013 9:24:18 | | log flags: file_xfer, sched_ops, task
4.11.2013 9:24:18 | | Libraries: libcurl/7.25.0 OpenSSL/1.0.1 zlib/1.2.6
4.11.2013 9:24:18 | | Data directory: C:\ProgramData\BOINC
4.11.2013 9:24:18 | | Running under account Michal
4.11.2013 9:24:18 | | Processor: 4 GenuineIntel Intel(R) Core(TM) i5-3570K CPU @ 3.40GHz [Family 6 Model 58 Stepping 9]
4.11.2013 9:24:18 | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 sse4_2 popcnt aes syscall nx lm vmx tm2 pbe
4.11.2013 9:24:18 | | OS: Microsoft Windows 8: Professional x64 Edition, (06.02.9200.00)
4.11.2013 9:24:18 | | Memory: 14.95 GB physical, 20.32 GB virtual
4.11.2013 9:24:18 | | Disk: 167.34 GB total, 45.14 GB free
4.11.2013 9:24:18 | | Local time is UTC +1 hours
4.11.2013 9:24:18 | | OpenCL: Intel GPU 0: Intel(R) HD Graphics 4000 (driver version 9.18.10.3165, device version OpenCL 1.2, 792MB, 792MB available, 45 GFLOPS peak)


EDIT: added tasks
----------------------------------------
[Edit 6 times, last edit by Former Member at Nov 4, 2013 9:56:01 PM]
[Nov 4, 2013 7:06:36 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

@herna - this is on Win7 64bit Home Premium.


Well, that's odd then. I haven't seen this on Windows. Some other people reported that their Betas keep running after being suspended. I also suspended mine, had a look at the Task Manager, and they kept running at 100% for a few seconds, then stopped (LAIM off). Maybe you closed BOINC before the Beta apps stopped running, so the background client was still running; you then restarted BOINC Manager and it couldn't connect to the still-active client.

Now all 4 Betas on Windows are stuck at 0.500%, with CPU times not moving at all.
On Linux, everything's fine.

Add: It looks like suspended Betas keep running until I turn off "Leave applications in memory". This time they each kept running at 100% (according to Task Manager), without any progress for minutes. With LAIM off, they are quickly unloaded, also from Task Manager.
----------------------------------------
[Edit 1 times, last edit by Former Member at Nov 4, 2013 7:16:19 PM]
[Nov 4, 2013 7:09:10 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

Is Werner VON still working in Houston? After 1 hour 3 minutes on W7-64, running 8 betas concurrently:

53499 World Community Grid 04-11-2013 20:06 Aborting task BETA_BETA_9999981_0533_1: exceeded elapsed time limit 3817.96 (1749948.51G/458.35G)
53501 World Community Grid 04-11-2013 20:06 [sched_op] Reason: Unrecoverable error for task BETA_BETA_9999981_0533_1
53502 World Community Grid 04-11-2013 20:06 Aborting task BETA_BETA_9999981_0393_1: exceeded elapsed time limit 3817.96 (1749948.51G/458.35G)

Note that I have some extra log flags set, which is why line 53501 appears.

Edit: 5 have done the same thing at 1:01 to 1:03 hours.

7.21 beta17 BETA_BETA_9999981_0536_0 01:03:38 (01:03:26) 99.699 100.000 - 04-11-2013 18:52 01d,22:38:49 Computation error 0.00 MB 0.00 MB
7.21 beta17 BETA_BETA_9999981_0533_1 01:03:38 (01:01:37) 96.846 100.000 - 04-11-2013 18:52 01d,22:38:49 Computation error 0.00 MB 0.00 MB
7.21 beta17 BETA_BETA_9999981_0393_1 01:03:38 (01:03:33) 99.867 100.000 - 04-11-2013 18:56 01d,22:43:01 Computation error 0.00 MB 0.00 MB
7.21 beta17 BETA_BETA_9999981_0432_1 01:03:38 (01:02:29) 98.216 100.000 - 04-11-2013 18:56 01d,22:43:01 Computation error 0.00 MB 0.00 MB
7.21 beta17 BETA_BETA_9999981_0384_1 01:03:38 (01:03:22) 99.591 100.000 - 04-11-2013 18:59 01d,22:46:09 Computation error 0.00 MB 0.00 MB

The other 3, from the 79/80 batches that uplinger highlighted, are still running at 1:49 hours.

The 4 on the Linux quad, all 79ers, are at 2+ hours. Each has logged 11 checkpoints so far, the last of them less than 10 minutes ago, with between 46 and 66% progress.
----------------------------------------
[Edit 1 times, last edit by Former Member at Nov 4, 2013 7:17:29 PM]
[Nov 4, 2013 7:10:29 PM]
branjo
Master Cruncher
Slovakia
Joined: Jun 29, 2012
Post Count: 1892
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

4 already errored out:

- 3 of them "exceeded elapsed time limit 2934.55 (1749949.51G/570.75G)" after 48:54 mins elapsed time (0.81 h CPU time)
- 1 of them the same error after 43:48 mins elapsed time (0.73 h CPU time)

- 1 PVal after 1.18/1.19 h

All of them on my Win7 SP1 64b, i7-3770, 7.2.26, 8GB RAM, 60GB SSD rig running only OS, BOINC and MS Security Essentials. 7 CEP2 WU's left in memory.

All 4 I received on my Mac OS X 10.9 (Mavericks), i5-2500S, 7.0.65, 12GB RAM, 1TB HDD, small SSD with OS only - my main PC with ESET CyberSecurity - are still running. 1 FAAH WU left in memory.

On both Win and Mac I am shrubbing PrimeGrid PPS SV GPGPU OpenCL AMD/ATI WU's (will not continue with them during this Beta)

ETA1:
- running 8 tasks concurrent on Win and 4 concurrent on Mac
- RAM per task on Mac: 165 - 217MB
- RAM per task on Win: 52 - 110MB
- LAIM on for both PC and rig
----------------------------------------

Crunching@Home since January 13 2000. Shrubbing@Home since January 5 2006

----------------------------------------
[Edit 2 times, last edit by branjo at Nov 4, 2013 7:41:10 PM]
[Nov 4, 2013 7:10:52 PM]
slakin
Advanced Cruncher
Joined: Jul 4, 2008
Post Count: 79
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

Yes, I just had 3 WU's all error out with the same message:

Result Log

Result Name: BETA_ BETA_ 9999982_ 0438_ 1--
<core_client_version>6.10.58</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
]]>

All of these errored after about 45 minutes of running on Windows 7 Home Premium on an Intel i5-3570 CPU at 3.4GHz.
[Nov 4, 2013 7:12:01 PM]
themoonscrescent
Veteran Cruncher
UK
Joined: Jul 1, 2006
Post Count: 1320
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

I've now had 2 Computational Errors that are unrelated to the previous system restarts:

1) Result Log

Result Name: BETA_ BETA_ 9999981_ 0042_ 1--
<core_client_version>7.0.42</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.21_windows_x86_64 -SettingsFile BETA_9999981_0042.txt -DatabaseFile dataset-GDS2771-v1.txt
Initializing
wcg_learn_limit = 500000
Running
[17:56:32]: Computing pass 0
Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.21_windows_x86_64 -SettingsFile BETA_9999981_0042.txt -DatabaseFile dataset-GDS2771-v1.txt
Initializing
wcg_learn_limit = 500000
Running
[18:23:59]: Computing pass 0
Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.21_windows_x86_64 -SettingsFile BETA_9999981_0042.txt -DatabaseFile dataset-GDS2771-v1.txt
Initializing
wcg_learn_limit = 500000
Running
[18:31:36]: Computing pass 0

</stderr_txt>
]]>

2) BETA_ BETA_ 9999981_ 0742_ 1--
<core_client_version>7.0.44</core_client_version>
<![CDATA[
<message>
- exit code -529697949 (0xe06d7363)
</message>
<stderr_txt>
uting pass 336
[18:12:35]: Computing pass 337
[18:12:35]: Computing pass 338
[18:12:36]: Computing pass 339
[18:12:37]: Computing pass 340
[18:12:37]: Computing pass 341
[18:12:38]: Computing pass 342
(Shortened)
[18:33:35]: Computing pass 1898
[18:33:36]: Computing pass 1899
[18:33:36]: Computing pass 1900
Commandline = projects/www.worldcommunitygrid.org/wcgrid_beta17_7.21_windows_intelx86 -SettingsFile BETA_9999981_0742.txt -DatabaseFile dataset-GDS2771-v1.txt
Initializing
wcg_learn_limit = 500000
Running


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x74E1C41F

Engaging BOINC Windows Runtime Debugger...



********************


BOINC Windows Runtime Debugger Version 7.1.18


Dump Timestamp : 11/04/13 19:04:50
Install Directory : C:\Program Files (x86)\BOINC\
Data Directory : C:\ProgramData\BOINC
Project Symstore :
LoadLibraryA( C:\Program Files (x86)\BOINC\\dbghelp.dll ): GetLastError = 193
Loaded Library : dbghelp.dll
LoadLibraryA( C:\Program Files (x86)\BOINC\\symsrv.dll ): GetLastError = 193
LoadLibraryA( symsrv.dll ): GetLastError = 193
LoadLibraryA( C:\Program Files (x86)\BOINC\\srcsrv.dll ): GetLastError = 193
LoadLibraryA( srcsrv.dll ): GetLastError = 193
LoadLibraryA( C:\Program Files (x86)\BOINC\\version.dll ): GetLastError = 126
Loaded Library : version.dll
Debugger Engine : 4.0.5.0
Symbol Search Path: C:\ProgramData\BOINC\slots\11;C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org


ModLoad: 0000000000910000 0000000000141000 C:\ProgramData\BOINC\projects\www.worldcommunitygrid.org\wcgrid_beta17_7.21_windows_intelx86 (-exported- Symbols Loaded)
Linked PDB Filename : c:\projects\wcgridAustinWorkspace\scienceApps\MCM1\Release\wcgrid_mcm1_prod_32.pdb

ModLoad: 0000000076f80000 0000000000180000 C:\Windows\SysWOW64\ntdll.dll (6.1.7601.18247) (-exported- Symbols Loaded)
Linked PDB Filename : wntdll.pdb
File Version : 6.1.7600.16385 (win7_rtm.090713-1255)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 6.1.7600.16385
(Shortened)
ModLoad: 0000000074fb0000 0000000000005000 C:\Windows\syswow64\PSAPI.DLL (6.1.7600.16385) (-exported- Symbols Loaded)
Linked PDB Filename : psapi.pdb
File Version : 6.1.7600.16385 (win7_rtm.090713-1255)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 6.1.7600.16385

ModLoad: 0000000069f50000 00000000000eb000 C:\Windows\system32\dbghelp.dll (6.1.7601.17514) (-exported- Symbols Loaded)
Linked PDB Filename : dbghelp.pdb
File Version : 6.1.7601.17514 (win7sp1_rtm.101119-1850)
Company Name : Microsoft Corporation
Product Name : Microsoft® Windows® Operating System
Product Version : 6.1.7601.17514



*** Dump of the Process Statistics: ***

- I/O Operations Counters -
Read: 0, Write: 0, Other 0

- I/O Transfers Counters -
Read: 0, Write: 0, Other 0

- Paged Pool Usage -
QuotaPagedPoolUsage: 0, QuotaPeakPagedPoolUsage: 0
QuotaNonPagedPoolUsage: 0, QuotaPeakNonPagedPoolUsage: 0

- Virtual Memory Usage -
VirtualSize: 0, PeakVirtualSize: 0

- Pagefile Usage -
PagefileUsage: 0, PeakPagefileUsage: 0

- Working Set Size -
WorkingSetSize: 0, PeakWorkingSetSize: 0, PageFaultCount: 0

*** Dump of thread ID 892 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

- Unhandled Exception Record -
Reason: Out Of Memory (C++ Exception) (0xe06d7363) at address 0x74E1C41F

- Registers -
eax=003cefd8 ebx=003cf710 ecx=00000003 edx=00000000 esi=00000070 edi=06800b44
eip=74e1c41f esp=003cefd8 ebp=003cf028
cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000216

- Callstack -
ChildEBP RetAddr Args to Child
003cf028 009926e9 e06d7363 00000001 00000003 003cf054 KERNELBASE!RaiseException+0x0
003cf060 00926b62 003cf070 009fe1c4 009f1250 825acd17 wcgrid_beta17_7!DeleteMulticlassModel+0x0
003cf158 0092adde 00000000 825acc07 05f489e0 067a2fc8 wcgrid_beta17_7!+0x0
003cf178 009ba0b1 003cf1a8 003cf198 05f489d0 067a2fc8 wcgrid_beta17_7!boost::archive::detail::iserializer<boost::archive::binary_iarchive,std::vector<double,std::allocator<double> > >::load_object_data+0x0
00517e30 00517d70 00517d90 00a40cc8 00000005 00000001 wcgrid_beta17_7!boost::serialization::singleton<boost::archive::detail::`anonymous namespace'::map<boost::archive::binary_iarchive> >::get_mutable_instance+0x0
00517e34 00517d90 00a40cc8 00000005 00000001 49b79326 wcgrid_beta17_7!+0x0 SymFromAddr(): GetLastError = '126' SymGetLineFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '00517d70'
00517e38 00a40cc8 00000005 00000001 49b79326 88000000 wcgrid_beta17_7!+0x0 SymFromAddr(): GetLastError = '126' SymGetLineFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '00517d90'
00517e3c 00000000 00000001 49b79326 88000000 00517e70 wcgrid_beta17_7!`boost::serialization::singleton<boost::archive::detail::iserializer<boost::archive::binary_iarchive,std::vector<double,std::allocator<double> > > >::get_instance'::`2'::t+0x0 SymFromAddr(): GetLastError = '126' SymGetLineFromAddr(): GetLastError = '126' SymGetModuleInfo(): GetLastError = '126' Address = '00a40cc8'

*** Dump of thread ID 2336 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

- Registers -
eax=00000000 ebx=00000000 ecx=00000000 edx=00000000 esi=0276fef0 edi=00000000
eip=76f9fd91 esp=0276feac ebp=0276ff14
cs=0023 ss=002b ds=002b es=002b fs=0053 gs=002b efl=00000246

- Callstack -
ChildEBP RetAddr Args to Child
0276ff14 74e24498 00000064 00000000 0276ff30 00984674 ntdll!ZwDelayExecution+0x0
0276ff24 00984674 00000064 0276ff3c 762f336a 00000000 KERNELBASE!Sleep+0x0
0276ff30 762f336a 00000000 0276ff7c 76fb9f72 00000000 wcgrid_beta17_7!DeleteMulticlassModel+0x0
0276ff3c 76fb9f72 00000000 756516dc 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0
0276ff7c 76fb9f45 00984660 00000000 00000000 00000000 ntdll!RtlInitializeExceptionChain+0x0
0276ff94 00000000 00984660 00000000 00000000 00000000 ntdll!RtlInitializeExceptionChain+0x0


*** Debug Message Dump ****


*** Foreground Window Data ***
Window Name :
Window Class :
Window Process ID: 0
Window Thread ID : 0

Exiting...

</stderr_txt>
]]>
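A note on the exit code in the second result above: the negative decimal and the hex value are the same 32-bit number, one read as signed and the other as unsigned. 0xe06d7363 is the code the Microsoft C++ runtime uses when a C++ exception surfaces as a Windows exception, which fits the "(C++ Exception)" text in the dump; its low three bytes spell "msc" in ASCII. A minimal sketch of the conversion, with helper names invented for illustration:

```python
def to_unsigned32(code: int) -> int:
    """Reinterpret a signed 32-bit exit code as its unsigned value."""
    return code & 0xFFFFFFFF


def describe(code: int):
    """Return the hex form and the ASCII text of the low three bytes."""
    unsigned = to_unsigned32(code)
    tail = unsigned.to_bytes(4, "big")[1:].decode("ascii", errors="replace")
    return f"0x{unsigned:08x}", tail


# Exit code as reported in the result log above.
print(describe(-529697949))
```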
----------------------------------------


[Nov 4, 2013 7:13:49 PM]
Former Member
Cruncher
Joined: May 22, 2018
Post Count: 0
Status: Offline
Re: New Beta Test starting Nov 4, 2013 [ Issues Thread ] Version 7.21

Not sure this is relevant as I think Keith said there were different types of unit within the test, but I picked up two on:

Processor: 2 GenuineIntel Intel(R) Core(TM)2 Duo CPU T9500 @ 2.60GHz [Family 6 Model 23 Stepping 6]
Processor: 6.00 MB cache
Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss htt tm pni ssse3 cx16 sse4_1 nx lm vmx tm2 pbe
OS: Microsoft Windows XP: Professional x86 Edition, Service Pack 3, (05.01.2600.00)
Memory: 2.98 GB physical, 2.83 GB virtual
Disk: 149.05 GB total, 19.10 GB free

and they both look fine after nearly two hours. BOINC Manager shows them both very similar, and checkpointing every few minutes. But when I look in the slots directories they are very different:

BETA_9999979_0181 shows a checkpointo.bin file of nearly 2MB that is updated every few minutes, and a stderr.txt file which is updated every couple of seconds and the last entry of which is currently:

Computing pass 2622

BETA_9999979_0329 shows no checkpointo.bin file and the last entry in the stderr.txt file is from the very start and is simply:

Running

Is this second task really checkpointing? There is a file called "wcg_checkpoint_00.ckp" and another called "checkpoint.evf" that both get updated every few minutes, but neither seems to contain anything that looks like it would take two hours to calculate.

I haven't had the courage to try suspending them yet, I'm afraid.
[Nov 4, 2013 7:19:49 PM]