computation errors

Message boards : Number crunching : computation errors

XMARK
Joined: 16 Jan 17
Posts: 3
Credit: 3,423,784
RAC: 0
Message 104870 - Posted: 17 Feb 2022, 18:48:58 UTC

computation error on all tasks with movingstub_minimize
ID: 104870 · Rating: 0
Falconet

Joined: 9 Mar 09
Posts: 353
Credit: 1,227,479
RAC: 917
Message 104871 - Posted: 17 Feb 2022, 19:21:06 UTC - in response to Message 104870.  

Looks like a bad batch.
ID: 104871 · Rating: 0
googloo
Joined: 15 Sep 06
Posts: 133
Credit: 22,722,686
RAC: 3,784
Message 104874 - Posted: 17 Feb 2022, 19:59:50 UTC
Last modified: 17 Feb 2022, 20:00:26 UTC

Same here.
ID: 104874 · Rating: 0
Sid Celery

Joined: 11 Feb 08
Posts: 2125
Credit: 41,245,383
RAC: 9,571
Message 104902 - Posted: 18 Feb 2022, 4:17:12 UTC - in response to Message 104870.  

computation error on all tasks with movingstub_minimize

Only just seen them. Reported.
Hopefully they get cleared up quickly
ID: 104902 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1683
Credit: 17,917,681
RAC: 22,352
Message 104907 - Posted: 18 Feb 2022, 7:03:13 UTC

A new record: 100% failure rate so far.
I've stopped manually updating & will just wait for the Manager to clear its timeouts. No point downloading more work just for all of it to error out.
Grant
Darwin NT
ID: 104907 · Rating: 0
Bryn Mawr

Joined: 26 Dec 18
Posts: 393
Credit: 12,113,928
RAC: 4,486
Message 104909 - Posted: 18 Feb 2022, 7:16:16 UTC
Last modified: 18 Feb 2022, 7:21:26 UTC

So what am I doing that’s different?

So far I have a 0% error rate on movingstub_gzm1_minimize tasks, e.g. https://boinc.bakerlab.org/rosetta/result.php?resultid=1468558423. Are there 2 different batches?
ID: 104909 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1683
Credit: 17,917,681
RAC: 22,352
Message 104913 - Posted: 18 Feb 2022, 7:41:38 UTC - in response to Message 104909.  
Last modified: 18 Feb 2022, 7:42:44 UTC

So what am I doing that’s different?

So far I have a 0% error rate on movingstub tasks - are there 2 different batches?
Maybe (hell, I hope so; having 2 million dud Tasks will certainly screw things up).
movingstub_gzm1_minimize_3CL_AVLstub_ tasks: 100% failure rate for me.

<core_client_version>7.16.11</core_client_version>
<![CDATA[
<message>
(unknown error) - exit code 3221225477 (0xc0000005)</message>
<stderr_txt>
command: projects/boinc.bakerlab.org_rosetta/rosetta_4.20_windows_x86_64.exe @movingstub_gzm1_minimize_3CL_AVLstub_0184_130_extract_B.flags -nstruct 10000 -cpu_run_time 28800 -boinc:max_nstruct 20000 -checkpoint_interval 120 -mute all -database minirosetta_database -in::file::zip minirosetta_database.zip -boinc::watchdog -boinc::cpu_run_timeout 36000 -run::rng mt19937 -constant_seed -jran 1711459
Using database: database_357d5d93529_n_methylminirosetta_database


Unhandled Exception Detected...

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x0000000000000004 

Engaging BOINC Windows Runtime Debugger...



********************


BOINC Windows Runtime Debugger Version 7.9.0


Dump Timestamp    : 02/18/22 16:28:21
Install Directory : C:\Program Files\BOINC
Data Directory    : C:\ProgramData\BOINC
Project Symstore  : https://boinc.bakerlab.org/rosetta/symstore
LoadLibraryA( C:\ProgramData\BOINC\dbghelp.dll ): GetLastError = 126
Loaded Library    : dbghelp.dll
LoadLibraryA( C:\ProgramData\BOINC\symsrv.dll ): GetLastError = 126
LoadLibraryA( symsrv.dll ): GetLastError = 126
LoadLibraryA( C:\ProgramData\BOINC\srcsrv.dll ): GetLastError = 126
LoadLibraryA( srcsrv.dll ): GetLastError = 126
LoadLibraryA( C:\ProgramData\BOINC\version.dll ): GetLastError = 126
Loaded Library    : version.dll
Debugger Engine   : 4.0.5.0
Symbol Search Path: C:\ProgramData\BOINC\slots\8;C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta;srv*C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\symbols*http://msdl.microsoft.com/download/symbols;srv*C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\symbols*https://boinc.bakerlab.org/rosetta/symstore


ModLoad: 00000000557a0000 00000000057ef000 C:\ProgramData\BOINC\projects\boinc.bakerlab.org_rosetta\rosetta_4.20_windows_x86_64.exe (-exported- Symbols Loaded)
    Linked PDB Filename   : C:\cygwin64\home\boinc\4.17\Rosetta\main\source\ide\VisualStudio\x64\BoincRelease\rosetta_4.20_windows_x86_64.pdb

ModLoad: 00000000a0c50000 00000000001f5000 C:\WINDOWS\SYSTEM32\ntdll.dll (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : ntdll.pdb
    File Version          : 10.0.19041.1466 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1466

ModLoad: 00000000a00e0000 00000000000be000 C:\WINDOWS\System32\KERNEL32.DLL (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : kernel32.pdb
    File Version          : 10.0.19041.1466 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1466

ModLoad: 000000009e4c0000 00000000002c8000 C:\WINDOWS\System32\KERNELBASE.dll (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : kernelbase.pdb
    File Version          : 10.0.19041.1466 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1466

ModLoad: 000000009f020000 000000000006b000 C:\WINDOWS\System32\WS2_32.dll (6.2.19041.546) (-exported- Symbols Loaded)
    Linked PDB Filename   : ws2_32.pdb
    File Version          : 10.0.19041.1081 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1081

ModLoad: 000000009ede0000 0000000000125000 C:\WINDOWS\System32\RPCRT4.dll (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : rpcrt4.pdb
    File Version          : 10.0.19041.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1

ModLoad: 000000009f920000 00000000001a1000 C:\WINDOWS\System32\USER32.dll (6.2.19041.1202) (-exported- Symbols Loaded)
    Linked PDB Filename   : user32.pdb
    File Version          : 10.0.19038.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19038.1

ModLoad: 000000009e490000 0000000000022000 C:\WINDOWS\System32\win32u.dll (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : win32u.pdb
    File Version          : 10.0.19041.1466 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1466

ModLoad: 00000000a0770000 000000000002b000 C:\WINDOWS\System32\GDI32.dll (6.2.19041.1202) (-exported- Symbols Loaded)
    Linked PDB Filename   : gdi32.pdb
    File Version          : 10.0.19041.1202 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1202

ModLoad: 000000009e7e0000 000000000010d000 C:\WINDOWS\System32\gdi32full.dll (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : gdi32full.pdb
    File Version          : 10.0.19041.1466 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1466

ModLoad: 000000009ea50000 000000000009d000 C:\WINDOWS\System32\msvcp_win.dll (6.2.19041.789) (-exported- Symbols Loaded)
    Linked PDB Filename   : msvcp_win.pdb
    File Version          : 10.0.19041.789 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.789

ModLoad: 000000009e360000 0000000000100000 C:\WINDOWS\System32\ucrtbase.dll (6.2.19041.789) (-exported- Symbols Loaded)
    Linked PDB Filename   : ucrtbase.pdb
    File Version          : 10.0.19041.789 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.789

ModLoad: 00000000a05b0000 00000000000ae000 C:\WINDOWS\System32\ADVAPI32.dll (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : advapi32.pdb
    File Version          : 10.0.19041.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1

ModLoad: 000000009ec90000 000000000009e000 C:\WINDOWS\System32\msvcrt.dll (7.0.19041.546) (-exported- Symbols Loaded)
    Linked PDB Filename   : msvcrt.pdb
    File Version          : 7.0.19041.546 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 7.0.19041.546

ModLoad: 00000000a0510000 000000000009c000 C:\WINDOWS\System32\sechost.dll (6.2.19041.1466) (-exported- Symbols Loaded)
    Linked PDB Filename   : sechost.pdb
    File Version          : 10.0.19041.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1

ModLoad: 000000009ed30000 0000000000030000 C:\WINDOWS\System32\IMM32.DLL (6.2.19041.546) (-exported- Symbols Loaded)
    Linked PDB Filename   : imm32.pdb
    File Version          : 10.0.19041.546 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.546

ModLoad: 000000009c270000 0000000000012000 C:\WINDOWS\SYSTEM32\kernel.appcore.dll (6.2.19041.546) (-exported- Symbols Loaded)
    Linked PDB Filename   : Kernel.Appcore.pdb
    File Version          : 10.0.19041.546 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.546

ModLoad: 000000009d050000 0000000000033000 C:\WINDOWS\SYSTEM32\ntmarta.dll (6.2.19041.546) (-exported- Symbols Loaded)
    Linked PDB Filename   : ntmarta.pdb
    File Version          : 10.0.19041.1 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1

ModLoad: 0000000098ed0000 00000000001e4000 C:\WINDOWS\SYSTEM32\dbghelp.dll (6.2.19041.867) (-exported- Symbols Loaded)
    Linked PDB Filename   : dbghelp.pdb
    File Version          : 10.0.19041.867 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.867

ModLoad: 0000000099960000 000000000000a000 C:\WINDOWS\SYSTEM32\version.dll (6.2.19041.546) (-exported- Symbols Loaded)
    Linked PDB Filename   : version.pdb
    File Version          : 10.0.19041.546 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.546

ModLoad: 000000009ec00000 0000000000082000 C:\WINDOWS\System32\bcryptPrimitives.dll (6.2.19041.1415) (-exported- Symbols Loaded)
    Linked PDB Filename   : bcryptprimitives.pdb
    File Version          : 10.0.19041.1415 (WinBuild.160101.0800)
    Company Name          : Microsoft Corporation
    Product Name          : Microsoft® Windows® Operating System
    Product Version       : 10.0.19041.1415



*** Dump of the Process Statistics: ***

- I/O Operations Counters -
Read: 6750, Write: 650, Other 13796

- I/O Transfers Counters -
Read: 21647037, Write: 17498, Other 6760

- Paged Pool Usage -
QuotaPagedPoolUsage: 317112, QuotaPeakPagedPoolUsage: 317416
QuotaNonPagedPoolUsage: 7200, QuotaPeakNonPagedPoolUsage: 7760

- Virtual Memory Usage -
VirtualSize: 83124224, PeakVirtualSize: 895541248

- Pagefile Usage -
PagefileUsage: 83124224, PeakPagefileUsage: 83124224

- Working Set Size -
WorkingSetSize: 104243200, PeakWorkingSetSize: 104247296, PageFaultCount: 25863

*** Dump of thread ID 3812 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Normal, , Kernel Time: 0.000000, User Time: 0.000000, Wait Time: 0.000000

- Unhandled Exception Record -
Reason: Access Violation (0xc0000005) at address 0x0000000000000004 

- Registers -
rax=000000000000003a rbx=000000008dc241c0 rcx=000000008e63be10 rdx=000000008e71bf48 rsi=000000000000000b rdi=000000008e63be10
r8=000000000000003a r9=0000000000000421 r10=0000000059346e80 r11=0000000018345000 r12=00000000557a0000 r13=000000001835f710
r14=0000000018345740 r15=000000000048b215 rip=0000000000000004 rsp=0000000018345078 rbp=0000000000000000
cs=0033  ss=002b  ds=002b  es=002b  fs=0053  gs=002b             efl=00010206

- Callstack -
ChildEBP RetAddr  Args to Child
18345070 55c7831c 00000000 59346d60 59346e80 18345058 !+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '00000004'
183450a0 55c3935d 8dc241c0 18345140 55c2b215 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55c7831c'
183450d0 58da7f10 59c90150 1835f710 00000000 00000001 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55c3935d'
18345100 55c239e8 5ad3a32c 557a0000 183451f0 a0c80e7b rosetta_4.20_windows_x86_64!xmlValidateNotationDecl+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '58da7f10'
18345170 a0cf20cf 00000000 183456f0 18345db0 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55c239e8'
183451a0 a0ca1454 00000000 183456f0 18345db0 00000000 ntdll!__chkstk+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'a0cf20cf'
183458b0 a0cf0bfe 00000003 a0c7b3c7 5941a450 a0c7b86b ntdll!RtlRaiseException+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'a0ca1454'
18346030 55ee3e2b fffffffe 91f88288 ffffffff 55ef18c5 ntdll!KiUserExceptionDispatcher+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'a0cf0bfe'
18346080 55ef3690 5941a3a0 91f87fe0 5941a3a0 18346179 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55ee3e2b'
183461b0 56009ee8 911d1be8 9242dcf0 91f87fe0 9242dcf0 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55ef3690'
18346d60 55fa4b6c 920c3750 a0c7b3c7 93b70000 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '56009ee8'
18346f60 55fa488e 18347048 00000000 18347230 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55fa4b6c'
183470c0 55f03da1 18347238 00000000 8dc23280 18347300 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55fa488e'
18347480 55f09f08 183477d0 183477d0 183477d0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55f03da1'
18347ad0 55f084db 8e3008b0 18347b30 8e1807e0 8e1807e0 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55f09f08'
18347c30 55e71fb7 00000000 18347d40 8e1807e0 18347f40 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55f084db'
18347da0 55e757a6 00000005 55c15190 8e1c3330 8e1c3330 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55e71fb7'
18347e10 55e756cc 18348118 18347f89 18348118 8e1807e0 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55e757a6'
18347ec0 55f3b6f5 18348118 18348441 00000000 55c375e8 rosetta_4.20_windows_x86_64!cppdb::session::is_open+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55e756cc'
18347fe0 55f3a592 00000005 18348118 183482f0 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55f3b6f5'
183480b0 55f3ad06 00000000 00000000 183489d0 93b70000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55f3a592'
18348250 563971a3 183482f0 183489d0 ffffff01 55c23e73 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55f3ad06'
18348540 56399d09 00000000 00000001 18348650 183489d0 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '563971a3'
183488d0 56392f8a 18348910 183489d0 916f0e80 8e1decb0 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '56399d09'
18348930 565acc70 183489d0 183490f8 8e1c3330 00000000 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '56392f8a'
183490c0 565ac6e4 922f5930 92209890 5ac15cc0 55c175a6 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '565acc70'
18349120 565b603e 18349210 922f5660 18349230 18349980 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '565ac6e4'
183498a0 565b56d4 933b97df 933b98ef 5ab87f70 565d6cb4 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '565b603e'
18349930 565b578e 00000005 18349ed8 8e1decb0 00000001 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '565b56d4'
18349ad0 55c2081d 8e4f6820 8e4f6820 8e1decb0 8dc25501 rosetta_4.20_windows_x86_64!cppdb::backend::statement::cache+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '565b578e'
1835f700 55c2b215 00000000 00000000 5ab4ccf8 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55c2081d'
1835f740 a00f7034 00000000 00000000 00000000 00000000 rosetta_4.20_windows_x86_64!xmlParserInputRead+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = '55c2b215'
1835f770 a0ca2651 00000000 00000000 00000000 00000000 KERNEL32!BaseThreadInitThunk+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'a00f7034'
1835f7f0 00000000 00000000 00000000 00000000 00000000 ntdll!RtlUserThreadStart+0x0 SymFromAddr(): GetLastError = '126'  SymGetModuleInfo(): GetLastError = '126' Address = 'a0ca2651'

*** Dump of thread ID 32764 (state: Initialized): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 6.000000, User Time: 0.000000, Wait Time: 3852527616.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000  ss=0000  ds=0000  es=0000  fs=0000  gs=0000             efl=00000000

- Callstack -
ChildEBP RetAddr  Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0 

*** Dump of thread ID 30942356 (state: Unknown): ***

- Information -
Status: Base Priority: Normal, Priority: Unknown, , Kernel Time: 17179869184.000000, User Time: 21474836480.000000, Wait Time: 0.000000

- Registers -
rax=0000000000000000 rbx=0000000000000000 rcx=0000000000000000 rdx=0000000000000000 rsi=0000000000000000 rdi=0000000000000000
r8=0000000000000000 r9=0000000000000000 r10=0000000000000000 r11=0000000000000000 r12=0000000000000000 r13=0000000000000000
r14=0000000000000000 r15=0000000000000000 rip=0000000000000000 rsp=0000000000000000 rbp=0000000000000000
cs=0000  ss=0000  ds=0000  es=0000  fs=0000  gs=0000             efl=00000000

- Callstack -
ChildEBP RetAddr  Args to Child
(-nosymbols- PC == 0)
00000000 00000000 00000000 00000000 00000000 00000000 !+0x0 


*** Debug Message Dump ****


*** Foreground Window Data ***
    Window Name      : 
    Window Class     : 
    Window Process ID: 0
    Window Thread ID : 0

Exiting...

</stderr_txt>
]]>
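
For anyone reading the report above, the exit code and the debugger's load failures can be decoded with a few lines of Python. This is only a sketch: `describe_exit_code` and the two lookup tables are hypothetical helpers written for illustration, not part of BOINC or Rosetta. The only facts it relies on are that 3221225477 is 0xC0000005 (the Windows STATUS_ACCESS_VIOLATION code) and that Win32 error 126 is ERROR_MOD_NOT_FOUND.

```python
# Codes that appear in the report above. The tables are deliberately tiny:
# they cover only what this thread needs, not the full Windows code lists.
NTSTATUS_NAMES = {0xC0000005: "STATUS_ACCESS_VIOLATION"}
WIN32_ERROR_NAMES = {126: "ERROR_MOD_NOT_FOUND"}


def describe_exit_code(exit_code: int) -> str:
    """Format a process exit code as 32-bit hex, with a name if we know one."""
    # Some tools print the same code as a negative signed integer,
    # so normalise to an unsigned 32-bit value first.
    code = exit_code & 0xFFFFFFFF
    name = NTSTATUS_NAMES.get(code, "unknown")
    return f"0x{code:08x} ({name})"


print(describe_exit_code(3221225477))   # decimal form from the <message> line
print(describe_exit_code(-1073741819))  # same code shown as a signed integer
print(WIN32_ERROR_NAMES[126])           # the GetLastError value in the dump
```

Error 126 on the first dbghelp.dll, symsrv.dll and srcsrv.dll attempts means the debugger could not find those helper DLLs at the paths it tried, which may be why every frame in the call stack shows +0x0 with a failed symbol lookup. The rip register (0x4) matches the faulting address, so the crash looks like a jump through a bad, near-null pointer, though the dump alone does not prove that.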

Grant
Darwin NT
ID: 104913 · Rating: 0
Bryn Mawr

Joined: 26 Dec 18
Posts: 393
Credit: 12,113,928
RAC: 4,486
Message 104917 - Posted: 18 Feb 2022, 8:44:27 UTC

OK, that’s the first difference, I’m using Ubuntu 20.04.3 rather than Windows, could it be as simple as that?

The task name appears to be from the same batch.

I’m also Ryzen 9 3900, Boinc 7.18.1 if that is any different.
ID: 104917 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1683
Credit: 17,917,681
RAC: 22,352
Message 104919 - Posted: 18 Feb 2022, 8:56:02 UTC - in response to Message 104917.  

OK, that’s the first difference, I’m using Ubuntu 20.04.3 rather than Windows, could it be as simple as that?
Yep.
It shouldn't, but it wouldn't surprise me.
Grant
Darwin NT
ID: 104919 · Rating: 0
adrianxw
Joined: 18 Sep 05
Posts: 653
Credit: 11,840,739
RAC: 34
Message 104931 - Posted: 18 Feb 2022, 12:43:49 UTC

Perhaps someone at the project noticed the problem. I had a large number of these quickly crashing work units, on both my machines, but neither has downloaded new work for a couple of hours.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 104931 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1683
Credit: 17,917,681
RAC: 22,352
Message 104953 - Posted: 18 Feb 2022, 21:32:08 UTC - in response to Message 104931.  

Perhaps someone at the project noticed the problem. I had a large number of these quickly crashing work units, on both my machines, but neither has downloaded new work for a couple of hours.
Nothing to do with the project, that is normal behaviour for the BOINC Manager.

When a Task ends with an error, the BOINC Manager backs off contacting the Scheduler for several minutes. Another error, a longer back off. A whole bunch of errors, and it backs off for several hours. After that backoff period ends, or a Task is completed without an error, then the Manager contacts the Scheduler to report the completed/errored Tasks & to get more work.
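
The escalating backoff described above can be sketched roughly as follows. The base delay, the doubling rule and the four-hour cap are assumptions picked for illustration, not the BOINC client's actual constants, and `SchedulerBackoff` is a hypothetical name rather than anything in the BOINC source.

```python
class SchedulerBackoff:
    """Toy model: each consecutive task error lengthens the wait before the
    next scheduler contact; one successful task clears the wait."""

    def __init__(self, base_seconds: float = 60.0, cap_seconds: float = 4 * 3600.0):
        self.base_seconds = base_seconds    # assumed first delay
        self.cap_seconds = cap_seconds      # assumed upper bound ("several hours")
        self.consecutive_errors = 0

    def record(self, task_ok: bool) -> float:
        """Register a finished task and return the resulting delay in seconds."""
        if task_ok:
            self.consecutive_errors = 0
        else:
            self.consecutive_errors += 1
        return self.delay()

    def delay(self) -> float:
        if self.consecutive_errors == 0:
            return 0.0
        # Double the delay for every extra error, but never exceed the cap.
        grown = self.base_seconds * 2 ** (self.consecutive_errors - 1)
        return min(self.cap_seconds, grown)


backoff = SchedulerBackoff()
for outcome in [False, False, False, True]:
    print("ok" if outcome else "error", backoff.record(outcome))
```

With a batch where every task fails, a model like this reaches its cap after a handful of errors, which is consistent with a host going quiet for hours without any change on the project side.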
Grant
Darwin NT
ID: 104953 · Rating: 0
Sid Celery

Joined: 11 Feb 08
Posts: 2125
Credit: 41,245,383
RAC: 9,571
Message 104971 - Posted: 19 Feb 2022, 0:23:16 UTC - in response to Message 104917.  

OK, that’s the first difference, I’m using Ubuntu 20.04.3 rather than Windows, could it be as simple as that?

The task name appears to be from the same batch.

I’m also Ryzen 9 3900, Boinc 7.18.1 if that is any different.

Interesting. I've reported that difference between Ubuntu (all success) and Windows (all failure) too.
No response yet.
ID: 104971 · Rating: 0
Kissagogo27

Joined: 31 Mar 20
Posts: 86
Credit: 2,928,597
RAC: 2,752
Message 105003 - Posted: 19 Feb 2022, 18:04:23 UTC
Last modified: 19 Feb 2022, 18:08:01 UTC

Until a few days ago, BOINC downloaded the master file before the bad WUs!

Today it downloaded the 4.21 version for non-64-bit systems!?

19-Feb-2022 11:11:48 [Rosetta@home] VirtualBox is not installed
19-Feb-2022 11:11:48 [Rosetta@home] This computer has finished a daily quota of 178 tasks
19-Feb-2022 11:11:48 [Rosetta@home] Project requested delay of 31 seconds
19-Feb-2022 11:12:45 [Rosetta@home] Finished download of rosetta_4.21_windows_intelx86.exe
19-Feb-2022 11:12:56 [Rosetta@home] Finished download of rosetta_graphics_4.21_windows_intelx86.exe


19-Feb-2022 18:49:19 [Rosetta@home] VirtualBox is not installed
19-Feb-2022 18:49:19 [Rosetta@home] This computer has finished a daily quota of 224 tasks
19-Feb-2022 18:49:19 [Rosetta@home] Project requested delay of 31 seconds
19-Feb-2022 18:50:19 [Rosetta@home] Finished download of rosetta_4.21_windows_intelx86.exe
19-Feb-2022 18:50:27 [Rosetta@home] Finished download of rosetta_graphics_4.21_windows_intelx86.exe


and stopped using 4.20.
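
To check which application builds a host has fetched, event log lines like the ones above can be filtered with a small script. This is a sketch that assumes the line layout seen in this post (timestamp, project in square brackets, message) and the file naming `<app>_<version>_<platform>.exe`; `finished_downloads` is a hypothetical helper, not a BOINC tool.

```python
import re

# Event log line as quoted above: "<date> <time> [<project>] <message>"
LOG_LINE = re.compile(
    r"^(?P<ts>\d{2}-[A-Za-z]{3}-\d{4} \d{2}:\d{2}:\d{2}) "
    r"\[(?P<project>[^\]]+)\] (?P<msg>.*)$"
)
# Message of interest: "Finished download of <app>_<version>_<platform>.exe"
DOWNLOAD = re.compile(
    r"^Finished download of (?P<app>[a-z_]+?)_(?P<version>\d+\.\d+)_"
    r"(?P<platform>\w+)\.exe$"
)

SAMPLE_LOG = [
    "19-Feb-2022 11:11:48 [Rosetta@home] VirtualBox is not installed",
    "19-Feb-2022 11:12:45 [Rosetta@home] Finished download of rosetta_4.21_windows_intelx86.exe",
    "19-Feb-2022 11:12:56 [Rosetta@home] Finished download of rosetta_graphics_4.21_windows_intelx86.exe",
]


def finished_downloads(lines):
    """Return (timestamp, app, version, platform) for each finished download."""
    found = []
    for line in lines:
        entry = LOG_LINE.match(line.strip())
        if entry is None:
            continue  # not an event log line in the expected layout
        download = DOWNLOAD.match(entry["msg"])
        if download is None:
            continue  # some other message, e.g. the VirtualBox notice
        found.append(
            (entry["ts"], download["app"], download["version"], download["platform"])
        )
    return found


for row in finished_downloads(SAMPLE_LOG):
    print(row)
```

The platform field is the useful part here: `windows_intelx86` is BOINC's name for the 32-bit Windows platform, while the crashing 4.20 executable in the earlier report is the `windows_x86_64` build.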
ID: 105003 · Rating: 0
kotenok2000
Joined: 22 Feb 11
Posts: 259
Credit: 497,274
RAC: 903
Message 105006 - Posted: 19 Feb 2022, 18:38:05 UTC - in response to Message 105003.  

I have set Rosetta to no new tasks, synchronized with the project, reset the project, and still get 4.20 workunits.
ID: 105006 · Rating: 0
adrianxw
Joined: 18 Sep 05
Posts: 653
Credit: 11,840,739
RAC: 34
Message 105009 - Posted: 19 Feb 2022, 19:20:50 UTC - in response to Message 104953.  

>>> Nothing to do with the project, that is normal behaviour for the BOINC Manager.

When a project has submitted a batch of work units that are immediately crashing in the cruncher pool, an in-touch project manager can remove future downloads of the failing work units. It is exactly relevant to the project, and absolutely nothing to do with BOINC.
Wave upon wave of demented avengers march cheerfully out of obscurity into the dream.
ID: 105009 · Rating: 0
Kissagogo27

Joined: 31 Mar 20
Posts: 86
Credit: 2,928,597
RAC: 2,752
Message 105014 - Posted: 19 Feb 2022, 20:06:54 UTC

Even with 4.21, the errors still occurred after I got one RB task...
ID: 105014 · Rating: 0
Grant (SSSF)
Joined: 28 Mar 20
Posts: 1683
Credit: 17,917,681
RAC: 22,352
Message 105028 - Posted: 19 Feb 2022, 22:45:46 UTC - in response to Message 105009.  

>>> Nothing to do with the project, that is normal behaviour for the BOINC Manager.

When a project has submitted a batch of work units that are immediately crashing in the cruncher pool, an in touch project manager can remove future downloads of the failing work units. It is exactly relevant to the project, and absolutely nothing to do with BOINC.
Did you even pay the slightest bit of attention to the statement I was responding to? I doubt it, as you have taken my quote completely out of context; you didn't even include the statement that I was actually responding to.
The statement you gave (that cleaning up any mess a project makes is the project's responsibility, not BOINC's) is completely irrelevant to the comment I was responding to (which is about how BOINC responds to such messes).

I suggest you think a bit more before you post in future.
Grant
Darwin NT
ID: 105028 · Rating: 0
Odd-Rod

Joined: 5 Dec 05
Posts: 1
Credit: 1,521,435
RAC: 115
Message 105029 - Posted: 19 Feb 2022, 22:47:40 UTC - in response to Message 105009.  

>>> Nothing to do with the project, that is normal behaviour for the BOINC Manager.

When a project has submitted a batch of work units that are immediately crashing in the cruncher pool, an in touch project manager can remove future downloads of the failing work units. It is exactly relevant to the project, and absolutely nothing to do with BOINC.

Actually, the description of Boinc behaviour is correct. While "an in touch project manager can remove future downloads" would also cause a lack of WUs to be downloaded, that is not what is happening here, since I have just downloaded WUs that so far are all erroring out.
ID: 105029 · Rating: 0
tullio

Joined: 10 May 20
Posts: 63
Credit: 630,125
RAC: 0
Message 105045 - Posted: 20 Feb 2022, 4:33:34 UTC
Last modified: 20 Feb 2022, 4:45:35 UTC

When I ask for tasks on my Windows 11 PC (Intel i5 9400F), I get six Rosetta 4.20 tasks and one rosetta python task. The 4.20 tasks error out immediately, and the rosetta python task completes and validates.
Tullio
ID: 105045 · Rating: 0
tvdsluis

Joined: 27 Mar 20
Posts: 11
Credit: 514,960
RAC: 0
Message 105050 - Posted: 20 Feb 2022, 10:18:32 UTC

Same here!

100% failure rate (computation error) on all machines (all Windows).

movingstub_gzm1_minimize_3CL_AVLstub_0285_192_extract_B_SAVE_ALL_OUT_2909202_956_1_r1734002377_0

Somebody needs to wake up.
ID: 105050 · Rating: 0



©2024 University of Washington
https://www.bakerlab.org