mxosrvr cores when sqstop is done right after tests finish in gate tests

Bug #1414095 reported by Aruna Sadashiva
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Trafodion
Fix Committed
High
Tharak Capirala

Bug Description

During gate tests, mxosrvrs are coring as sqstop is done right after tests finish and mxosvrr is trying to write end session stats, but monitor is down already.

#0 0x0000003299a32625 in raise () from /lib64/libc.so.6
#1 0x0000003299a33d8d in abort () from /lib64/libc.so.6
#2 0x0000003299a2b74e in __assert_fail_base () from /lib64/libc.so.6
#3 0x0000003299a2b810 in __assert_fail () from /lib64/libc.so.6
#4 0x00007ffff4d2b279 in MyGuaProcessHandle::MyGuaProcessHandle (this=0x7fffd9f40df0) at ../common/IpcGuardian.cpp:338
#5 0x00007ffff4d1f200 in IpcEnvironment::getMyOwnProcessId (this=0x7fffd9463060, dom=IPC_DOM_GUA_PHANDLE) at ../common/Ipc.cpp:5327
#6 0x00007ffff3d4d7ab in ExRtFragTable::ExRtFragTable (this=0x7fffd876ffd8, glob=0x7fffd8756338, fragDir=0x7fffd87b02b0, generatedObject=0x7fffd87ae5b8 "\260*&\364\377\177") at ../executor/ex_frag_rt.cpp:175
#7 0x00007ffff3d968a1 in ex_root_tdb::build (this=0x7fffd87ae5b8, cliGlobals=0x1062920, glob=0x7fffd8756338) at ../executor/ex_root.cpp:123
#8 0x00007ffff51a0280 in CliStatement::fixup (this=0x7fffd9503290, cliGlobals=0x1062920, input_desc=0x7fffd9532d10, diagsArea=..., doSimCheck=@0x7fffd9f4235c, partitionUnavailable=@0x7fffd9f4237c, donePrepare=0) at ../cli/Statement.cpp:3112
#9 0x00007ffff51a3067 in CliStatement::execute (this=0x7fffd9503290, cliGlobals=0x1062920, input_desc=0x7fffd9532d10, diagsArea=..., execute_state=CliStatement::INITIAL_STATE_, fixupOnly=0, cliflags=0) at ../cli/Statement.cpp:4250
#10 0x00007ffff512d39f in SQLCLI_PerformTasks(CliGlobals *, ULng32, SQLSTMT_ID *, SQLDESC_ID *, SQLDESC_ID *, Lng32, Lng32, typedef __va_list_tag __va_list_tag *, SQLCLI_PTR_PAIRS *, SQLCLI_PTR_PAIRS *) (cliGlobals=0x1062920, tasks=4882, statement_id=0x2ba3140, input_descriptor=0x2960d70, output_descriptor=0x0, num_input_ptr_pairs=0, num_output_ptr_pairs=0, ap=0x7fffd9f42940, input_ptr_pairs=0x0, output_ptr_pairs=0x0) at ../cli/Cli.cpp:3284
#11 0x00007ffff512dd10 in SQLCLI_Exec(CliGlobals *, SQLSTMT_ID *, SQLDESC_ID *, Lng32, typedef __va_list_tag __va_list_tag *, SQLCLI_PTR_PAIRS *) (cliGlobals=0x1062920, statement_id=0x2ba3140, input_descriptor=0x2960d70, num_ptr_pairs=0, ap=0x7fffd9f42940, ptr_pairs=0x0) at ../cli/Cli.cpp:3531
#12 0x00007ffff51ba8d8 in SQL_EXEC_Exec (statement_id=0x2ba3140, input_descriptor=0x2960d70, num_ptr_pairs=0) at ../cli/CliExtern.cpp:2071
#13 0x00007ffff3e11f56 in ExeCliInterface::exec (this=0x7fffd9f455d0, inputBuf=0x0, inputBufLen=0) at ../executor/ExExeUtilCli.cpp:599
#14 0x00007ffff3e12fb7 in ExeCliInterface::fetchRowsPrologue (this=0x7fffd9f455d0, sqlStrBuf=0x7fffd9f42bf0 "select text from TRAFODION.\"_MD_\".TEXT where text_uid = 39463105331843146 and text_type = 4 and sub_id = 42 for read committed access order by seq_num", noExec=0, monitorThis=0, stmtName=0x0) at ../executor/ExExeUtilCli.cpp:1040
#15 0x00007ffff3e1328f in ExeCliInterface::fetchAllRows (this=0x7fffd9f455d0, infoList=@0x7fffd9f42fe0, query=0x7fffd9f42bf0 "select text from TRAFODION.\"_MD_\".TEXT where text_uid = 39463105331843146 and text_type = 4 and sub_id = 42 for read committed access order by seq_num", inNumOutputEntries=0, varcharFormat=0, monitorThis=0, initInfoList=1) at ../executor/ExExeUtilCli.cpp:1114
#16 0x00007ffff05aaa42 in CmpSeabaseDDL::getTextFromMD (this=0x7fffd9f45dc0, cliInterface=0x7fffd9f455d0, textUID=39463105331843146, textType=4, textSubID=42, outText=...) at ../sqlcomp/CmpSeabaseDDLtable.cpp:4947
#17 0x00007ffff05b27ee in CmpSeabaseDDL::getSeabaseColumnInfo (this=0x7fffd9f45dc0, cliInterface=0x7fffd9f455d0, objUID=39463105331843146, catName=..., schName=..., objName=..., direction=0x7fffd9f459d0 "' '", isTableSalted=0x7fffd9f459ec, identityColPos=0x7fffd9f459cc, numCols=0x7fffd9f459fc, outColInfoArray=0x7fffd9f459f0) at ../sqlcomp/CmpSeabaseDDLtable.cpp:6448
#18 0x00007ffff05b3fc3 in CmpSeabaseDDL::getSeabaseUserTableDesc (this=0x7fffd9f45dc0, catName=..., schName=..., objName=..., objType=COM_BASE_TABLE_OBJECT, includeInvalidDefs=0) at ../sqlcomp/CmpSeabaseDDLtable.cpp:6806
#19 0x00007ffff05b7418 in CmpSeabaseDDL::getSeabaseTableDesc (this=0x7fffd9f45dc0, catName=..., schName=..., objName=..., objType=COM_BASE_TABLE_OBJECT, includeInvalidDefs=0) at ../sqlcomp/CmpSeabaseDDLtable.cpp:7491
#20 0x00007fffeebfdb49 in NATableDB::get (this=0x7fffd93e76b8, corrName=..., bindWA=0x7fffd9f49810, inTableDescStruct=0x0) at ../optimizer/NATable.cpp:8007
#21 0x00007fffee94a328 in BindWA::getNATable (this=0x7fffd9f49810, corrName=..., catmanCollectTableUsages=1, inTableDescStruct=0x0) at ../optimizer/BindRelExpr.cpp:1445
#22 0x00007fffee975545 in GenericUpdate::bindNode (this=0x7fffd9312928, bindWA=0x7fffd9f49810) at ../optimizer/BindRelExpr.cpp:11539
#23 0x00007fffee96ab43 in Insert::bindNode (this=0x7fffd9312928, bindWA=0x7fffd9f49810) at ../optimizer/BindRelExpr.cpp:9043
#24 0x00007fffee94d7b7 in RelExpr::bindChildren (this=0x7fffd9313718, bindWA=0x7fffd9f49810) at ../optimizer/BindRelExpr.cpp:2180
#25 0x00007fffee95a352 in RelRoot::bindNode (this=0x7fffd9313718, bindWA=0x7fffd9f49810) at ../optimizer/BindRelExpr.cpp:5225
#26 0x00007ffff052aaab in CmpMain::compile (this=0x7fffd9f4c400, input_str=0x7fffd9437ff8 "insert into Trafodion.\"_REPOS_\".metric_session_table values(0,0,0,11319,11319,0,0,0,'172.16.0.11',0,'$Z00098E','MXID11000011319212288741031839550000000000206U3333307phoenix','END',CONVERTTIMESTAMP(212"..., charset=15, queryExpr=@0x7fffd9f4c2e8, gen_code=0x7fffd930a838, gen_code_len=0x7fffd930a830, heap=0x7fffd94f7c08, phase=CmpMain::END, fragmentDir=0x7fffd9f4c558, op=3004, useQueryCache=1, cacheable=0x7fffd9f4c2d4, begTime=0x7fffd9f4c2f0, shouldLog=0) at ../sqlcomp/CmpMain.cpp:2119
#27 0x00007ffff05296c1 in CmpMain::sqlcomp (this=0x7fffd9f4c400, input_str=0x7fffd9437ff8 "insert into Trafodion.\"_REPOS_\".metric_session_table values(0,0,0,11319,11319,0,0,0,'172.16.0.11',0,'$Z00098E','MXID11000011319212288741031839550000000000206U3333307phoenix','END',CONVERTTIMESTAMP(212"..., charset=15, queryExpr=@0x7fffd9f4c2e8, gen_code=0x7fffd930a838, gen_code_len=0x7fffd930a830, heap=0x7fffd94f7c08, phase=CmpMain::END, fragmentDir=0x7fffd9f4c558, op=3004, useQueryCache=1, cacheable=0x7fffd9f4c2d4, begTime=0x7fffd9f4c2f0, shouldLog=0) at ../sqlcomp/CmpMain.cpp:1732
#28 0x00007ffff0526dcc in CmpMain::sqlcomp (this=0x7fffd9f4c400, input=..., gen_code=0x7fffd930a838, gen_code_len=0x7fffd930a830, heap=0x7fffd94f7c08, phase=CmpMain::END, fragmentDir=0x7fffd9f4c558, op=3004) at ../sqlcomp/CmpMain.cpp:817
#29 0x00007fffedc8cecd in CmpStatement::process (this=0x7fffd9410898, sqltext=...) at ../arkcmp/CmpStatement.cpp:508
#30 0x00007fffedc7cefd in CmpContext::compileDirect (this=0x7fffd93e2090, data=0x7fffd878a2d0 "\200", data_len=520, outHeap=0x7fffe55f51d0, charset=15, op=CmpMessageObj::SQLTEXT_COMPILE, gen_code=@0x7fffd9f4ca90, gen_code_len=@0x7fffd9f4ca9c, parserFlags=131072, diagsArea=0x7fffd878a4e0) at ../arkcmp/CmpContext.cpp:686
#31 0x00007ffff519cc39 in CliStatement::prepare2 (this=0x7fffd94ebbd0, source=0x7fffd87ab480 "insert into Trafodion.\"_REPOS_\".metric_session_table values(0,0,0,11319,11319,0,0,0,'172.16.0.11',0,'$Z00098E','MXID11000011319212288741031839550000000000206U3333307phoenix','END',CONVERTTIMESTAMP(212"..., diagsArea=..., passed_gen_code=0x0, passed_gen_code_len=0, charset=15, unpackTdbs=1, cliFlags=144) at ../cli/Statement.cpp:1827
#32 0x00007ffff519b9cc in CliStatement::prepare (this=0x7fffd94ebbd0, source=0x7fffd87ab480 "insert into Trafodion.\"_REPOS_\".metric_session_table values(0,0,0,11319,11319,0,0,0,'172.16.0.11',0,'$Z00098E','MXID11000011319212288741031839550000000000206U3333307phoenix','END',CONVERTTIMESTAMP(212"..., diagsArea=..., passed_gen_code=0x0, passed_gen_code_len=0, charset=15, unpackTdbs=1, cliFlags=144) at ../cli/Statement.cpp:1420
#33 0x00007ffff512e3ee in SQLCLI_ExecDirect2(CliGlobals *, SQLSTMT_ID *, SQLDESC_ID *, Int32, SQLDESC_ID *, Lng32, typedef __va_list_tag __va_list_tag *, SQLCLI_PTR_PAIRS *) (cliGlobals=0x1062920, statement_id=0x1b86668, sql_source=0x7fffd9f4cfa0, prepFlags=0, input_descriptor=0x0, num_ptr_pairs=0, ap=0x7fffd9f4cd50, ptr_pairs=0x0) at ../cli/Cli.cpp:3705
#34 0x00007ffff51bb003 in SQL_EXEC_ExecDirect2 (statement_id=0x1b86668, sql_source=0x7fffd9f4cfa0, prep_flags=0, input_descriptor=0x0, num_ptr_pairs=0) at ../cli/CliExtern.cpp:2326
#35 0x00007ffff6bf6279 in SRVR::WSQL_EXEC_ExecDirect (statement_id=0x1b86668, sql_source=0x7fffd9f4cfa0, input_descriptor=0x0, num_ptr_pairs=0) at SQLWrapper.cpp:360
#36 0x00007ffff6bdd1e4 in SRVR::EXECDIRECT (pSrvrStmt=0x1b86050) at sqlinterface.cpp:4479
#37 0x00007ffff6b6abc9 in SRVR::ControlProc (pParam=0x1b86050) at csrvrstmt.cpp:757
#38 0x00007ffff6b6a207 in SRVR_STMT_HDL::ExecDirect (this=0x1b86050, inCursorName=0x0, inSqlString=0x1e04438 "insert into Trafodion.\"_REPOS_\".metric_session_table values(0,0,0,11319,11319,0,0,0,'172.16.0.11',0,'$Z00098E','MXID11000011319212288741031839550000000000206U3333307phoenix','END',CONVERTTIMESTAMP(212"..., inStmtType=1, inSqlStmtType=0, inSqlAsyncEnable=0, inQueryTimeout=0) at csrvrstmt.cpp:439
#39 0x00000000005767a0 in SessionWatchDog (arg=0x0) at SrvrConnect.cpp:795
#40 0x0000003299e079d1 in start_thread () from /lib64/libpthread.so.0
#41 0x0000003299ae88fd in clone () from /lib64/libc.so.6

Changed in trafodion:
assignee: nobody → Arvind Narain (arvind-narain)
Changed in trafodion:
milestone: r1.0 → r1.0.1
importance: Critical → High
Changed in trafodion:
milestone: r1.0.1 → r1.1
Changed in trafodion:
milestone: r1.1 → r1.0.1
Changed in trafodion:
milestone: r1.0.1 → r1.1
Changed in trafodion:
assignee: Arvind Narain (arvind-narain) → Tharak Capirala (capirala-tharaknath)
Revision history for this message
Tharak Capirala (capirala-tharaknath) wrote :

When the repository feature is enabled at time MXOSRVR may core during a dcsstop/sqstop. This is because the repository based thread does get terminated properly before the sqstop is invoked. Fix is to call the SyncPublicationThread() during terminateThreads() where the rest of the threads in MXOSRVR are being stopped. Moved SyncPublicationThread() before the check for tcp/ip thread.

Changed in trafodion:
status: New → Fix Committed
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.