
Stack Trace on Standby ASE of WS pair and Thread going down on Rep Server

Former Member
0 Kudos

Inserts into two tables with text columns are failing with a stack trace in ASE (infected with signal 11 at address ptn_clr_hintpage+0x24) and the following errors in the Replication Server error log.

I. 2015/07/03 05:26:41. The DSI thread for database 'HRS1SYBPRD15.gotrade2' is started.

I. 2015/07/03 05:26:41. A grouped transaction of 7 individual transactions has failed in database 'HRS1SYBPRD15.gotrade2'. Each transaction in the group will be executed individually.

I. 2015/07/03 05:26:41. Message from server: Message: 5702, State 1, Severity 10 -- 'ASE is terminating this process.'.

H. 2015/07/03 05:26:42. THREAD FATAL ERROR #5049 DSI EXEC(133(1) HRS1SYBPRD15.gotrade2) - dsiqmint.c(3275)

    The DSI thread for database 'HRS1SYBPRD15.gotrade2' is being shutdown. DSI received data server error #5702 which is mapped to STOP_REPLICATION. See logged data server errors for more information. The data server error was caused by output command #2 mapped from input command #3 of the failed transaction.

I. 2015/07/03 05:26:42. The DSI thread for database 'HRS1SYBPRD15.gotrade2' is shutdown.

We are skipping these transactions for now. Whenever an update arrives for a row from one of the skipped transactions, it fails with the following errors in the Replication Server error log.

I. 2015/07/03 07:12:15. DSI received Replication Server error #5185 which is mapped to WARN by error action mapping.

W. 2015/07/03 07:12:15. WARNING #5185 DSI EXEC(133(1) HRS1SYBPRD15.gotrade2) - /dsiexec.c(4289)

    Row count mismatch for the command executed on 'HRS1SYBPRD15.gotrade2'. The command impacted 0 rows but it should impact 1 rows.

I. 2015/07/03 07:12:15. Function 'rs_get_textptr' for 'GoMLTradeMessage' returned no rows.

I. 2015/07/03 07:12:15. A grouped transaction of 11 individual transactions has failed in database 'HRS1SYBPRD15.gotrade2'. Each transaction in the group will be executed individually.

I. 2015/07/03 07:12:15. DSI received Replication Server error #5185 which is mapped to WARN by error action mapping.

W. 2015/07/03 07:12:15. WARNING #5185 DSI EXEC(133(1) HRS1SYBPRD15.gotrade2) - /dsiexec.c(4289)

    Row count mismatch for the command executed on 'HRS1SYBPRD15.gotrade2'. The command impacted 0 rows but it should impact 1 rows.

E. 2015/07/03 07:12:16. ERROR #5149 DSI EXEC(133(1) HRS1SYBPRD15.gotrade2) - /dsiexec.c(5287)

    Function 'rs_get_textptr' for 'GoMLTradeMessage' returned no rows.

We are skipping these as well. We tried to create an error class for errors 5702 and 5185, but the transactions do not get skipped and the DSI keeps trying to apply them at the standby without going down or moving ahead. At that time we see these errors in the Replication Server error log.

I. 2015/07/03 05:41:59. The DSI thread for database 'HRS1SYBPRD15.gotrade2' is started.

E. 2015/07/03 05:42:01. ERROR #1027 DSI EXEC(133(1) HRS1SYBPRD15.gotrade2) - /dsiutil.c(339)

        Open Client Client-Library error: Error: 84083974, Severity 5 -- 'ct_results(): network packet layer: internal net library error: Net-Library operation terminated due to disconnect'.

E. 2015/07/03 05:42:01. ERROR #5097 DSI EXEC(133(1) HRS1SYBPRD15.gotrade2) - /dsiutil.c(342)

        The ct-lib function 'ct_results' returns FAIL for database 'HRS1SYBPRD15.gotrade2'. The errors are retryable. The DSI thread will restart automatically. See ct-lib messages for more information.
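To see at a glance which errors dominate an error log like the excerpts above, the header lines can be tallied by severity letter and error number. The following is only a minimal sketch: the line layout is inferred from the excerpts in this post (severity letter, timestamp, message), and the function name `summarize_errorlog` is hypothetical, not part of any SAP tool.

```python
import re
from collections import Counter

# Header layout inferred from the excerpts in this post, e.g.
# "E. 2015/07/03 07:12:16. ERROR #5149 DSI EXEC(...) - /dsiexec.c(5287)"
# Assumption: one uppercase severity letter, then a timestamp, then the message.
HEADER = re.compile(
    r"^(?P<sev>[A-Z])\. (?P<ts>\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2})\. (?P<msg>.*)$"
)
ERROR_NO = re.compile(r"#(\d+)")


def summarize_errorlog(lines):
    """Count (severity, error number) pairs found on header lines.

    Indented continuation lines do not match HEADER and are skipped,
    so numbers such as "output command #2" are not counted.
    """
    counts = Counter()
    for line in lines:
        match = HEADER.match(line.strip())
        if match is None:
            continue
        number = ERROR_NO.search(match.group("msg"))
        if number:
            counts[(match.group("sev"), int(number.group(1)))] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "I. 2015/07/03 07:12:15. DSI received Replication Server error #5185 "
        "which is mapped to WARN by error action mapping.",
        "W. 2015/07/03 07:12:15. WARNING #5185 DSI EXEC(133(1) "
        "HRS1SYBPRD15.gotrade2) - /dsiexec.c(4289)",
        "    Row count mismatch for the command executed on "
        "'HRS1SYBPRD15.gotrade2'.",
        "E. 2015/07/03 07:12:16. ERROR #5149 DSI EXEC(133(1) "
        "HRS1SYBPRD15.gotrade2) - /dsiexec.c(5287)",
    ]
    for (sev, num), n in sorted(summarize_errorlog(sample).items()):
        print(sev, num, n)
```

Run against the full error log file, this gives a quick count of how often 5049, 5185, 5149, 1027 and 5097 occur, which may help when reporting the pattern to support.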

The workaround provided by SAP was to reboot, which is not helping us: we keep getting these stack traces even after a reboot, and the threads go down.

The solution provided by SAP was to use sp_reptostandby without use_index, which will take around 24 to 30 hours to run on our databases and cannot be implemented soon.

Can anyone give us some pointers on how to resolve these issues without using the sp_reptostandby solution?

Regards,

Rhishikesh.

Accepted Solutions (0)

Answers (2)


Former Member
0 Kudos

We were planning to change the data type of these text columns to either binary or varbinary, but those can hold a maximum of 16K of data. Our requirement is a data type that holds around 32K of data, so the next option for us is to try the image data type. Does anyone think we will hit the same issue with the image data type that we are hitting with the text columns?

Does anyone have any other data type suggestion for us, other than text, that can hold up to 32K of data?

victoria_normand
Contributor
0 Kudos

Hi Rhishikesh,
You mentioned a stack trace in ASE; can you post it? Was any error reported prior to the stack trace? Is it on the standby or on the source data server? What are the ASE and RS versions?
An issue was identified when set replication on is used together with an index on image/text columns. You may refer to KBA 2057830 - ASE : Signal 11 with stack trace in the module ptn_clr_hintpage on replicate database.

Best regards,
Victoria.

Former Member
0 Kudos

Hi Maria,

ASE Version : Adaptive Server Enterprise/15.0.3/EBF 17166 ESD#3/P/Solaris AMD64/OS 5.10/ase1503/2726/64-bit/FBO/Fri Feb  5 02:16:02 2010

Rep Version: Replication Server/15.2/ONE-OFF EBF 20217/AMD Opteron/Solaris 5.10/1/OPT/Tue Jun 12 11:18:26 2012

We have seen KBA 2057830 and this is exactly the issue we are facing, but unfortunately we still observe the stack trace even after rebooting the standby ASE server.

Also, we cannot run sp_reptostandby without use_index now, as it took around 30 hours on a test server with the same configuration.

I am pasting the stack trace received on the standby server below.

========================================

18:00000:00572:2015/07/03 06:23:27.28 kernel  Current process (0x4c4a01f4) infected with signal 11 (SIGSEGV)

18:00000:00572:2015/07/03 06:23:27.28 kernel  Address 0x00000000012be3eb (ptn_clr_hintpage+0x24), siginfo (code, address) = (1, 0x000000000000005c)

18:00000:00572:2015/07/03 06:23:27.28 kernel  ************************************

18:00000:00572:2015/07/03 06:23:27.28 kernel  SQL causing error : }^D

18:00000:00572:2015/07/03 06:23:27.28 kernel  ************************************

18:00000:00572:2015/07/03 06:23:27.28 server  SQL Text: }^D

18:00000:00572:2015/07/03 06:23:27.28 kernel  curdb = 8 tempdb = 2 pstat = 0x10000

18:00000:00572:2015/07/03 06:23:27.28 kernel  lasterror = 0 preverror = 0 transtate = 0

18:00000:00572:2015/07/03 06:23:27.28 kernel  curcmd = 337 program = RepServer

18:00000:00572:2015/07/03 06:23:27.28 kernel  extended error information: hostname: ykt1repprd15 login: gotrade2_maint

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000018d10ee pcstkwalk+0x35()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000018d0e9f ucstkgentrace+0x24f()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000001847386 ucbacktrace+0xe8()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000b8fe94 terminate_process+0x11f1()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x000000000188d5b3 kisignal+0x22f()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000012be3eb ptn_clr_hintpage+0x24()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000001261321 pg_deallpage+0x35c()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x000000000103afce bt__shrinkleaf+0x511()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x000000000103d259 bt__maybeshrink+0x4fb()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000001031a11 bt__handle_pagedone+0x120()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x000000000102ee6c bt_getnext+0x108()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000b7c550 dol_bt_getnext+0x13d()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000b20473 getnext+0xc0()

18:00000:00572:2015/07/03 06:23:27.28 kernel  [Handler pc: 0x00000000015b8e9d hdl_backout_msg installed by the following function:-]

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000012ff9a2 txtimg__search_textpointer+0x24e()

18:00000:00572:2015/07/03 06:23:27.28 kernel  [Handler pc: 0x00000000015b8e9d hdl_backout_msg installed by the following function:-]

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000012fb94a txtimg_get_datarow+0x24e()

18:00000:00572:2015/07/03 06:23:27.28 kernel  [Handler pc: 0x00000000012dc0b8 text_handler installed by the following function:-]

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000012db031 writetext+0x428()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000ad5102 s_execute+0x4a22()

18:00000:00572:2015/07/03 06:23:27.28 kernel  [Handler pc: 0x00000000015b8e73 hdl_stack installed by the following function:-]

18:00000:00572:2015/07/03 06:23:27.28 kernel  [Handler pc: 0x00000000015605e0 s_handle installed by the following function:-]

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000b3d67d sequencer+0x24f()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000bd2b26 tdsrecv_language+0xca()

18:00000:00572:2015/07/03 06:23:27.28 kernel  [Handler pc: 0x00000000017c99ca ut_handle installed by the following function:-]

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000b15748 conn_hdlr+0xa2c()

18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000000b5c385 kpstartproc+0x56()

18:00000:00572:2015/07/03 06:23:27.28 kernel  end of stack trace, spid 572, kpid 1279918580, suid 2311

=======================================
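For comparing this trace against the module named in KBA 2057830, it can help to reduce it to the bare call chain. Below is a minimal sketch that pulls the function names out of the `pc:` lines; the line format is assumed from the trace pasted above, and the function name `stack_frames` is hypothetical. The `[Handler pc: ...]` lines are ignored because they carry no `+0x` offset.

```python
import re

# Frame lines in the trace above look like:
# "... kernel  pc: 0x00000000012be3eb ptn_clr_hintpage+0x24()"
# Handler lines ("[Handler pc: 0x... hdl_stack installed by ...]") have no
# "+0x<offset>()" suffix, so this pattern does not match them.
FRAME = re.compile(r"pc: 0x[0-9a-fA-F]+ (\w+)\+0x[0-9a-fA-F]+\(\)")


def stack_frames(lines):
    """Return the function names of the frame lines, innermost frame first."""
    frames = []
    for line in lines:
        match = FRAME.search(line)
        if match:
            frames.append(match.group(1))
    return frames


if __name__ == "__main__":
    sample = [
        "18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000012be3eb ptn_clr_hintpage+0x24()",
        "18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x0000000001261321 pg_deallpage+0x35c()",
        "18:00000:00572:2015/07/03 06:23:27.28 kernel  [Handler pc: 0x00000000015b8e9d hdl_backout_msg installed by the following function:-]",
        "18:00000:00572:2015/07/03 06:23:27.28 kernel  pc: 0x00000000012db031 writetext+0x428()",
    ]
    print(" <- ".join(stack_frames(sample)))
```

Applied to the whole trace, the chain runs from writetext through txtimg_get_datarow, txtimg__search_textpointer and the bt__ index routines down to pg_deallpage and ptn_clr_hintpage, which is the module named in the KBA.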