SQL2016 Upgrade to SQL2019 Long Running Query

Question

SQL2016 Upgrade to SQL2019 Long Running Query

Terry Kaczmarski 1

Hello Everyone

We are converting several ETL processes from a windows 2012, SQL2016 environment to a windows 2019, SQL2019 environment. All is going well, except for one process. This process is running longer in SQL2019 then SQL2016.

The process performs an INSERT into A, select x from B, where x is a series of substring and case statements.

The source table basically contains 1 data field, 6,000 bytes long. The target table contains over 950 data columns, the largest in our ETL processes.

The source table contains 4.7 million rows and loads in 47 minutes, 100,000 rows per minute.

I split out 1 million rows for testing purposes, this runs in 10 minutes. Again, 100,000 rows per minute.

On my SQL2016, windows 2012 server, a full load runs in 12 - 14 minutes, while the 1 million sampling runs in 2 minutes.

For grins, I stalled SQL2016 on my ETL 2019 server and ran the same tests. Full load 7 minutes 14 seconds, whereas the 1 million sampling took 1 minute 32 seconds.

All three instances have a MAXDOP of 8 and a Cost Threshold For Parallelism (CTFP) of 50.

While the actual ETL process is procedure based, I pulled the insert code out and I'm running it in SSMS as t-sql commands.

I understand there are a lot of new "performance" improvements in SQL2019 and I've tried a lot of different combinations, so far, nothing has helped reduce the run times.

Some observations: A) reducing the target table to only 100 columns, 1 million rows loaded in 1 minute; 4.7 million loaded in 5 minutes. B) increasing the column count to 250 and loading 1 million rows took 6 minutes.
I cannot put my finger on it, but something around the substring/case function and how its interacting with SQL is causing me some grief.

So, I'm reaching out to SQL2019 guru's for advise and where to look, what to tune. Converting to SSMS at this time is not an option.

Your thoughts?

Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2020-11-23T22:34:39.547+00:00

What compatibility level are you using on SQL 2019? 150 or something lower? What if you lower the compat level to 130? I'm not saying that this is the solution, but this to serves narrow down what to look for.

What do you mean with "converting to SSMS"?

You did apply CU8 on the SQL 2019 instance, didn't you?

7 answers

Your answer

Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2020-11-23T22:34:39.547+00:00

What compatibility level are you using on SQL 2019? 150 or something lower? What if you lower the compat level to 130? I'm not saying that this is the solution, but this to serves narrow down what to look for.

What do you mean with "converting to SSMS"?

You did apply CU8 on the SQL 2019 instance, didn't you?

Answer 1

Terry Kaczmarski 1

Sorry, I did not mean "converting to SSMS", I meant "converting to SSIS". [Too many S's, they all look the same :) thanks for pointing that out]

My SQL2019 instance is at CU8. After the initial poor performance tests, I rebuilt all the databases from scratch as opposed to restoring them.

As far as compatibility level, on SQL2019 it was at 150. When I change the DB to 130, it still runs for 10 minutes to load 1 million rows.

Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2020-11-24T07:43:36.107+00:00
OK, so it does not seem to be a query-plan issue then.

I think next step is to look at waits. Start a new session, run the procedure, and when completed, or you have stopped it, run

SELECT * FROM sys.dm_exec_session_wait_stats WHERE session_id = @@spid ORDER BY wait_time_ms DESC

It may be worth comparing against what you see on the SQL 2016 instance on the same machine.

I am not sure that the suggestions by MiaMiao are particularly relevant. However, it could be worth checking that your configuration settings are the same on the SQL 2019 and SQL 2016 instances in case there is something you have overlooked.
Terry Kaczmarski 1 Reputation point

2020-11-24T13:20:21.817+00:00

Thank You Erland Sommarskog.

SQL2019 Execution wait stats:
wait_type waiting_tasks_count wait_time_ms max_wait_time_ms signal_wait_time_ms
PAGEIOLATCH_SH 4 2 1 0
MEMORY_ALLOCATION_EXT 111 0 0 0

SQL2016 Execution wait stats on same machine:
wait_type waiting_tasks_count wait_time_ms max_wait_time_ms signal_wait_time_ms
SOS_SCHEDULER_YIELD 2451 4 0 3
MEMORY_ALLOCATION_EXT 81 0 0 0
Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2020-11-24T21:57:52.66+00:00

There are hardly no waits at all for SQL 2019. Given your story, it seems that you made some mistake when collecting the data.

Rather than stopping the operation to run the query, you could run the query from a different window, replacing @@spid with the session_id for the window where the procedure is running.
Terry Kaczmarski 1 Reputation point

2020-11-25T18:17:21.997+00:00

Yup, I messed up. Ran the stats from the another window!

SQL2019 Execution wait stats:
wait_type waiting_tasks_count wait_time_ms max_wait_time_ms signal_wait_time_ms
SOS_SCHEDULER_YIELD 1529426 5083008 90 5082527
CXCONSUMER 1462 1227889 19310 7893
PAGELATCH_UP 2642 79941 230 5974
PAGEIOLATCH_SH 5668 30964 86 4781
PAGELATCH_SH 2765 12223 59 6139
LATCH_EX 1231 11925 211 1936
CXPACKET 34 8023 5986 138
MEMORY_ALLOCATION_EXT 3823373 3280 15 0
EXECSYNC 17 277 73 134
IO_COMPLETION 202 263 86 14
PAGELATCH_EX 4 210 70 18
PAGEIOLATCH_EX 35 29 19 4
WRITELOG 5 19 18 7
PAGEIOLATCH_UP 1 0 0 0
LOGMGR_FLUSH 1 0 0 0
RESERVED_MEMORY_ALLOCATION_EXT 66 0 0 0
SESSION_WAIT_STATS_CHILDREN 12 0 0 0
Terry Kaczmarski 1 Reputation point

2020-11-25T18:17:33.577+00:00

Results from SQL2016 on same machine:
wait_type waiting_tasks_count wait_time_ms max_wait_time_ms signal_wait_time_ms
PAGEIOLATCH_SH 239 9631 169 4
SOS_SCHEDULER_YIELD 39344 2468 102 2454
MEMORY_ALLOCATION_EXT 2510060 1322 73 0
PAGELATCH_UP 4 708 376 73
PAGEIOLATCH_UP 2 167 130 140
WRITELOG 2 100 76 24
PAGEIOLATCH_EX 1 5 5 0
Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2020-11-25T21:57:39.137+00:00

The major wait on SQL 2019 is SOS_SCHEDULER_YIELD, which means that you are CPU bound. I also note that there are CXCONSUMER and CXPACKET waits on SQL 2019, but not on SQL 2016. That, there is parallelism on SQL 2019, but not on SQL 2016.

While you said that the compatibility level did not make a difference, it certainly seems to me that there are different query plans on the two versions. Have you looked at them?

Share them if you like. Save them as .xml files and attach them.

Answer 2

m 4,276

Hi @Terry Kaczmarski ,

Your thoughts?

After your upgrade, you need do as next,please verify you have done:

• Confirm that the compatibility level of the database has been upgraded to the latest database version

• Rebuild the index on the database

• Re-update the statistics on the database

2.Please change the MAXDOP to 1 and CTFP to 5 to have a test:

USE dbname ;    
GO     

EXEC sp_configure 'show advanced options', 1;    
GO    

RECONFIGURE WITH OVERRIDE;    
GO    

EXEC sp_configure 'max degree of parallelism', 1;    
GO    

RECONFIGURE WITH OVERRIDE;    
GO    


USE dbname;    
GO    

EXEC sp_configure 'show advanced options', 1 ;    
GO   

RECONFIGURE    
GO    

EXEC sp_configure 'cost threshold for parallelism', 5 ;    
GO    

RECONFIGURE    
GO

BR,
Mia

If the answer is helpful, please click "Accept Answer" and upvote it.

Terry Kaczmarski 1 Reputation point

2020-11-24T13:17:30.797+00:00

Thank you Mia. To answer your questions, yes he compatibility level of the databases at SQL2019 are all 150. After I did a restore, I changed them. Then when performance didn't look right, I dropped them and rebuilt from scratch.

These tables do not have indexes. 100% load into a flat table [varchar(6000)], then 100% load of the data into a table with 950+ data columns.

Table structure is the same between SQL2019 and SQL2016.

Answer 3

David Browne - msft 3,851

The recommended upgrade procedure is to not change the database compatibility level after an upgrade until you have a performance baseline with Query Store enabled.

https://learn.microsoft.com/en-us/sql/database-engine/install-windows/change-the-database-compatibility-mode-and-use-the-query-store?view=sql-server-ver15

Answer 4

Hi @Terry Kaczmarski ,

SQL2019 Execution wait stats:

wait_type waiting_tasks_count wait_time_ms max_wait_time_ms signal_wait_time_ms
PAGEIOLATCH_SH 4 2 1 0
MEMORY_ALLOCATION_EXT 111 0 0 0

SQL2016 Execution wait stats on same machine:
wait_type waiting_tasks_count wait_time_ms max_wait_time_ms signal_wait_time_ms
SOS_SCHEDULER_YIELD 2451 4 0 3
MEMORY_ALLOCATION_EXT 81 0 0 0

1.Please check whether your memory is enough, set max server memory for SQL Server is 0.7 * Server Memory; And if it is possible, allocate one server is special for SQL Server,if not possible, make sure other applications in your server have enough memory and they won't preempt Sql Server's memory when they are running.

It should be emphasized that the two most common types of PAGEIOLATCH_x waits are PAGEIOLATCH_SH and PAGEIOLATCH_EX. PAGEIOLATCH_SH often happens when the user is about to access a data page, while SQL Server wants to read this page from disk to memory. If this page is frequently accessed by users, then in the final analysis, the problem is that the memory is not large enough to always cache the data page in the memory. Therefore, it is often the memory pressure that triggers SQL Server to do a lot of page reading work, which causes the disk read bottleneck. The disk bottleneck here is often a by-product of the memory bottleneck.

These tables do not have indexes. 100% load into a flat table [varchar(6000)], then 100% load of the data into a table with 950+ data columns.

Please create a clustered index for your table,this will improve the performance.

BR,
Mia

If the answer is helpful, please click "Accept Answer" and upvote it.

Answer 5

Thank You Mia and Erland

Server specifics:

This is a dedicated server to SQLSERVER for ETL processing. No other apps running on it.
2 - 3.80Ghz processors with 4 cores per socket and 8 logical processors per socket.
1571465 MB of RAM. (1.5 TeraBytes)
16,813 GB on-board SSD storage
Very controlled testing, no other processes running at the time of tests

The SQL2019 install is configured with max memory of 1331200 MB (1300 GB 85% of total)
name minimum maximum config_value run_value
max server memory (MB) 128 2147483647 1331200 1331200

The SQL2016 install is configured with default memory value since this install will be dropped.

Source Table: (1 Million rows all with ImportRecordType = '01BASE')
ImportRecID int IDENTITY(1,1) NOT NULL, (Clustered IX on this)
ImportText varchar(8000) NULL,
ImportRecordType char(7) NULL,
ImportAccount char(28) NULL,
ImportKey char(33) NULL
PK added to table and tests rerun in both 2019/2016. No impact to results.43827-sql2019-executeplan-xml.txt 43852-sql2016-executeplan-xml.txt

T-SQL command:
Insert into Target SELECT ... from source where ImportRecordType = '01BASE'

XML formatted execution plan is attached.

Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2020-11-30T22:25:13.18+00:00

The plans are quite similar. I notice that these are estimated plans, and it would have helped to see the actual plans. Then again, given the simplicity of the plans I would not that the actual plans would reveal anything startling.

I don't really have any good ideas at this point. But just to check? Collations are the same both cases, both for the source and the target tables?

Share via

SQL2016 Upgrade to SQL2019 Long Running Query

7 answers

Your answer