Index creation plan estimates

Question

ACDBA 421

Hi All,

Hope everyone is safe.

I have a SQL Server 2017 database with its compatibility level set to 130 (one version below the server's).

I have a partitioned table with around 100 million rows that already has a clustered index. When I create a nonclustered index (NCI), CPU stays at 100% for almost 2 hours and the server is brought to its knees.

The NCI has no included columns. The server is a 128GB, 16-CPU VM.

Any idea why it's so slow and consumes most of the CPU? The waits seem to be memory/CPU. Will there be any difference if we switch compatibility level to 140?

The execution plan and the details are attached.

Thanks,
ACDBA

YufeiShao-msft 7,146 Reputation points

2022-02-07T09:04:28.707+00:00

Hi, the optimizer's estimated cost for this operation, along with the other costs in the plan, seems too high. We may have to look at the rest of the plan, or even fire up the debugger, to see what choices the optimizer made along the way. The costs are not only I/O and CPU: there are additional costs associated with a given operator that are reflected in its total cost but not in its I/O and CPU cost estimates.

https://www.sql.kiwi/2010/09/inside-the-optimizer-plan-costing.html
https://www.scarydba.com/2020/12/14/getting-started-reading-execution-plans-highest-cost-operator/

Answer 1

You could use the query below to track the progress and get an estimate of the completion time.
From your plan, it looks like the sort consumes most of the CPU.
Also, this may be obvious, but try to do this when there is little or no load.

Basically, run SET STATISTICS PROFILE ON; in the session where you will run the CREATE INDEX script, note that session's SPID, and enter it in the query below, which you run in another tab.

This will help you understand whether, after 2 hours, you were at 99% or still at 10%.

/*
Remember to run
    SET STATISTICS PROFILE ON;
in the session that runs the index creation script.
*/

DECLARE @SPID INT = 64; -- session ID of the CREATE INDEX session

;WITH agg AS
(
    SELECT SUM(qp.[row_count]) AS [RowsProcessed],
           SUM(qp.[estimate_row_count]) AS [TotalRows],
           MAX(qp.last_active_time) - MIN(qp.first_active_time) AS [ElapsedMS],
           MAX(IIF(qp.[close_time] = 0 AND qp.[first_row_time] > 0,
                   [physical_operator_name],
                   N'<Transition>')) AS [CurrentStep]
    FROM sys.dm_exec_query_profiles qp
    WHERE qp.[physical_operator_name] IN (N'Table Scan', N'Clustered Index Scan', N'Sort')
    AND   qp.[session_id] = @SPID
), comp AS
(
    SELECT *,
           ([TotalRows] - [RowsProcessed]) AS [RowsLeft],
           ([ElapsedMS] / 1000.0) AS [ElapsedSeconds]
    FROM agg
)
SELECT [CurrentStep],
       [TotalRows],
       [RowsProcessed],
       [RowsLeft],
       -- NULLIF returns NULL instead of raising a divide-by-zero error
       -- before any rows have been counted.
       CONVERT(DECIMAL(5, 2),
               (([RowsProcessed] * 1.0) / NULLIF([TotalRows], 0)) * 100) AS [PercentComplete],
       [ElapsedSeconds],
       (([ElapsedSeconds] / NULLIF([RowsProcessed], 0)) * [RowsLeft]) AS [EstimatedSecondsLeft],
       DATEADD(SECOND,
               (([ElapsedSeconds] / NULLIF([RowsProcessed], 0)) * [RowsLeft]),
               GETUTCDATE()) AS [EstimatedCompletionTime]
FROM comp;

See this for reference:
https://dba.stackexchange.com/questions/139191/sql-server-how-to-track-progress-of-create-index-command
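
For the other tab (the session that actually builds the index), a minimal sketch could look like the following. The table, column, and index names are hypothetical placeholders, not taken from the poster's schema; substitute your own definition.

```sql
-- Session 1: the tab that builds the index.
-- dbo.BigTable, Col1 and IX_BigTable_Col1 are hypothetical names.

-- Run this first, on its own, and note the value: it is the session ID
-- to assign to @SPID in the monitoring query.
SELECT @@SPID AS [SessionId];

-- Enable per-operator runtime counters for this session so that
-- sys.dm_exec_query_profiles has rows to report while the statement runs.
SET STATISTICS PROFILE ON;

CREATE NONCLUSTERED INDEX IX_BigTable_Col1
    ON dbo.BigTable (Col1);

SET STATISTICS PROFILE OFF;
```

The monitoring query only returns rows while the CREATE INDEX statement is executing, so start it in the second tab after the build has begun.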

Answer 2

Erland Sommarskog 121.4K MVP Volunteer Moderator

Will there be any difference if we switch compatibility level to 140?

I would not really expect so, but the only way to find out is to test.
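
A test along those lines might be sketched as below, ideally on a restored copy rather than the production database. The database name MyDatabase is a hypothetical placeholder.

```sql
-- [MyDatabase] is a hypothetical name; use a test copy of the real database.

-- Confirm the current level (130 in the question).
SELECT name, compatibility_level
FROM sys.databases
WHERE name = N'MyDatabase';

-- Switch to the SQL Server 2017 level for the test run.
ALTER DATABASE [MyDatabase] SET COMPATIBILITY_LEVEL = 140;

-- ... create the index here and record the elapsed time and CPU ...

-- Revert so the rest of the workload keeps its current behavior.
ALTER DATABASE [MyDatabase] SET COMPATIBILITY_LEVEL = 130;
```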

Two hours is a wee bit long, but there is a lot of data to be read, processed, and written. The bottleneck could very well be in your I/O subsystem, which gets overtaxed.

You could sample sys.dm_io_virtual_file_stats while the operation is running and compute the average response time for the I/O requests to get an idea of how your I/O subsystem is performing.

Also, you said VM, but what sort of? Is it on-prem or in the cloud?
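
One way to do the sampling described above is to take two snapshots of sys.dm_io_virtual_file_stats and divide the change in stall time by the change in request count. This is only a sketch: the one-minute interval and the temp table name #io_before are arbitrary choices, and it should be run while the CREATE INDEX is active.

```sql
-- Snapshot 1: the counters are cumulative since the instance started.
SELECT database_id, file_id,
       num_of_reads, io_stall_read_ms,
       num_of_writes, io_stall_write_ms
INTO #io_before
FROM sys.dm_io_virtual_file_stats(NULL, NULL);

-- Sampling interval (arbitrary).
WAITFOR DELAY '00:01:00';

-- Snapshot 2 minus snapshot 1 gives the activity during the interval.
SELECT DB_NAME(a.database_id) AS [DatabaseName],
       a.file_id,
       a.num_of_reads  - b.num_of_reads  AS [Reads],
       a.num_of_writes - b.num_of_writes AS [Writes],
       -- Average latency = stall time / number of requests.
       -- NULLIF yields NULL for files with no I/O in the interval.
       (a.io_stall_read_ms  - b.io_stall_read_ms) * 1.0
           / NULLIF(a.num_of_reads  - b.num_of_reads, 0)  AS [AvgReadMs],
       (a.io_stall_write_ms - b.io_stall_write_ms) * 1.0
           / NULLIF(a.num_of_writes - b.num_of_writes, 0) AS [AvgWriteMs]
FROM sys.dm_io_virtual_file_stats(NULL, NULL) AS a
JOIN #io_before AS b
  ON b.database_id = a.database_id
 AND b.file_id     = a.file_id
ORDER BY [AvgWriteMs] DESC;

DROP TABLE #io_before;
```

Passing NULL for both arguments returns every file on the instance, which includes tempdb and the log files as well as the data files of the database being indexed.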

ACDBA 421 Reputation points

2022-02-05T18:55:29.927+00:00

Hi Erland,

Thank you so much for your reply. This is an on-prem VMware-hosted VM. VM host and cluster utilization look good.
I did check with the storage and Wintel team and confirmed that there are no issues. But I will check sys.dm_io_virtual_file_stats, as you proposed, to make sure.

Regards,
ACDBA

Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2022-02-05T19:00:25.963+00:00

I did check with the storage and Wintel team and confirmed that there are no issues.

They always say that. :-)

More seriously, you need to check with sys.dm_io_virtual_file_stats to get the numbers that SQL Server sees. Looking downstream, things may look fine, because they are looking below the bottleneck.

ACDBA 421 Reputation points

2022-02-07T18:04:09.173+00:00

The storage team has provided screenshots showing very little latency. Checking host VM utilization.

ACDBA 421 Reputation points

2022-02-09T09:55:43.697+00:00

Validated the stats under lower load, and they look OK.

ACDBA 421 Reputation points

2022-02-09T09:56:12.633+00:00

Not much difference in the stats during peak load.

Erland Sommarskog 121.4K Reputation points MVP Volunteer Moderator

2022-02-09T22:09:44.253+00:00

If that is the data from sys.dm_io_virtual_file_stats, I am not too alarmed.

Another possible factor is that the VM host is overcommitted on CPU. That is, say that there are 30 VMs on the host, each with 16 CPUs like yours, but the host only has 160 cores. That may be OK, as long as all the VMs are not running CPU-intensive work simultaneously.
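
To put numbers on that hypothetical example (the 30 VMs and 160 cores are illustrative figures from the comment above, not measurements from the poster's host), the overcommit ratio works out as follows:

```sql
-- Illustrative figures only: 30 VMs x 16 vCPUs each on a 160-core host.
SELECT 30 * 16           AS [TotalVcpus],    -- 480 vCPUs allocated
       160               AS [HostCores],
       (30 * 16) / 160.0 AS [VcpusPerCore];  -- 3 vCPUs per physical core
```

At that ratio, a 16-vCPU VM running a CPU-bound index build competes with other guests for physical cores whenever they are busy at the same time.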

ACDBA 421 Reputation points

2022-02-10T08:26:12.25+00:00

Thank you so much. Wintel has provided details of host utilization, and it seems OK. We have requested a memory increase. Let's see how it goes.

Answer 3

Like Erland, I would not expect the compatibility level to make a difference in index creation performance. I ran a test on my workstation to verify and observed no performance difference between compatibility levels 130 and 140. It took a little over one minute to create a partitioned single-column nonclustered rowstore index on a 550-million-row clustered columnstore partitioned table. This was using a Hyper-V VM (SQL 2017 CU28, 64GB RAM, 16 cores).

There may be particulars of your index/partitioning that might contribute to the high elapsed time but I suggest you first rule out VM infrastructure that isn't providing sufficient CPU cycles. 2 hours seems quite high. Also, consider applying the latest CU if you have not already done so.

ACDBA 421 Reputation points

2022-02-05T18:56:46.613+00:00

Thank you @Dan Guzman for the advice. I can confirm the latest SP is applied.
