Help Understanding Execution Plans

Question

Help Understanding Execution Plans

Mike-E-angelo 611

I am having a heck of a time learning SQL performance guidance and need some help. My application has been growing over the past year and one table in particular has grown to over 590K rows. Here is it's schema:

User's image

It's a very simple table and I have been struggling recently with Sql Server initialization around this table whenever it scans the keys, where queries will take anywhere up to 2:30 minutes to scan them all. I have another thread where I discuss this that started this whole rabbit hole for me as I was receiving Timeout exceptions when my application initialized and also randomly during operation.

Anyway, back to my current plight. I ended up making a view that helped reduce the time down from minutes to seconds:

CREATE VIEW [dbo].[View_ExternalProcess_Properties] AS
SELECT pu.ExternalProcessId AS Id, pu.Created, pu.[Status], pu.[Message] FROM ProcessUpdate pu
WHERE ID = (SELECT TOP(1) Id FROM ProcessUpdate p2 WHERE p2.ExternalProcessId = pu.ExternalProcessId ORDER BY Id DESC)

I thought this solved my problem but after running my application I landed on another problem that I will attempt to demonstrate with a few queries. Here is this first:

CHECKPOINT; 
GO 
DBCC DROPCLEANBUFFERS; 
GO
DBCC FREEPROCCACHE;
SELECT CASE
    WHEN EXISTS (
        SELECT 1
        FROM ExternalProcess e JOIN [View_ExternalProcess_Properties] p ON p.Id = e.Id 
        WHERE [e].[Discriminator] = N'IssuanceProcess' AND [p].[Status] = 4) THEN CAST(1 AS bit)
    ELSE CAST(0 AS bit)
END

Notice the [Status] = 4 . When I run this on my Azure Sql S1 server, this executes in 4 seconds. That is very acceptable to me. However, if I change this query to:

CHECKPOINT; 
GO 
DBCC DROPCLEANBUFFERS; 
GO
DBCC FREEPROCCACHE;
SELECT CASE
    WHEN EXISTS (
        SELECT 1
        FROM ExternalProcess e JOIN [View_ExternalProcess_Properties] p ON e.Id = p.Id 
        WHERE [e].[Discriminator] = N'IssuanceProcess' AND [p].[Status] = 6) THEN CAST(1 AS bit)
    ELSE CAST(0 AS bit)
END

(Basically, changing the Status from 4 to 6)

This takes 12 seconds to complete, or 3-times longer than when Status is 4.

My concern is that as I add more rows to this table the startup time will only increase as the scan would have to scan more rows (I would like confirmation on this if possible) and it's only a matter of time before I potentially get timeouts again in my application because it's taking so long to scan these rows.

Outside of this, I have taken the time to review the execution plans for the above queries and they are confusing to me. Here is the Status = 4 :

User's image

And the Status = 6 :

User's image

I've outlined in red rectangles the areas that concern me. It appears there are more rows than expected, if I understand correctly, and leading to values higher than 100% for total rows processed. Additionally, for the Status = 6, there are more of these rows which seem to be the reason why it's taking longer.

I would great appreciate any assistance in alleviating my confusion in learning how to properly diagnose these performance issues.

Thank you for any assistance you can provide.

LiHongMSFT-4306 31,616 Reputation points

2024-03-11T02:30:25.1566667+00:00

Hi @Mike-E-angelo

Do you have further question on this issue, could we offer more support?

If any answer helps on your issue, you could mark it as answer so other user with similar problem could see this easier. :)

Answer accepted by question author

0 additional answers

Your answer

LiHongMSFT-4306 31,616 Reputation points

2024-03-11T02:30:25.1566667+00:00

Hi @Mike-E-angelo

Do you have further question on this issue, could we offer more support?

If any answer helps on your issue, you could mark it as answer so other user with similar problem could see this easier. :)

Answer 1

Erland Sommarskog 129.6K MVP Volunteer Moderator

As you may guess, what you just have entered is not a small topic - but a huge one. But also one which is very rewarding the more you learn.

You are making a correct observation when you say "I've outlined in red rectangles the areas that concern me. It appears there are more rows than expected," When it says "334 of 12" it means that the optimizer expected to read 12, but in fact read 334.

A very common reason to that queries are slow is that the estimates that the optimizer works from are inaccurate. Had the optimizer known that the Index Seek would return over 300 rows, it might have gone for another plan.

There can be two reasons for this:

Statistics are outdated.
Auto-parameterisation is kicking in. In the query text above the plan, does it say 6, or does it say @1?

Mike-E-angelo 611 Reputation points

2024-03-08T22:36:37.4166667+00:00
Thank you for your continued assistance @Erland! Here's what I see for the text:

SELECT 1 FROM ExternalProcess e JOIN [View_ExternalProcess_Properties] p ON e.Id = p.Id WHERE [e].[Discriminator] = N'IssuanceProcess' AND [p].[Status] = 6

So it does not appear this query is parameterized. Maybe it's the statistics? How do I update them?
Erland Sommarskog 129.6K Reputation points MVP Volunteer Moderator

2024-03-08T22:47:13.9433333+00:00
You would say

UPDATE STATISTICS tbl

There are some variations, so that you can request update only of a specific statistics object. You can also specify the sampling frequence, and you can add WITH FULLSCAN to force reading all data. You can try this if a plain UPDATE STATISTICS does not work out.

Now, how well this works out, depends on the number of distinct values for the column. The distribution is saved in a histogram which can have up to 200 steps, and the way it is constructed, it can handle up 400 distinct values with full accuracy.

From a column called Status, I would only expect a handful of distinct values, far less than 400. But if there are fact, say, 800 steps, it could be that 4 and 6 falls into the same bucket, and therefore will get the same estimates.
Mike-E-angelo 611 Reputation points

2024-03-09T08:49:32.2766667+00:00
Thank you for the continued suggestions Erland. Unfortunately I was not able to see an improvement. To start, there are only 7 Status Values:

public enum ProcessStatus { New, Queued, Processing, Completed, Error, Canceled, Paused }

I tried both variants of Update Statistics but I still get overages if I clear the caches:

CHECKPOINT; GO DBCC DROPCLEANBUFFERS; GO DBCC FREEPROCCACHE; SELECT CASE WHEN EXISTS ( SELECT 1 FROM ExternalProcess e JOIN [View_ExternalProcess_Properties] p ON e.Id = p.Id WHERE [e].[Discriminator] = N'IssuanceProcess' AND [p].[Status] = 6) THEN CAST(1 AS bit) ELSE CAST(0 AS bit) END

Results in the following (note that the numbers are different but the overages still occur in red):

Is there a way to find out what those 300 rows are?
Erland Sommarskog 129.6K Reputation points MVP Volunteer Moderator

2024-03-09T10:37:15.6866667+00:00
I would expect that if you run

UPDATE STATISTICS dbo.ProcessUpdate IX_ProcessUpdate_Status WITH FULLSCAN

You should get correct estimates. Then again, you may still get the same execution plan. The execution plan you have still looks meaningful a for a few hundred rows. But since there are more rows to process, it is not surprising that it takes longer.

And overall, performance tuning on an S1 is a little funny. It's like buying a Trabant and then try to make it go really fast.
Mike-E-angelo 611 Reputation points

2024-03-09T13:28:41.8866667+00:00

Bummer, that didn't seem to do anything either. :( Even though I have Microsoft for Startup credits and I can use a much faster server, my credits expire in December so I would like to prepare for the worst case scenario. Cheaper server is the better server in this case. But also, buying a more expensive server to avoid performance issues seems a bit like sweeping it under the rug.

That stated, is there a way to see how those 300 rows are being selected? Maybe there's something in my view that is making this happen.
Mike-E-angelo 611 Reputation points

2024-03-09T13:37:50.3966667+00:00

Ahhh I figured it out. Those numbers are estimated vs. actual:

Still doesn't say how to improve the read/scan times 🤔
Erland Sommarskog 129.6K Reputation points MVP Volunteer Moderator

2024-03-09T16:32:34.5666667+00:00
Yes, you got it. The 334 are the actual rows, and 100 is the estimate. With fullscan statistics and seven values, I would expect the statistics to be accurate. But as I said, it may not matter that much in the end.

But also, buying a more expensive server to avoid performance issues seems a bit like sweeping it under the rug.

It's certainky correct that you cannot always throw hardware at a performance problem. There are certainly lots of system out there that could run on leaner hardware, if the developers had written better qureies and applied better indexing.

But an S1... Extract a bacpac and load the database on your laptop, and try the same queries. I would execpt execution time for that query to be sub-second. I don't have the full picture, but the execution you have shown looks perfectly normal.

Maybe, if I would know about about your tables and business I could suggest improvements, but it would still be an S1, and I would expect anything startling.

One thing you could try, though, is to rebuild the indexes, in case you have very heavy fragmentation:

ALTER INDEX ALL ON tbl REBUILD WITH (ONLINE = ON)

Beware that this command can take quite some time to complete on your S1.

Note: I am not 100% sure that ONLINE is supported on S1. If you leave it, the table will be unavailable until the rebuild has completed.
Mike-E-angelo 611 Reputation points

2024-03-09T17:38:18.66+00:00

Thank you very much for your continued assistance Erland. ONLINE did indeed worked and took about five minutes... almost as long as some of these queries :D But unfortunately that did not improve the estimates vs actuals.

However, please be aware that my concern was based on misunderstanding those numbers. I thought it was saying that there were 300% more rows being processed than necessary, leading to the long delays. Now that I understand it's estimates vs. actual that doesn't really matter.

Now, as far as doing local development with 59K rows and trying it out, that is a surprising assessment. If it takes 12 seconds to process 59K rows (let's round to 60K), then that means if I have a million it will take roughly 3.33 minutes to process these keys, correct? That is my concern here, and why I am trying to address this now rather than later.

If I have something fundamentally misunderstood here that would be good to know. :)
Erland Sommarskog 129.6K Reputation points MVP Volunteer Moderator

2024-03-09T21:34:00.6566667+00:00

Now that I understand it's estimates vs. actual that doesn't really matter.

Yes and no. 100 vs. 300 may not matter much in practice. But say that the estimate is 1 row, and the actual is one million. In this case, it is very likely that the chosen plan is not the best one.

Now, as far as doing local development with 59K rows and trying it out, that is a surprising assessment. If it takes 12 seconds to process 59K rows (let's round to 60K), then that means if I have a million it will take roughly 3.33 minutes to process these keys, correct?

Wtih the same execution plan, yes. But there are a number of if and buts here, and one is throttling, which I am not sure how it would affect. Throttling is part of the story why your S1 is so slow - you only get the performance you pay for.

Another important thing is of course the distributon of these Status values. I assume most rows have Status = Completed. So as long as number of rows with status 4 or 6 are the same, the performance should remain the same for these queries, since the index on Status is used. But if the business would expand so that you get more rows with these status values, execution would increase. But the key is that the total size of the table is not relevant, when the matching rows are found through an index.
Mike-E-angelo 611 Reputation points

2024-03-10T09:51:13.08+00:00
But the key is that the total size of the table is not relevant, when the matching rows are found through an index

This is not the case after it's been scanned, correct? In order to use the index it must first scan them all and if it takes 3 minutes to do this with a million rows this is a problem that must be accounted for.

Since AppService has a warm up feature which I already use, I can add calls to this process that makes sure all the heavy lifters are cached before swapping into production.

That takes care of that problem. :D

As for the View, the mystery continues here. I landed on a new query that gets similar results for Status 4 and 6 ... roughly 4-5 seconds each now on a clean cache:

SELECT pu.ExternalProcessId as [Id], pu.[Created], pu.[Status], pu.[Message] FROM ProcessUpdate pu WHERE pu.[Id] IN(SELECT MAX(Id) FROM ProcessUpdate p Group By p.ExternalProcessId)

Here's the resulting Execution Plan:

Finally, all of this has made me analyze the model, and really, there is no historical reason for storing the rows like I am doing. This table is used to present messaging to the user on the status of a workflow. It was one of the first things I built when I started this 4 years ago and I was super impressed that I was able to get it to work so well. :D But now that it's causing so many problems it might be worth cleaning up the table after the process completes and save the completed state/message on the table itself.

Still, this is good analysis/exploration for when the real bullets start flying when other tables start creeping up in size + row count.
Erland Sommarskog 129.6K Reputation points MVP Volunteer Moderator

2024-03-10T10:44:13.3+00:00

This is not the case after it's been scanned, correct? In order to use the index it must first scan them all and if it takes 3 minutes to do this with a million rows this is a problem that must be accounted for.

The plans for statuses 4 and 6 have an Index Seek, which means that it goes straight to the rows with the Status values. Therefore, the total size of the table is irrelevant.

On the other hand, the plan in your most recent post includes an Index Scan, which means that the entire index is scanned, so in this case the total number of rows in the table does matter.

Share via

Help Understanding Execution Plans

0 additional answers

Your answer