Azure Cosmos DB does not scale as the number of logical partitions grow: How to also reduce RU consumption when executing a query in Azure Cosmos DB SQL API

Question

Azure Cosmos DB does not scale as the number of logical partitions grow: How to also reduce RU consumption when executing a query in Azure Cosmos DB SQL API

Magnus Fors 21

Setup 1

We have Cosmos DB (setup as serverless) that contains approx. 11,000,000 documents in one container. The container is partitioned by the property storageContainerName (see the document JSON below). Currently we have 36 logical partitions and the number of documents vary from 800,000 to 10,000 in the different partitions (in the longer term we expect the number of documents to be evenly distributed among the partitions).

The document structure is identical to all documents in the database and it looks like below.

{
"id": "5c4eab89-3a16-45af-b54d-bc243c36e5b5:blobs:storage:recordings:20230412:88bc0094-5114-42be-a893-6417ca74fdca-0_C:20230412055549629:V1:20230412062831098-20230412062848128",
"storageContainerName": "5c4eab89-3a16-45af-b54d-bc243c36e5b5",
"sizeInBytes": 676807,
"startTime": "20230412062831098",
"videoSourceId": "88bc0094-5114-42be-a893-6417ca74fdca-0",
"ttl": 2592019,
"_rid": "vpx1AKoCsb2B5QAAAAAAAA==",
"_self": "dbs/vpx1AA==/colls/vpx1AKoCsb0=/docs/vpx1AKoCsb2B5QAAAAAAAA==/",
"_etag": ""1800964f-0000-0c00-0000-64364fa10000"",
"_attachments": "attachments/",
"_ts": 1681280929
}

The indexing policy of the container looks like below (default index everything).

{
"indexingMode": "consistent",
"automatic": true,
"includedPaths": [
{
"path": "/*"
}
],
"excludedPaths": [
{
"path": "/"_etag"/?"
}
]
}

Query 1 When executing the following query for the partition 89bd41d7-ad89-46cf-80ad-4c14ff012e70 that contains 11,000 documents

select sum(c.sizeInBytes) from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70"

we get the following query stats.

Request Charge: 2225.94 RUs
Retrieved document count: 0
Retrieved document size: 0 bytes
Output document count: 1
Output document size: 235 bytes
Index hit document count: 0
Index lookup time: 16605.2899 ms
Document load time: 0 ms
Query engine execution time: 0.050100000000000006 ms
System function execution time: 0 ms
User defined function execution time: 0 ms
Document write time: 0 ms

Query 2 When executing the following query for the partition 89bd41d7-ad89-46cf-80ad-4c14ff012e70 that contains 11,000 documents

select c.startTime from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70" and c.videoSourceId = "5862401d-d172-49f2-a7b4-e4e284a88c71-0" and c.startTime > "20230508113340124" and c.startTime < "20230509113340124"

we get the following query stats.

Request Charge: 389.08000000000004 RUs
Retrieved document count: 13029
Retrieved document size: 7746492 bytes
Output document count: 5095
Output document size: 173379 bytes
Index hit document count: 5108.88
Index lookup time: 5.46 ms
Document load time: 9351.130000000001 ms
Query engine execution time: 114.16000000000001 ms
System function execution time: 0 ms
User defined function execution time: 0 ms
Document write time: 5.510000000000001 ms

Setup 2

We have set up another Cosmos database with identical configuration as the one in Setup 1 above, but with 281,000 documents in the container. The difference compared to Setup 1 is that we have only one partition so far in that database.

Query 1 When executing the following query for the partition 5c4eab89-3a16-45af-b54d-bc243c36e5b5 that contains 281,000 documents

select sum(c.sizeInBytes) from c where c.storageContainerName = "5c4eab89-3a16-45af-b54d-bc243c36e5b5"

we get the following query stats.

Request Charge: 74.7 RUsRetrieved document count: 0
Retrieved document size: 0 bytes
Output document count: 1
Output document size: 89 bytes
Index hit document count: 0
Index lookup time: 18.3 ms
Document load time: 0 ms
Query engine execution time: 0.02 ms
System function execution time: 0 ms
User defined function execution time: 0 ms
Document write time: 0 ms

Query2 When executing the following query for the partition 5c4eab89-3a16-45af-b54d-bc243c36e5b5 that contains 281,000 documents

select c.startTime from c where c.storageContainerName = "5c4eab89-3a16-45af-b54d-bc243c36e5b5" and c.videoSourceId = "88bc0094-5114-42be-a893-6417ca74fdca-0" and c.startTime > "20230507000035000" and c.startTime < "20230508000035000"

we get the following query stats.

Request Charge: 160.85 RUs
Retrieved document count: 5065
Retrieved document size: 3008610 bytes
Output document count: 5065
Output document size: 172261 bytes
Index hit document count: 5065
Index lookup time: 0.89 ms
Document load time: 73.62 ms
Query engine execution time: 6.88 ms
System function execution time: 0 ms
User defined function execution time: 0 ms
Document write time: 0.62 ms

Comparison

In both setups we run queries against a single partition (i.e. no cross partition queries). In Setup 1 we run the queries against a partition that contains 11,000 documents. In Setup 2 we run the queries against a partition that contains 281,000 documents.

Query 1: Despite the partition in Setup 2 has 25 times more documents than the one in Setup 1 the execution time is almost 1,000 times faster in Setup 2 and the RU consumption is 30 times higher in Setup 1!!!

Query 2: Despite the partition in Setup 2 has 25 times more documents than the one in Setup 1 the execution time is almost 10 times faster in Setup 2 and the RU consumption is almost 2.5 times higher in Setup 1!!!

If the RU consumption and the execution times increase dramatically with the number of partitions then the solution will not scale. It seems like the indexes are global and not on a partition level. Is this how it is supposed to work?

N.B.

Another odd behaviour that we have run into is that if you run a sum on a property like below select sum(c.sizeInBytes) from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70" the execution time decreases by a factor 1,000 and the RUs decreases by a factor 30 if the summed property has just a few possible values compared to when the property has no duplicate values at all.

Oury Ba-MSFT 20,931 Reputation points Microsoft Employee Moderator

2023-05-12T21:30:13.69+00:00

Magnus Fors Thank you for reaching out.

Which Cosmos DB API are you using?

Regards,

Oury
Magnus Fors 21 Reputation points

2023-05-15T14:41:25.96+00:00

We are using the NoSQL API
Magnus Fors 21 Reputation points

2023-05-16T06:49:49.34+00:00

Thank you for clarifying how the indexes are set up. That explains the behaviour. Do you have any good suggestion how to make it scale better, i.e. so the RU consumption does not increase that much if the container total size grow but the individual logical partition sizes stay the same (increase number of partitions)?

The composite index on {storageContainerName, sizeInBytes} improved RU consumption from 518 to 11 RUs.

We have tried a composite index on {storageContainerName, videoSourceId} on the query below there was actually no change at all when it comes to RU consumption.

select c.startTime from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70" and c.videoSourceId = "5862401d-d172-49f2-a7b4-e4e284a88c71-0" and c.startTime > "20230508113340124" and c.startTime < "20230509113340124"
How would an optimal index setup look for the query above? A composite on {storageContainerName, videoSourceId, startTime} perhaps?

The final question we had in the original post above was about the odd behavior of a much better performance when doing a sum() on an indexed field that have very many duplicate values compared to doing a sum() on an indexed field that have almost no duplicate values?
Magnus Fors 21 Reputation points

2023-05-17T09:02:05.82+00:00

Thanks a lot this was very helpful. FYI adding the {storageContainerName, videoSourceId, startTime} composite index actually reduced the RU consumption by 50% when executing the query
select c.startTime from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70" and c.startTime > "20230508113340124" and c.startTime < "20230509113340124"
Oury Ba-MSFT 20,931 Reputation points Microsoft Employee Moderator

2023-05-18T19:48:22.7166667+00:00

Magnus Fors Thank you so much for confirming this. I am glad it was helpful.

Regards,

Oury

Accepted answer

1 additional answer

Your answer

Oury Ba-MSFT 20,931 Reputation points Microsoft Employee Moderator

2023-05-12T21:30:13.69+00:00

Magnus Fors Thank you for reaching out.

Which Cosmos DB API are you using?

Regards,

Oury
Magnus Fors 21 Reputation points

2023-05-15T14:41:25.96+00:00

We are using the NoSQL API
Magnus Fors 21 Reputation points

2023-05-16T06:49:49.34+00:00

Thank you for clarifying how the indexes are set up. That explains the behaviour. Do you have any good suggestion how to make it scale better, i.e. so the RU consumption does not increase that much if the container total size grow but the individual logical partition sizes stay the same (increase number of partitions)?

The composite index on {storageContainerName, sizeInBytes} improved RU consumption from 518 to 11 RUs.

We have tried a composite index on {storageContainerName, videoSourceId} on the query below there was actually no change at all when it comes to RU consumption.

select c.startTime from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70" and c.videoSourceId = "5862401d-d172-49f2-a7b4-e4e284a88c71-0" and c.startTime > "20230508113340124" and c.startTime < "20230509113340124"
How would an optimal index setup look for the query above? A composite on {storageContainerName, videoSourceId, startTime} perhaps?

The final question we had in the original post above was about the odd behavior of a much better performance when doing a sum() on an indexed field that have very many duplicate values compared to doing a sum() on an indexed field that have almost no duplicate values?
Magnus Fors 21 Reputation points

2023-05-17T09:02:05.82+00:00

Thanks a lot this was very helpful. FYI adding the {storageContainerName, videoSourceId, startTime} composite index actually reduced the RU consumption by 50% when executing the query
select c.startTime from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70" and c.startTime > "20230508113340124" and c.startTime < "20230509113340124"
Oury Ba-MSFT 20,931 Reputation points Microsoft Employee Moderator

2023-05-18T19:48:22.7166667+00:00

Magnus Fors Thank you so much for confirming this. I am glad it was helpful.

Regards,

Oury

Answer 1

@Magnus Fors Magnus Fors

Thank you for clarifying this.

The indexes are set up at a physical partition level and not at logical partition level. So, even if the logical partition has only 11K docs but the physical partition has lot of unique sizeInBytes value then the charges as going to be high as the index is for all data in the physical partition.

Composite indexes can be used for this case to improve performance. The index needs to be on {storageContainerName, sizeInBytes}.

Tuning query performance with Azure Cosmos DB

Regards,

Oury

Answer 2

Magnus Fors

The composite index on {storageContainerName, sizeInBytes} is good enough for the query. You don’t need a composite index on {storageContainerName, videoSourceId}. Composite index on all three could make it better {storageContainerName, videoSourceId, startTime}. But you need to check how much better it gets and is that improvement significant.

When a composite index gets longer, then the number of queries it is applicable might reduce. For example, if there is a query with just storageContainerName and startTime then the 3-path composite index cannot be used but the 2-path composite index can be used.

select c.startTime from c where c.storageContainerName = "89bd41d7-ad89-46cf-80ad-4c14ff012e70" and c.startTime > "20230508113340124" and c.startTime < "20230509113340124"

The final question is about the odd behavior of a much better performance when doing a sum() on an indexed field that have very many duplicate values compared to doing a sum() on an indexed field that have almost no duplicate values?

When a property has lot of duplicate values like enums, then the index is very compact, and only a smaller number of pages will be scanned. But in case of unique values, the index will be sparse and span lot more pages which is why they are expensive.

https://learn.microsoft.com/en-us/azure/cosmos-db/nosql/query-metrics

https://learn.microsoft.com/en-us/azure/cosmos-db/index-policy

https://devblogs.microsoft.com/cosmosdb/new-ways-to-use-composite-indexes/

Regards,

Oury

Share via

Azure Cosmos DB does not scale as the number of logical partitions grow: How to also reduce RU consumption when executing a query in Azure Cosmos DB SQL API

Setup 1

Setup 2

Comparison

N.B.

1 additional answer

Your answer