Data Latency between primary database and named replicas in Hyperscale database

Simon Goldshmid 20

Before I ask a question I want to describe our use case.

Multiple clients insert and update data in the primary Hyperscale database.
After the data is inserted or modified, those clients immediately send Service Bus messages to several topics.
Several function apps subscribe to those topics and start a heavy-lifting analysis of the inserted or updated data.
Right now those function apps work against the primary database.
We want to move the read-only portion of the analysis from the primary database to a named replica to reduce the load on the primary.

Since the latency between data being committed on the primary database and being replicated to the named replica is unpredictable, there is a high probability that the function apps will start the analysis before the data reaches the read-only replica. This is unacceptable.

Now the question:

Is it possible from the function app to know when the data is fully replicated?

We can't tolerate latencies greater than 1.5 seconds.

Thank you,

Accepted answer

Answer 1

Alberto Morillo 34,671 MVP Volunteer Moderator

Typical data latency for small transactions is in tens of milliseconds, however there is no upper bound on data latency. Let me share this paragraph of the documentation with you:

Data latency from the time a transaction is committed on the primary to the time it is readable on a secondary depends on current log generation rate, transaction size, load on the replica, and other factors. Typical data latency for small transactions is in tens of milliseconds, however there is no upper bound on data latency. Data on a given secondary replica is always transactionally consistent, thus larger transactions take longer to propagate. However, at a given point in time data latency and database state may be different for different secondary replicas. Workloads that need to read committed data immediately should run on the primary replica.

Source is here.
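
Since the documentation gives no upper bound, one way to respect a 1.5-second budget is to poll the replica for the expected change and fall back to the primary when the budget runs out. The following is only a minimal sketch of that idea, not an official pattern: `read_with_fallback`, `read_replica`, and `read_primary` are hypothetical names, and it assumes the Service Bus message carries something (a key or a row version) that lets `read_replica` decide whether the change is already visible.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def read_with_fallback(
    read_replica: Callable[[], Optional[T]],
    read_primary: Callable[[], T],
    deadline_seconds: float = 1.5,
    poll_interval_seconds: float = 0.05,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Poll the replica until the expected data is visible, else use the primary.

    read_replica (hypothetical) must return None while the expected change
    is not yet visible on the replica, for example because the row is
    missing or its version is older than the one named in the message.
    """
    deadline = clock() + deadline_seconds
    while True:
        result = read_replica()
        if result is not None:
            return result
        # Stop polling if another wait would overrun the latency budget.
        if clock() + poll_interval_seconds > deadline:
            break
        sleep(poll_interval_seconds)
    # The replica did not catch up in time: read committed data on the primary.
    return read_primary()
```

With this shape the primary only serves the reads that the replica could not satisfy within the budget, so most of the read load should still move off the primary when latency is in the typical tens of milliseconds.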

Simon Goldshmid 20 Reputation points

2023-06-01T20:22:16.6666667+00:00

Thank you for a quick reply. It helped.
Alberto Morillo 34,671 Reputation points MVP Volunteer Moderator

2023-06-01T20:39:39.99+00:00

You are welcome. Thank you for visiting Microsoft Q&A. Have a great day! Thank you, Oury, for your help answering the question as well. We replied within a few minutes of each other.
Oury Ba-MSFT 20,926 Reputation points Microsoft Employee Moderator

2023-06-02T00:07:45.62+00:00

Thank you @Alberto Morillo for your continuous support.

Answer 2

Oury Ba-MSFT 20,926 Microsoft Employee Moderator

Simon Goldshmid

Thank you for reaching out.

Is it possible from the function app to know when the data is fully replicated?

To ensure that your function apps only start the analysis when the data is fully replicated to the read-only replica, you can use the sys.dm_hadr_database_replica_states dynamic management view to monitor the replication status of the database. This view provides information about the synchronization state of the database on each replica.

Keep in mind that the replication latency can vary depending on the size of the database and the network conditions. You may need to adjust the frequency of your monitoring to ensure that the analysis is not started before the data is fully replicated.

How much delay is there going to be between the primary and secondary compute replicas?

Data latency from the time a transaction is committed on the primary to the time it is readable on a secondary depends on current log generation rate, transaction size, load on the replica, and other factors. Typical data latency for small transactions is in tens of milliseconds, however there is no upper bound on data latency. Data on a given secondary replica is always transactionally consistent, thus larger transactions take longer to propagate. However, at a given point in time data latency and database state may be different for different secondary replicas. Workloads that need to read committed data immediately should run on the primary replica.

Regards,

Oury

Simon Goldshmid 20 Reputation points

2023-06-02T11:18:01.8266667+00:00

When I tried to execute "SELECT * FROM sys.dm_hadr_database_replica_states" against our Hyperscale SQL Server 12.0.2000.8 database, I received the error "Invalid object name 'sys.dm_hadr_database_replica_states'".

I tried to execute it against our work database and against the master database and got the same error. Do I need a special version of the database to have this view?
Simon Goldshmid 20 Reputation points

2023-06-02T12:59:29.2466667+00:00

Thanks for the quick reply. I connected to the database with ApplicationIntent=ReadOnly and still have the same "invalid object name" error. Attached are the snapshots.
Alberto Morillo 34,671 Reputation points MVP Volunteer Moderator

2023-06-02T13:30:22.1033333+00:00

Is that the latest version of SSMS?
Simon Goldshmid 20 Reputation points

2023-06-02T14:06:43.0233333+00:00

I upgraded my SSMS from 18.9.1 to 19.1 and still get the same error.
Alberto Morillo 34,671 Reputation points MVP Volunteer Moderator

2023-06-02T14:27:51.7766667+00:00

Do you have VIEW DATABASE STATE permission on the database?
Simon Goldshmid 20 Reputation points

2023-06-05T11:43:50.93+00:00

Yes, I do have this permission.
Alberto Morillo 34,671 Reputation points MVP Volunteer Moderator

2023-06-05T15:56:37.0433333+00:00

I can't think of any other possible cause of the error.
Simon Goldshmid 20 Reputation points

2023-06-05T21:10:39.5466667+00:00

Thanks for trying to help me! I will escalate it further.

Answer 3

Simon Goldshmid 20

When I tried to execute "SELECT * FROM sys.dm_hadr_database_replica_states" against our Hyperscale SQL Server 12.0.2000.8 database, I received the error "Invalid object name 'sys.dm_hadr_database_replica_states'".

I tried to execute it against our work database and against the master database and got the same error. Do I need a special version of the database to have this view?

Alberto Morillo 34,671 Reputation points MVP Volunteer Moderator

2023-06-02T12:33:46.6933333+00:00

You need to connect to a read-only replica to query that DMV. When connected to a read-only replica, the redo_queue_size and redo_rate columns in the sys.dm_database_replica_states DMV may be used to monitor data synchronization process, serving as indicators of data propagation latency on the read-only replica.

Please read this documentation for more information.
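
As a rough illustration of how those two columns might be used, here is a sketch that turns them into an estimated redo lag. It makes several assumptions: `redo_queue_size` is reported in KB and `redo_rate` in KB per second (as documented for the equivalent SQL Server DMV), the query runs over a connection opened against the read-only replica, and `cursor` is any DB-API style cursor (for example one from pyodbc). The function names are hypothetical. The result is only an indicator of propagation delay and does not prove that a particular transaction is already readable.

```python
from typing import Any, Optional

# Query to run while connected to the read-only replica.
REPLICA_STATE_QUERY = (
    "SELECT redo_queue_size, redo_rate "
    "FROM sys.dm_database_replica_states"
)


def estimate_redo_lag_seconds(
    redo_queue_size_kb: Optional[float],
    redo_rate_kb_per_sec: Optional[float],
) -> Optional[float]:
    """Estimate seconds needed to drain the redo queue; None if unknown."""
    if redo_queue_size_kb is None:
        return None
    if redo_queue_size_kb == 0:
        return 0.0  # nothing is waiting to be redone
    if not redo_rate_kb_per_sec:
        return None  # queue is non-empty but the rate is missing or zero
    return redo_queue_size_kb / redo_rate_kb_per_sec


def replica_lag_seconds(cursor: Any) -> Optional[float]:
    """Return the worst-case estimate across the rows the DMV returns."""
    cursor.execute(REPLICA_STATE_QUERY)
    rows = cursor.fetchall()
    estimates = [estimate_redo_lag_seconds(row[0], row[1]) for row in rows]
    if not estimates or any(e is None for e in estimates):
        return None  # no rows, or at least one row could not be estimated
    return max(estimates)
```

A function app could treat a `None` result or an estimate above its latency budget as a signal to read from the primary instead of the replica.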
