IOPS postgresql monitoring - meaning of absolute values in graphs

Question

IOPS postgresql monitoring - meaning of absolute values in graphs

Ľubomíra Trnavská 0

Hi.
I have a slight problem understanding the metrics for IOPS.

What do the absolute values/percentages in graphs represent?

These are the absolute values
User's image

This is the percentual usage
User's image

Both graphs are from the same time span and with 5-minute granularity.

The server`s specs are:

Azure Database for PostgreSQL flexible server
General Purpose, D8s_v3, 8 vCores, 32 GiB RAM, 1024 GiB storage
- 5000 IOPS
plus one readonly replica with the same specs

If the first graph shows real IOPS (average per second) and we have the specs 5000, then the usage before 2PM -> 10+K should reach 100% (or more).
We can see a peak in the second graph as well but it is below 50%, so the disk should not be overloaded according to the second graph.

The question is:
What do the values in the first graph represent? What does the avg of absolute values of IOPS mean? Are IOPS logged and measured every second -> the first graph is the real representation? If yes, then what does the second graph represent? Why is there such low percentual usage? Which graph is more relevant to see if we are experiencing issues with IOPS?

We have tried to understand it better using the disk depth queue metric, but the units/measurements are unclear. What do the absolute values for disk depth queue mean?
Did < 40 disk IO operations wait in the queue at the peak before 2PM? How was this measured?
User's image

Oury Ba-MSFT 20,926 Reputation points Microsoft Employee Moderator

2023-03-17T18:02:23.5533333+00:00

@Ľubomíra Trnavská The average of absolute values of IOPS in Azure Database for PostgreSQL - Flexible Server refers to the average number of input/output operations per second. IOPS are logged and measured every second.

I tried to create a flexible server with the same option, but I am seeing different number of IOPS see number in the graph highlighted in yellow.
Ľubomíra Trnavská 0 Reputation points

2023-03-20T08:58:09.6733333+00:00

Hi, thank you for your reply. I have a few questions still, though. :)

The IOPS depends on the disk size, the values are here https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/concepts-compute-storage#maximum-iops-for-your-configuration We have 1024 GiB, and therefore 5K IOPS.

If the absolute values are correct, then what does the other metric represent?
Disk IOPS Consumed Percentage (Preview)
Should not these be "the same"?

Because if the absolute values are right, then the average was close to 10K which is 200% of the given 5K. But according to the percentage metrics, at the same point, it shows only 12% usage. Is this percentual metric incorrect?
Oury Ba-MSFT 20,926 Reputation points Microsoft Employee Moderator

2023-03-22T22:22:57.7033333+00:00

@Ľubomíra Trnavská Thank you for raising this question.

IOPS - Is a calculated metric that is sourced from Linux DISKSTATS command (1 sec sample)

Disk IOPS Consumed Percentage - Is default Azure VM Storage IO Utilization Metric – details here

This is being investigated. I will let you know once I got a result. Thank you for raising this question.

Regards,

Oury

1 answer

Your answer

Oury Ba-MSFT 20,926 Reputation points Microsoft Employee Moderator

2023-03-17T18:02:23.5533333+00:00

@Ľubomíra Trnavská The average of absolute values of IOPS in Azure Database for PostgreSQL - Flexible Server refers to the average number of input/output operations per second. IOPS are logged and measured every second.

I tried to create a flexible server with the same option, but I am seeing different number of IOPS see number in the graph highlighted in yellow.
Ľubomíra Trnavská 0 Reputation points

2023-03-20T08:58:09.6733333+00:00

Hi, thank you for your reply. I have a few questions still, though. :)

The IOPS depends on the disk size, the values are here https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/concepts-compute-storage#maximum-iops-for-your-configuration We have 1024 GiB, and therefore 5K IOPS.

If the absolute values are correct, then what does the other metric represent?
Disk IOPS Consumed Percentage (Preview)
Should not these be "the same"?

Because if the absolute values are right, then the average was close to 10K which is 200% of the given 5K. But according to the percentage metrics, at the same point, it shows only 12% usage. Is this percentual metric incorrect?
Oury Ba-MSFT 20,926 Reputation points Microsoft Employee Moderator

2023-03-22T22:22:57.7033333+00:00

@Ľubomíra Trnavská Thank you for raising this question.

IOPS - Is a calculated metric that is sourced from Linux DISKSTATS command (1 sec sample)

Disk IOPS Consumed Percentage - Is default Azure VM Storage IO Utilization Metric – details here

This is being investigated. I will let you know once I got a result. Thank you for raising this question.

Regards,

Oury

Answer 1

Oury Ba-MSFT 20,926 Microsoft Employee Moderator

@Ľubomíra Trnavská

Thank you for being patient while working on this.

Problem:

Discrepancy between the IOPS metric and the Disk IOPS Consumed Percentage metric.

Solution:

To clarify,

The IOPS metric measures the Input/Output operations per second, calculated directly from the Linux diskstats command on Postgres VM.
On the other hand, the Disk IOPS Consumed Percentage is a saturation metric derived from the default Azure VM Storage IO Utilization Metric. details here

During testing, we discovered an issue where the Data Disk IOPS Consumed Percentage was reaching 100%, even though the Read IOPS and Write IOPS were well below the storage maximum of 1000 IOPS. The Azure platform team identified this as a platform bug affecting the Percentage consumed metrics for the disks.

We are happy to inform you that the team has fixed the issue, and the fix has been pushed to resolve it. However, it will take a few weeks to get fully rolled out to all our production.

Please don't forget to mark this as accept answer if the reply was helpful.

Regards,

Oury

Oury Ba-MSFT 20,926 Reputation points Microsoft Employee Moderator

2023-03-30T14:21:39.1633333+00:00

@Ľubomíra Trnavská

We haven’t heard from you on the last response and was just checking back to see if you have a resolution yet. In case if you have any resolution please do share that same with the community as it can be helpful to others. Otherwise, will respond with more details and we will try to help.
Oury Ba-MSFT 20,926 Reputation points Microsoft Employee Moderator

2023-04-24T17:22:23.4833333+00:00

@Ľubomíra Trnavská I would like to Inform you that the issue of inconsistent metric data is now fixed. Please check and let us know if you are still seeing any issues in inconsistent data in your Azure PostgreSQL metrics. Regards, Oury

Share via

IOPS postgresql monitoring - meaning of absolute values in graphs

1 answer

Your answer