Hi @sneka selvakumar ,
Thanks for reaching out to Microsoft Q&A.
Here's a breakdown of your queries:
Is this behavior considered expected for transient platform-level server errors that recover automatically?
Yes, it's generally expected that transient platform-level server errors may not always trigger alert notifications. These types of errors can resolve themselves quickly, and if the errors are short-lived, the alert may not have a chance to fire.
Under what conditions can Service Bus ServerErrors be recorded in metrics but not trigger an alert notification?
ServerErrors can be recorded in metrics without triggering an alert notification if:
- The duration of the error is shorter than your alert's evaluation frequency.
- The number of errors does not exceed the configured thresholds for triggering an alert.
- There may be certain suppression rules in place that prevent notifications for transient errors.
Are there any known limitations, delays, or suppression mechanisms in Azure Monitor that could prevent alert notifications for short-lived Service Bus server errors?
Azure Monitor has certain limitations, such as:
- Alerts can be suppressed during maintenance windows or if the alert rules are misconfigured.
- There may be a delay in alert firing due to the frequency of evaluation you have set.
What are Microsoft’s recommended best practices for alerting on Service Bus ServerErrors to balance visibility and noise?
- Set appropriate thresholds to minimize noise while maintaining sensitivity to critical issues.
- Consider increasing the evaluation frequency of your alerts to capture transient errors more effectively.
- Create action groups to manage how notifications are sent, ensuring they reach the right personnel with minimal spam.
Refereces:
Hope this helps!
If the resolution was helpful, kindly take a moment to click on
and click on Yes for was this answer helpful. And, if you have any further query do let us know.