@Simon Dix Sure,
As an endpoint experiences delivery failures, Event Grid begins to delay the delivery and retry of events to that endpoint.
For example, if the first 10 events published to an endpoint fail, Event Grid assumes that the endpoint is experiencing issues and will delay all subsequent retries and new deliveries for some time - in some cases up to several hours. The functional purpose of delayed delivery is to protect unhealthy endpoints and the Event Grid system.
Without back-off and delay of delivery to unhealthy endpoints, Event Grid's retry policy and volume capabilities can easily overwhelm a system.
So, suggestion here is to customize the retry policy when creating an event subscription by using the following two configurations. An event is dropped if either of the limits of the retry policy is reached.
- Maximum number of attempts - The value must be an integer between 1 and 30. The default value is 30.
- Event time-to-live (TTL) - The value must be an integer between 1 and 1440. The default value is 1440 minutes
Let us know.