Hi Andrew Citera ,
Welcome to Micrososft Q&A platform and thanks for your query here.
I understand that you are trying to understand how Azure Event Hubs behave when there is an availability zone outage and how it handles recovery.
It's important to note the distinction between "outages" and "disasters." An outage is the temporary unavailability of Azure Event Hubs, and can affect some components of the service, such as a messaging store, or even the entire datacenter. However, after the problem is fixed, Event Hubs becomes available again. Typically, an outage doesn't cause the loss of messages or other data. An example of such an outage might be a power failure in the datacenter. Some outages are only short connection losses because of transient or network issues.
A disaster is defined as the permanent, or longer-term loss of an Event Hubs cluster, Azure region, or datacenter. The region or datacenter may or may not become available again, or may be down for hours or days , so the Geo-disaster recovery feature of Azure Event Hubs is a disaster recovery solution.
When an availability zone outage occurs in Azure Event Hubs, the service fabric model under the hood will elect a new leader for the affected partition(s) in another availability zone. The new leader will then begin replicating data to the other two availability zones to ensure that the data is fully replicated across all three zones.
If an application provides a specific partition key and that partition is unavailable, the Event Hub gateway will prevent the data from being written to an unavailable partition. The application will receive an error indicating that the partition is unavailable. The partition will eventually recover and become available again, at which point the application can retry writing to that partition with the same partition key. When a partition recovers, it will maintain the same partition ID. The partition ID is a unique identifier for the partition and is not affected by availability zone outages or other issues.
It is important to note that specifying a partition key does not completely break high availability in Azure Event Hubs. The partition key is used to ensure that related events are written to the same partition, which can improve performance and ordering guarantees. However, if a partition is unavailable, the Event Hub gateway will prevent data from being written to that partition until it becomes available again. This can result in reduced availability for the affected partition(s), but the overall availability of the Event Hub is not affected.
Hope it helps. Thankyou