Exchange Server 2016 failover

Gavin Ross 231 Reputation points
2021-06-14T18:36:57.173+00:00

Hi All, Need some help with an issue we had when trying to test a WAN failure with Exchange 2016. Here is the setup

4 Exchange 2016 Servers
2 DAGS
DAGA - Has 2 servers, 1 in primary site and 1 in secondary site
DAGB - Has 2 servers, 1 in primary site and 1 in secondary site

When testing we first manually shutdown the primary server in DAGA and manually moved the databases to the secondary server in DAGA, tested by logging into webmail and everything worked fine. We then tried to test by shutting down the WAN links between the 2 sites and that's where we had an issue. When trying to login to webmail, we got to the login page, but after logging in, we got a page could not be displayed, same with trying to login to the ECP.

So here is my question, why would this stop working with the WAN link down and the mailboxes were already mounted in the secondary site? Was there something else that we needed to do on the Exchange configuration in order to get this to work correctly?

Thanks,
Gavin

Exchange Server Management
Exchange Server Management
Exchange Server: A family of Microsoft client/server messaging and collaboration software.Management: The act or process of organizing, handling, directing or controlling something.
7,598 questions
0 comments No comments
{count} votes

6 answers

Sort by: Most helpful
  1. Andy David - MVP 147.6K Reputation points MVP
    2021-06-14T18:56:30.553+00:00

    Did you account for the client connection to the CAS Service?
    If the clients are connecting to the CAS in DAG A, then it will fail if the WAN link to the other site is down.

    You need to ensure your load balancer marks the DAGA servers down and redirects them to use only the DAGB servers and the clients have the ability to connect to the DAGB servers if the WAN link is down.

    0 comments No comments

  2. Gavin Ross 231 Reputation points
    2021-06-14T19:04:47.237+00:00

    Yes, I manually changed the host file on my system and re-pointed it to the correct servers, flushed DNS and also did a reboot on the system I was testing with.


  3. Gavin Ross 231 Reputation points
    2021-06-14T21:26:02.453+00:00

    The test was to assume that the primary site is down (earthquake, fire etc... completely not available) services then failover to the secondary and then can use webmail for e-mail access externally. Do I need to remove the server from the DAG in order to achieve this? We also have DAC enabled to avoid split-brain if the site was to come up again, but in this instance, we could not even load webmail on the internal network with the database already failed over to the secondary site. I could not even access ECP. I had a VPN connection to the secondary site to test.


  4. Gavin Ross 231 Reputation points
    2021-06-14T22:47:48.157+00:00

    After researching a bit, it looks like I need to perform the steps in the document. https://learn.microsoft.com/en-us/exchange/high-availability/manage-ha/datacenter-switchovers?view=exchserver-2019


  5. Eric Yin-MSFT 4,386 Reputation points
    2021-06-15T07:05:49.207+00:00

    Hi,
    How is your witness server deployed?
    Did you enable Failover Clustering Dynamic Quorum? Run "(Get-Cluster “cluster_name”).DynamicQuorum" to check it.
    If you have witness server in siteA and haven't enabled Dynamic Quorum, the DAG loses quorum and database copy won't mount on second site.
    Run "Get-MailboxDatabaseCopyStatus -Identity MDB0* | select name, status, contentIndexState | sort Status | ft -auto" to check the database copy status.


    If an Answer is helpful, please click "Accept Answer" and upvote it.
    Note: Please follow the steps in [our documentation][99] to enable e-mail notifications if you want to receive the related email notification for this thread.


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.