Azure Active Directory general operations guide reference
This section of the Azure AD operations reference guide describes the checks and actions you should take to optimize the general operations of Azure Active Directory (Azure AD).
These recommendations are current as of the date of publishing but can change over time. Organizations should continuously evaluate their operational practices as Microsoft products and services evolve over time.
Key operational processes
Assign owners to key tasks
Managing Azure Active Directory requires the continuous execution of key operational tasks and processes, which may not be part of a rollout project. It is still important you set up these tasks to optimize your environment. The key tasks and their recommended owners include:
|Drive Improvements on Identity Secure Score||InfoSec Operations Team|
|Maintain Azure AD Connect Servers||IAM Operations Team|
|Regularly execute and triage IdFix Reports||IAM Operations Team|
|Triage Azure AD Connect Health Alerts for Sync and AD FS||IAM Operations Team|
|If not using Azure AD Connect Health, then customer has equivalent process and tools to monitor custom infrastructure||IAM Operations Team|
|If not using AD FS, then customer has equivalent process and tools to monitor custom infrastructure||IAM Operations Team|
|Monitor Hybrid Logs: Azure AD App Proxy Connectors||IAM Operations Team|
|Monitor Hybrid Logs: Passthrough Authentication Agents||IAM Operations Team|
|Monitor Hybrid Logs: Password Writeback Service||IAM Operations Team|
|Monitor Hybrid Logs: On-premises password protection gateway||IAM Operations Team|
|Monitor Hybrid Logs: Azure AD MFA NPS Extension (if applicable)||IAM Operations Team|
As you review your list, you may find you need to either assign an owner for tasks that are missing an owner or adjust ownership for tasks with owners that aren’t aligned with the recommendations above.
Owners recommended reading
Recent versions of on-premises components
Having the most up-to-date versions of on-premises components provides the customer all the latest security updates, performance improvements as well as functionality that could help to further simplify the environment. Most components have an automatic upgrade setting, which will automate the upgrade process.
These components include:
- Azure AD Connect
- Azure AD Application Proxy Connectors
- Azure AD Pass-through authentication agents
- Azure AD Connect Health Agents
Unless one has been established, you should define a process to upgrade these components and rely on the automatic upgrade feature whenever possible. If you find components that are six or more months behind, you should upgrade as soon as possible.
Hybrid management recommended reading
- Azure AD Connect: Automatic upgrade
- Understand Azure AD Application Proxy connectors | Automatic updates
Azure AD Connect Health alert baseline
Organizations should deploy Azure AD Connect Health for monitoring and reporting of Azure AD Connect and AD FS. Azure AD Connect and AD FS are critical components that can break lifecycle management and authentication and therefore lead to outages. Azure AD Connect Health helps monitor and gain insights into your on-premises identity infrastructure thus ensuring the reliability of your environment.
As you monitor the health of your environment, you must immediately address any high severity alerts, followed by lower severity alerts.
Azure AD Connect Health recommended reading
On-premises agents logs
Some identity and access management services require on-premises agents to enable hybrid scenarios. Examples include password reset, pass-through authentication (PTA), Azure AD Application Proxy, and Azure AD MFA NPS extension. It is key that the operations team baseline and monitor the health of these components by archiving and analyzing the component agent logs using solutions such as System Center Operations Manager or SIEM. It is equally important your Infosec Operations team or help desk understand how to troubleshoot patterns of errors.
On-premises agents logs recommended reading
- Troubleshoot Application Proxy
- Self-service password reset troubleshooting- Azure Active Directory
- Understand Azure AD Application Proxy connectors
- Azure AD Connect: Troubleshoot Pass-through Authentication
- Troubleshoot error codes for the Azure AD MFA NPS extension
On-premises agents management
Adopting best practices can help the optimal operation of on-premises agents. Consider the following best practices:
- Multiple Azure AD Application proxy connectors per connector group are recommended to provide seamless load balancing and high availability by avoiding single points of failure when accessing the proxy applications. If you presently have only one connector in a connector group that handles applications in production, you should deploy at least two connectors for redundancy.
- Creating and using an app proxy connector group for debugging purposes can be useful for troubleshooting scenarios and when onboarding new on-premises applications. We also recommend installing networking tools such as Message Analyzer and Fiddler in the connector machines.
- Multiple pass-through authentication agents are recommended to provide seamless load balancing and high availability by avoiding single point of failure during the authentication flow. Be sure to deploy at least two pass-through authentication agents for redundancy.
On-premises agents management recommended reading
Management at scale
Identity secure score
The identity secure score provides a quantifiable measure of the security posture of your organization. It is key to constantly review and address findings reported and strive to have the highest score possible. The score helps you to:
- Objectively measure your identity security posture
- Plan identity security improvements
- Review the success of your improvements
If your organization currently has no program in place to monitor changes in Identity Secure Score, it is recommended you implement a plan and assign owners to monitor and drive improvement actions. Organizations should remediate improvement actions with a score impact higher than 30 as soon as possible.
Microsoft sends email communications to administrators to notify various changes in the service, configuration updates that are needed, and errors that require admin intervention. It is important that customers set the notification email addresses so that notifications are sent to the proper team members who can acknowledge and act upon all notifications. We recommend you add multiple recipients to the Message Center and request that notifications (including Azure AD Connect Health notifications) be sent to a distribution list or shared mailbox. If you only have one global admin account with an email address, be sure to configure at least two email-capable accounts.
- Azure AD Access Reviews
- Azure AD Connect Health
- Azure AD Identity Protection
- Azure AD Privileged Identity Management
- Enterprise App Expiring Certificate Notifications
- Enterprise App Provisioning Service Notifications
Refer to the following table to learn the type of notifications that are sent and where to check for them:
|Notification source||What is sent||Where to check|
|Technical contact||Sync errors||Azure portal - properties blade|
|Message Center||Incident and degradation notices of Identity Services and Microsoft 365 backend services||Office Portal|
|Identity Protection Weekly Digest||Identity Protection Digest||Azure AD Identity Protection blade|
|Azure AD Connect Health||Alert notifications||Azure portal - Azure AD Connect Health blade|
|Enterprise Applications Notifications||Notifications when certificates are about to expire and provisioning errors||Azure portal - Enterprise Application blade (each app has its own email address setting)|
Notifications recommended reading
Operational surface area
AD FS lockdown
Organizations, which configure applications to authenticate directly to Azure AD benefit from Azure AD smart lockout. If you use AD FS in Windows Server 2012 R2, implement AD FS extranet lockout protection. If you use AD FS on Windows Server 2016 or later, implement extranet smart lockout. At a minimum, we recommend you enable extranet lockout to contain the risk of brute force attacks against on-premises Active Directory. However, if you have AD FS in Windows 2016 or higher, you should also enable extranet smart lockout that will help to mitigate password spray attacks.
If AD FS is only used for Azure AD federation, there are some endpoints that can be turned off to minimize the attack surface area. For example, if AD FS is only used for Azure AD, you should disable WS-Trust endpoints other than the endpoints enabled for usernamemixed and windowstransport.
Access to machines with on-premises identity components
Organizations should lock down access to the machines with on-premises hybrid components in the same way as your on-premises domain. For example, a backup operator or Hyper-V administrator should not be able to log in to the Azure AD Connect Server to change rules.
The Active Directory administrative tier model was designed to protect identity systems using a set of buffer zones between full control of the Environment (Tier 0) and the high-risk workstation assets that attackers frequently compromise.
The tier model is composed of three levels and only includes administrative accounts, not standard user accounts.
- Tier 0 - Direct Control of enterprise identities in the environment. Tier 0 includes accounts, groups, and other assets that have direct or indirect administrative control of the Active Directory forest, domains, or domain controllers, and all the assets in it. The security sensitivity of all Tier 0 assets is equivalent as they are all effectively in control of each other.
- Tier 1 - Control of enterprise servers and applications. Tier 1 assets include server operating systems, cloud services, and enterprise applications. Tier 1 administrator accounts have administrative control of a significant amount of business value that is hosted on these assets. A common example role is server administrators who maintain these operating systems with the ability to impact all enterprise services.
- Tier 2 - Control of user workstations and devices. Tier 2 administrator accounts have administrative control of a significant amount of business value that is hosted on user workstations and devices. Examples include Help Desk and computer support administrators because they can impact the integrity of almost any user data.
Lock down access to on-premises identity components such as Azure AD Connect, AD FS, and SQL services the same way as you do for domain controllers.
There are seven aspects to a secure Identity infrastructure. This list will help you find the actions you should take to optimize the operations for Azure Active Directory (Azure AD).
- Assign owners to key tasks.
- Automate the upgrade process for on-premises hybrid components.
- Deploy Azure AD Connect Health for monitoring and reporting of Azure AD Connect and AD FS.
- Monitor the health of on-premises hybrid components by archiving and analyzing the component agent logs using System Center Operations Manager or a SIEM solution.
- Implement security improvements by measuring your security posture with Identity Secure Score.
- Lock down AD FS.
- Lock down access to machines with on-premises identity components.
Refer to the Azure AD deployment plans for implementation details on any capabilities you haven't deployed.