Overview of OMOP analytics in Healthcare data solutions (preview)

[This article is prerelease documentation and is subject to change.]

The objective of implementing any common data model is to establish standardization and streamline the structuring, storage, sharing, exchange, and usage of data. In Healthcare data solutions (preview), the OMOP analytics capability facilitates the deployment of the Observational Medical Outcomes Partnership (OMOP) common data model (CDM) in the Fabric lakehouse environment. This deployment provides researchers within the OMOP community access to OneLake's expansive scale and the AI capabilities of the Fabric platform. The setup enables efficient and reliable execution of standardized analytics for patient and population-level observational studies.

By deploying the OMOP CDM to Fabric using prebuilt pipelines, users can utilize the provided notebooks to construct statistical models, conduct population distribution studies, and utilize Power BI reports to visually compare various interventions and their effects on patient outcomes. These analytical capabilities empower researchers to perform comparative analyses, such as evaluating different procedures and drug exposures, or examining correlations between drug exposures and condition occurrences. For reference and guidance, the capability also provides sample notebooks for these scenarios.

OMOP analytics is an optional capability under Healthcare data solutions in Microsoft Fabric (preview). You have the flexibility to decide whether or not to use it, depending on your specific needs or scenarios.

To learn how to deploy, configure, and use the OMOP analytics capability, see:

What's included in OMOP CDM

Healthcare data solutions in Microsoft Fabric (preview) implement the OMOP Common Data Model version v5.4 with focus on mapping Fast Healthcare Interoperability Resources (FHIR) clinical resources to corresponding OMOP tables in Fabric. Though the complete OMOP CDM schema with all its tables is populated in the healthcare#_msft_gold_omop lakehouse, there are no FHIR resources mapped to the nonclinical tables.

  • The tables under Standardized health economics are out of scope and aren't mapped to any FHIR resources. They won't be populated after the transformation pipelines are executed from the healthcare#_msft_silver lakehouse to the healthcare#_msft_gold_omop lakehouse.

  • We include Standardized vocabularies to support mapping of terminologies and codes referenced in the FHIR clinical resources to the corresponding standard concepts for OMOP in the clinical domain.

    The Athena Data Set contains references to domain specific vocabularies.

  • Standard vocabularies included in the preview release are SNOMED-CT (Systematized Nomenclature of Medicine - Clinical Terms), LOINC (Logical Observation Identifiers, Names, and Codes), and RxNorm.

An image displaying the OMOP common data model depiction.

See also