Mapping Parquet types to Common Data Model data types
This article helps developers find the Common Data Model equivalents of Parquet data types.
Parquet type: This column lists the Parquet data type. For more details, see the Parquet documentation.
Common Data Model equivalent type: Each attribute in a Common Data Model entity can be associated with a single data type. A Common Data Model data type is an object that represents a collection of traits. Every data type should indicate its data format traits and can also carry additional semantic information. For more details, see the Common Data Model data type documentation.
Traits included in the equivalent data type: When an attribute is defined by using a data type, the attribute gains the traits of that data type. Traits are the fundamental mechanism in the Common Data Model metadata grammar for describing the data format, semantic meaning, and specifications of entities, attributes, and other objects, such as partitions or manifests. For more details, see the Common Data Model traits documentation.
Traits to add: These traits aren't implicitly included when you specify the Common Data Model data type. Users must add them to complete the suggested data type and match the equivalent Parquet type.
Unsupported: Common Data Model doesn't offer an out-of-the-box equivalent. Depending on the use case, users can define new data types, but those types won't be standard.
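The column definitions above can be sketched as a small data structure. This is only an illustration of how the columns relate to one another, not part of the Common Data Model SDK: the `TypeMapping` class and its field names are hypothetical, and the sample rows are placeholders chosen for the example rather than rows from the mapping table.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class TypeMapping:
    """One row of a Parquet-to-CDM mapping (hypothetical structure)."""
    parquet_type: str
    cdm_data_type: Optional[str]            # None stands for "Unsupported"
    included_traits: Tuple[str, ...] = ()   # traits the CDM data type already carries
    traits_to_add: Tuple[str, ...] = ()     # traits the user must add explicitly

    @property
    def supported(self) -> bool:
        return self.cdm_data_type is not None

    def full_traits(self) -> Tuple[str, ...]:
        """Included traits followed by added traits, without duplicates."""
        return tuple(dict.fromkeys(self.included_traits + self.traits_to_add))


# Illustrative rows only (assumptions, not copied from the mapping table).
int_row = TypeMapping(
    parquet_type="INT32",
    cdm_data_type="integer",
    included_traits=("is.dataFormat.integer",),
)
placeholder_row = TypeMapping(
    parquet_type="EXAMPLE_TYPE",                  # placeholder name
    cdm_data_type="exampleType",                  # placeholder name
    included_traits=("is.example.included",),     # placeholder trait
    traits_to_add=("is.example.added",),          # placeholder trait
)
unsupported_row = TypeMapping(parquet_type="EXAMPLE_UNSUPPORTED", cdm_data_type=None)

print(placeholder_row.full_traits())
print(unsupported_row.supported)
```

The point of `full_traits` is that an attribute ends up with the union of both trait columns: the first set comes for free with the data type, and the second must be applied by the user.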
To set the integer data type on a Common Data Model attribute, follow the CDM SDK API documentation for the API references.
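As a minimal sketch of what such an assignment produces, the following Python builds the JSON fragment for an attribute whose data type is `integer`. It does not use the CDM SDK: `make_attribute` is a hypothetical helper, the attribute name `quantity` is made up for the example, and the output shape (`name`, `dataType`, and an optional `appliedTraits` list) is an assumption about how an attribute appears in an entity definition document.

```python
import json
from typing import Any, Dict, Sequence


def make_attribute(name: str, data_type: str,
                   traits_to_add: Sequence[str] = ()) -> Dict[str, Any]:
    """Build an attribute definition as a plain dict (hypothetical helper).

    The data type brings its own traits implicitly; anything listed in
    traits_to_add is attached explicitly under "appliedTraits".
    """
    attribute: Dict[str, Any] = {"name": name, "dataType": data_type}
    if traits_to_add:
        attribute["appliedTraits"] = list(traits_to_add)
    return attribute


# An attribute typed as integer, with no extra traits to add.
quantity = make_attribute("quantity", "integer")
print(json.dumps(quantity, indent=2))
```

When a mapping lists traits to add, pass them as the third argument so they are written out alongside the data type. With the SDK itself, the same assignment goes through the object model described in the API references.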