PublicData Class
Defines the base class of public data.
The public data class contains the common properties and methods shared by all open datasets.
Initialize with columns.
- Inheritance
builtins.object
PublicData
Constructor
PublicData(cols: List[str] | None, enable_telemetry: bool = True)
Parameters
- cols
List of column names that the user wants to enrich.
- enable_telemetry
Whether to send telemetry.
Methods
get_enricher | Get enricher.
to_pandas_dataframe | To pandas dataframe.
to_spark_dataframe | To Spark dataframe.
get_enricher
Get enricher.
get_enricher()
to_pandas_dataframe
To pandas dataframe.
to_pandas_dataframe()
to_spark_dataframe
To Spark dataframe.
to_spark_dataframe()
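The two conversion methods listed above take no arguments, so a caller can choose between them at runtime. The following is a minimal sketch of that choice, not part of the library: `load_frame` and `prefer_spark` are hypothetical names, and the only assumption about `dataset` is that it exposes `to_pandas_dataframe()` and `to_spark_dataframe()` as documented here.

```python
import importlib.util


def load_frame(dataset, prefer_spark: bool = False):
    """Hypothetical helper: pick a conversion method on a PublicData-like object.

    Assumes only that `dataset` exposes the two documented methods,
    to_pandas_dataframe() and to_spark_dataframe().
    """
    # Check whether pyspark is importable without actually importing it.
    spark_installed = importlib.util.find_spec("pyspark") is not None
    if prefer_spark and spark_installed:
        return dataset.to_spark_dataframe()
    # Fall back to pandas when Spark is not requested or not installed.
    return dataset.to_pandas_dataframe()
```

With a concrete subclass instance, `load_frame(ds)` would return whatever `ds.to_pandas_dataframe()` returns.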
Attributes
cols
Get the column name list to retrieve.
env
Return the runtime environment.
id
Get the location ID of the open data.
registry_id
Get the registry ID of this public dataset registered at the backend.
Azure uses this registry ID to get the latest metadata, such as the storage location. All public data subclasses are expected to assign _registry_id.
Returns
The registry ID.
Return type
logger
logger = <Logger azureml.opendatasets (DEBUG)>
mandatory_columns
mandatory_columns = []
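The expectation that every subclass assigns `_registry_id` can be illustrated with a stand-in class. This is a sketch under stated assumptions, not the library's implementation: `PublicDataSketch`, `ExampleWeatherData`, and the ID `"example-weather"` are hypothetical, and the stand-in mirrors only the constructor signature and the `cols` and `registry_id` attributes documented on this page.

```python
from typing import List, Optional


class PublicDataSketch:
    """Hypothetical stand-in mirroring the documented PublicData interface."""

    # Class-level defaults, as listed under Attributes.
    mandatory_columns: List[str] = []
    _registry_id: Optional[str] = None  # subclasses are expected to override this

    def __init__(self, cols: Optional[List[str]], enable_telemetry: bool = True):
        self._cols = cols
        self.enable_telemetry = enable_telemetry

    @property
    def cols(self) -> Optional[List[str]]:
        """Column name list to retrieve."""
        return self._cols

    @property
    def registry_id(self) -> Optional[str]:
        """Registry ID assigned by the subclass."""
        return self._registry_id


class ExampleWeatherData(PublicDataSketch):
    """Hypothetical subclass; the registry ID below is made up for illustration."""

    _registry_id = "example-weather"


ds = ExampleWeatherData(cols=["temperature"], enable_telemetry=False)
print(ds.registry_id, ds.cols)
```

Keeping the ID as a class attribute means the read-only `registry_id` property works for every subclass without each one redefining it.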