PublicData Class

Defines the base class of public data.

Public data class contains common properties and methods for each open datasets.

Initialize with columns.

Inheritance
builtins.object
PublicData

Constructor

PublicData(cols: List[str] | None, enable_telemetry: bool = True)

Parameters

cols
list
Required

A list of column names to enrich.

enable_telemetry
bool
default value: True

Indicates whether to send telemetry.

cols
Required

column name list which the user wants to enrich

enable_telemetry
Required

whether to send telemetry

Methods

get_enricher

Get enricher.

to_pandas_dataframe

To pandas dataframe.

to_spark_dataframe

To spark dataframe.

get_enricher

Get enricher.

get_enricher()

to_pandas_dataframe

To pandas dataframe.

to_pandas_dataframe()

to_spark_dataframe

To spark dataframe.

to_spark_dataframe()

Attributes

cols

Get the column name list to retrieve.

env

Return the runtime environment.

id

Get the location ID of the open data.

registry_id

Get the registry ID of this public dataset registered at the backend.

Azure uses this registry ID to get latest metadata like storage location. You should expect all public data sub classes to assign _registry_id.

Returns

The registry ID.

Return type

str

logger

logger = <Logger azureml.opendatasets (DEBUG)>

mandatory_columns

mandatory_columns = []