OpenDatasetBase Class
Open Dataset Base Class for inherit.
Construct open datasets.
- Inheritance
-
OpenDatasetBase
Constructor
OpenDatasetBase(cols: List[str] | None = None, enable_telemetry: bool = True, **kwargs)
Parameters
A list of columns names to load from the dataset, defaults to None
- enable_telemetry
- bool
Whether to enable telemetry on this dataset, defaults to True
Methods
get_file_dataset |
Get the file dataset for open dataset. |
get_tabular_dataset |
Initialize AbstractTabularOpenDataset with blob url. |
to_pandas_dataframe |
To pandas dataframe. |
to_spark_dataframe |
To spark dataframe. |
get_file_dataset
Get the file dataset for open dataset.
get_file_dataset(start_date: datetime = None, end_date: datetime = None, enable_telemetry: bool = True, **kwargs) -> FileDataset
Parameters
Returns
file dataset
Return type
get_tabular_dataset
Initialize AbstractTabularOpenDataset with blob url.
get_tabular_dataset(start_date: datetime = None, end_date: datetime = None, cols: List[str] = None, enable_telemetry: bool = True, **kwargs) -> TabularDataset
Parameters
Returns
TabularDataset
Return type
to_pandas_dataframe
To pandas dataframe.
to_pandas_dataframe() -> DataFrame
to_spark_dataframe
To spark dataframe.
to_spark_dataframe()
Attributes
cols
Get the column name list to retrieve.
data
Get the data of the OpenDataset Object.
id
Get the location ID of the open data.
log_properties
Get log properties.
registry_id
Get the registry ID of this public dataset registered at the backend.
This registry ID is used to get latest metadata like storage location. Expect all public data sub classes to assign _registry_id.
Returns
Registry ID string.
Return type
time_column_name
Time column name.
Feedback
https://aka.ms/ContentUserFeedback.
Coming soon: Throughout 2024 we will be phasing out GitHub Issues as the feedback mechanism for content and replacing it with a new feedback system. For more information see:Submit and view feedback for