DataOperationsCatalog Class
Definition
Important
Some information relates to prerelease product that may be substantially modified before it’s released. Microsoft makes no warranties, express or implied, with respect to the information provided here.
Class used to create components that operate on data, but are not part of the model training pipeline. Includes components to load, save, cache, filter, shuffle, and split data.
public sealed class DataOperationsCatalog
type DataOperationsCatalog = class
Public NotInheritable Class DataOperationsCatalog
- Inheritance
-
DataOperationsCatalog
Methods
BootstrapSample(IDataView, Nullable<Int32>, Boolean) |
Take an approximate bootstrap sample of |
Cache(IDataView, String[]) |
Creates a lazy in-memory cache of |
CreateEnumerable<TRow>(IDataView, Boolean, Boolean, SchemaDefinition) |
Convert an IDataView into a strongly-typed IEnumerable<T>. |
CrossValidationSplit(IDataView, Int32, String, Nullable<Int32>) |
Split the dataset into cross-validation folds of train set and test set.
Respects the |
FilterRowsByColumn(IDataView, String, Double, Double) |
Filter the dataset by the values of a numeric column. |
FilterRowsByKeyColumnFraction(IDataView, String, Double, Double) |
Filter the dataset by the values of a KeyDataViewType column. |
FilterRowsByMissingValues(IDataView, String[]) |
Drop rows where any column in |
LoadFromEnumerable<TRow>(IEnumerable<TRow>, DataViewSchema) |
Create a new IDataView over an enumerable of the items of user-defined type using the provided DataViewSchema, which might contain more information about the schema than the type can capture. |
LoadFromEnumerable<TRow>(IEnumerable<TRow>, SchemaDefinition) |
Create a new IDataView over an enumerable of the items of user-defined type.
The user maintains ownership of the One typical usage for streaming data view could be: create the data view that lazily loads data as needed, then apply pre-trained transformations to it and cursor through it for transformation results. |
ShuffleRows(IDataView, Nullable<Int32>, Int32, Boolean) |
Shuffle the rows of |
SkipRows(IDataView, Int64) |
Skip |
TakeRows(IDataView, Int64) |
Take |
TrainTestSplit(IDataView, Double, String, Nullable<Int32>) |
Split the dataset into the train set and test set according to the given fraction.
Respects the |