Overview
nimbusml
is a Python module that provides Python bindings for ML.NET.
nimbusml
aims to enable data science teams that are more familiar with Python
to take advantage of ML.NET's functionality and performance. It provides
battle-tested, state-of-the-art ML algorithms, transforms, and components. The
components are authored by the team members, as well as numerous contributors
from MSR, CISL, Bing, and other teams at Microsoft.
nimbusml
is interoperable with scikit-learn
estimators and transforms,
while adding a suite of fast, highly optimized, and scalable algorithms written
in C++ and C#. nimbusml
trainers and transforms support the following data
structures for the fit()
and transform()
methods:
numpy.ndarray
scipy.sparse_cst
pandas.DataFrame
.
In addition, nimbusml
also supports streaming from files without loading the
dataset into memory with FileDataStream
, which allows training on data
significantly exceeding memory.
With FileDataStream
, nimbusml
is able to handle up to a billion
features and billions of training examples for select algorithms.
NimbusML can be easily used for the following problems:
- Binary classification
- Multiclass classification
- Regression
- Ranking
- Clustering
For more details, please refer to the tutorial section.