This section shows you how to train machine learning and AI models on Mosaic AI.
Mosaic AI Model Training streamlines and unifies the process of training and deploying traditional ML models through AutoML and Foundation Model Fine-tuning workloads.
AutoML
AutoML simplifies the process of applying machine learning to your datasets by automatically finding the best algorithm and hyperparameter configuration for you. AutoML offers a no-code UI as well as a Python API.
Foundation Model Fine-tuning
Foundation Model Fine-tuning (now part of Mosaic AI Model Training) on Databricks lets you customize large language models (LLMs) using your own data. This process involves fine-tuning the training of a pre-existing foundation model, significantly reducing the data, time, and compute resources required compared to training a model from scratch. Key features include:
Instruction fine-tuning: Adapt your model to new tasks by training on structured prompt-response data.
Continued pre-training: Enhance your model with additional text data to add new knowledge or focus on a specific domain.
Chat completion: Train your model on chat logs to improve conversational abilities.
Open source library examples
See machine learning training examples from a wide variety of open source machine learning libraries, including hyperparameter tuning examples using Optuna and Hyperopt.
Deep learning
See examples and best practices for distributed deep learning training so you can develop and fine-tune deep learning models on Azure Databricks.
Recommenders
Learn how to train deep-learning-based recommendation models on Azure Databricks. Compared to traditional recommendation models, deep learning models can achieve higher quality results and scale to larger amounts of data.
Manage data ingestion and preparation, model training and deployment, and machine learning solution monitoring with Python, Azure Machine Learning and MLflow.
Learn how to train machine learning models using scikit-learn in Azure Databricks. Basic classification model. End-to-end example of ML on Azure Databricks.
AI and Machine Learning on Databricks, an integrated environment to simplify and standardize AI, ML, DL, and LLM development. Tutorials and user guides for common tasks and scenarios.
This article includes reference solutions, detailed examples of how you can use Azure Databricks for common machine learning applications such as data labeling and image processing.
Learn how to train machine learning models on single nodes using TensorFlow and debug machine learning programs using inline TensorBoard. A 10-minute tutorial notebook shows an example of training machine learning models on tabular data with TensorFlow Keras.
Learn about deep-learning-based recommender models in Databricks, including the two-tower model and the DLRM architecture, and when to use each type of recommender model.