9 episodes

Learn Live: Foundations of Data Science for Machine Learning

Join Jason DeBoever and Glenn Stephens live on Learn TV and explore this nine-part “Foundations of data science for machine learning” series. We will be walking through Learn modules and answering your questions live. From basic classical machine learning models to exploratory data analysis and customizing architectures, you’ll be guided by easy to digest conceptual content and interactive Jupyter notebooks and will learn about the underlying concepts as well as how to get into building models with the most common machine learning tools.

Introduction to machine learning

Introduction to machine learning

A high-level overview of machine learning for people with little or no knowledge of computer science and statistics. You’ll be introduced to some essential concepts, explore data, and interactively go through the machine learning life-cycle - using Python to train, save, and use a machine learning model like we would in the real world. In this episode, you will:

  • Explore how machine learning differs from traditional software.
  • Create and test a machine learning model.
  • Load a model and use it with new datasets.
Build classical machine learning models with supervised learning

Build classical machine learning models with supervised learning

Supervised learning is a form of machine learning where an algorithm learns from examples of data. We progressively paint a picture of how supervised learning automatically generates a model that can make predictions about the real world. We also touch on how these models are tested, and difficulties that can arise In this episode, you will:

  • Define supervised and unsupervised learning.
  • Explore how cost functions affect the learning process.
  • Discover how models are optimized by gradient descent.
  • Experiment with learning rates, and see how they can affect training.
Introduction to data for machine learning

Introduction to data for machine learning

The power of machine learning models comes from the data that is used to train them. Through content and exercises, we explore how to understand your data, how to encode it so that the computer can interpret it properly, how to clean it of errors, and tips that will help you create models that perform well. In this episode, you will:

  • Visualize large datasets with Exploratory Data Analysis (EDA).
  • Clean a dataset of errors.
  • Predict unknown values using numeric and categorical data.
Train and understand regression models in machine learning

Train and understand regression models in machine learning

Regression is arguably the most widely used machine learning technique, commonly underlying scientific discoveries, business planning, and stock market analytics. This learning material takes a dive into some common regression analyses, both simple and more complex, and provides some insight on how to assess model performance. In this episode, you will:

  • Understand how regression works.
  • Work with new algorithms: Linear regression, multiple linear regression, and polynomial regression.
  • Understand the strengths and limitations of regression models.
  • Visualize error and cost functions in linear regression.
  • Understand basic evaluation metrics for regression.
Refine and test machine learning models

Refine and test machine learning models

When we think of machine learning, we often focus on the training process. A small amount of preparation before this process can not only speed up and improve learning but also give us some confidence about how well our models will work when faced with data we have never seen before. In this episode, you will:

  • Define feature normalization.
  • Create and work with test datasets.
  • Articulate how testing models can both improve and harm training.
Create and understand classification models in machine learning

Create and understand classification models in machine learning

Classification means assigning items into categories or can also be thought of automated decision making. Here we introduce classification models through logistic regression, providing you with a stepping-stone toward more complex and exciting classification methods. In this episode, you will:

  • Discover how classification differs from classical regression.
  • Build models that can perform classification tasks.
  • Explore how to assess and improve classification models.
Select and customize architectures and hyperparameters using random forest

Select and customize architectures and hyperparameters using random forest

More complex models often can be manually customized to improve how effective they are. Through exercises and explanatory content, we explore how altering the architecture of more complex models can bring about more effective results. In this episode, you will:

  • Discover new model types– decision trees and random forests.
  • Learn how model architecture can affect performance.
  • Practice working with hyperparameters to improve training effectiveness.
Confusion matrix and data imbalances

Confusion matrix and data imbalances

How do we know if a model is good or bad at classifying our data? The way that computers assess model performance sometimes can be difficult for us to comprehend or can over-simplify how the model will behave in the real world. To build models that work in a satisfactory way, we need to find intuitive ways to assess them, and understand how these metrics can bias our view. In this episode, you will:

  • Assess performance of classification models.
  • Review metrics to improve classification models.
  • Mitigate performance issues from data imbalances.
Measure and optimize model performance with ROC and AUC

Measure and optimize model performance with ROC and AUC

Receiver operator characteristic curves are a powerful way to assess and fine-tune trained classification models. We introduce and explain the utility of these curves through learning content and practical exercises. In this episode, you will:

  • Understand how to create ROC curves.
  • Explore how to assess and compare models using these curves.
  • Practice fine-tuning a model using characteristics plotted on ROC curves.

Our Speakers

  • Jason DeBoever

    Microsoft Senior Product Manager

    Twitter
  • Glenn Stephens

    Microsoft Senior Content Developer

    Twitter

Frequently Asked Questions

Is there a cost to attend the event?

The event is free! You can join Learn TV to the stream content for free!

When does the event take place?

Join us every Tuesday, 2:00pm-3:30pm PST, starting September 14 through November 9.

Will this event be available on-demand?

Yes! You can stream our past events anytime! Browse all Learn Live to stream anytime.

Where does the event take place?

This event series is virtual! Join us and thousands of others to enjoy the experience from the comfort of your own home, office, or device.

What language are episodes delivered in?

This show is delivered in English. Captioning in 14 different languages will be available in the recordings, made available 48 hours after each live show at Learn Live.

Code of Conduct

We seek to provide a respectful, friendly, professional experience for everyone, regardless of gender, sexual orientation, physical appearance, disability, age, race or religion. We do not tolerate any behavior that is degrading to any gender, race, sexual orientation, or disability, or any behavior that would be deemed harassment or discrimination. Individuals are responsible for knowing and abiding by our standards and we encourage everyone to assist in creating a welcoming and safe environment. Additionally, we strive to treat our shared planet with the same amount of respect. Please report any questions, concerns, or disruptive activity or behavior to the organizing team, so that we can address the issue immediately.