Exercise - Train a machine learning model to recognize polar bears
Azure AI services is a suite of more than 20 services and APIs that are backed by machine learning. Developers can use the APIs to incorporate intelligent features like facial recognition and sentiment analysis into their applications. Azure AI Custom Vision is just one member of the Azure AI services family. Its purpose is to create image classification models that "learn" from labeled images you provide. Want to know whether a photo contains a picture of a flower? Train Custom Vision by using a collection of flower images. Then, it can tell you whether the next image includes a flower, or even what type of flower appears in an image.
The Custom Vision service exposes two APIs: the Custom Vision Training API and the Custom Vision Prediction API. You can build, train, and test image classification models by using the Custom Vision portal, or you can build, train, and test the models by using the Custom Vision Training API. After a model is trained, you can use the Custom Vision Prediction API to build apps that use the model. Both APIs are REST APIs that can be called from various programming languages.
In this unit, you'll create a Custom Vision model and train it to differentiate between various types of Arctic wildlife.
Build a Custom Vision model
You'll begin by creating a new Custom Vision project. Then, you'll upload images of polar bears, Arctic foxes, and walruses and tag the images, so Custom Vision can learn to differentiate between them.
Create a new project, resource, and resource group
In your browser, go to the Custom Vision portal. Select Sign In and sign in with your Microsoft account. Accept any terms of service.
Select New Project.
In Create new project:
- Enter a project name and description.
- Next to Resource, select create new to create a new Azure AI services resource.
Create a new Cognitive Services project
In Create New Resource:
- For the resource name, enter polar-bear-vision.
- Select your subscription.
- Next to Resource Group, select create new to create a new Azure AI services resource group.
Create a new Cognitive Services resource
In Create New Resource Group:
- For the resource group name, enter polar-bear-rg.
- Make sure Location is set to South Central US, and then select Create resource group.
Create a new Cognitive Services resource group
The new resource group information is added in Create New Resource.
In Create New Resource, select Create resource.
Create the Cognitive Services resource
In Create new project, make sure you have the following settings:
- Resource: polar-bear-vision
- Project Types: Classification
- Classification Types: Multiclass (Single tab per image)
- Domains: General
Note
A domain optimizes a model for specific types of images. For example, if your goal is to classify food images by the types of food they contain or by the ethnicity of the dishes, it's helpful to select the Food domain. For scenarios that don't match any of the offered domains, or if you're unsure which domain to choose, select the General domain.
Verify your settings
When all settings are correct, select Create project. Your project opens.
Get the images to use in the project
Now, you're ready to get the images to use in your project.
Download the .zip file of Arctic fox training images. Unzip the file.
To add the Arctic fox images to your project, select Add images.
Add images to the Custom Vision project
In Open, select all the images of Arctic foxes found in the unzipped file folder. To choose the images, select Open.
Open the images to use in the project
For the My Tags setting for the images, enter arctic-fox. Then, select Upload 130 files. Wait for the upload to finish.
Add the image tag and upload the images to the project
When the images are successfully uploaded, a summary is shown. Select Done.
Repeat these steps for the polar bear images:
Download the .zip file of polar bear training images. Unzip the file.
To upload the polar bear images in the unzipped file folder to your project, select Add images. For these images, set My Tags to polar-bear. Wait for the upload to finish—there are 140 files in this folder. When the upload is finished, select Done.
Finally, repeat these steps for the walrus images:
Download the .zip file of walrus training images. Unzip the file.
To upload the walrus images in the unzipped file folder to your project, select Add images. Set My Tags to walrus. Wait for the upload to finish—there are 138 files in this folder. When the upload is finished, select Done.
With the images tagged and uploaded, the next step is to train the model so that it can distinguish between Arctic foxes, polar bears, and walruses and determine whether an image contains one of these animals.
Train and test the model
Now it's time to train the model by using the images that you tagged and uploaded. After you train the model, you'll test it to determine how adept it is at identifying Arctic wildlife in photos. After the model is trained, it can be refined by uploading more tagged images and retraining it.
To train the model, select Train. When you're prompted to select a training type, select Quick Training.
Train the model
Wait for the training process to finish. (The process might take a few minutes.) Then, review the training statistics that are shown for iteration 1.
Note
Each time you train the model, a new version (iteration) is created. Custom Vision maintains all your iterations, so you can choose which version of the model to use.
Results of training the model
Precision and recall are two separate but related measures of the model's accuracy. Suppose the model is presented with three polar bear images and three walrus images. It correctly identifies two of the images that contain polar bears as polar bear images, but it incorrectly identifies two of the walrus images as polar bear images. In this case, precision would be 50 percent (two of the four images it classified as polar bear images actually are polar bear images). Its recall would be 67 percent (it correctly identified two of the three polar bear images as polar bear images). Learn more about precision and recall in Precision and recall.
Average precision (AP) is a third measurement of the model's accuracy. Whereas precision measures the false-positive rate and recall measures the false-negative rate, AP is the mean of false-positive rates that are computed across a range of thresholds. For more information, see Understanding the mAP Evaluation Metric for Object Detection.
Now, test the model by using the portal's Quick Test feature. You can use Quick Test to submit images to the model and see how the model classifies the images by using the knowledge gained during training.
Download the .zip file of test images. Unzip the file locally on your computer.
Select Quick Test.
Select Browse local files, and then go to the testing-images/polar-bear directory. Select any of the test images in that directory.
In Quick Test, examine the results of the test. What's the probability that the image contains a polar bear? What's the probability that the image contains an Arctic fox or a walrus?
Repeat this test by using one of the images in the testing-images/arctic-fox directory. How well is the model able to differentiate between Arctic foxes and polar bears?
The testing-images directory contains subdirectories that have a total of 30 different images for testing. Do more quick tests by using these images until you're satisfied that the model is reasonably adept at predicting whether an image contains a polar bear.
Return to your project. Select the Performance tab, and then select Publish.
In Publish Model, for Prediction resource, select
polar-bear-vision
, the name of your resource. Make sure you choose your resource name, not the resource group name. To publish the model, select Publish.Publish the model
Select Prediction URL.
In How to use the Prediction API, copy the following values and save them to use later:
- Under If you have an image URL, copy and then save the URL in your favorite text editor, so you can get it later.
- For Set Prediction-Key Header to, copy and then save the key value. This value must be passed in each call to the prediction URL.
Copy the Prediction API URL
To finish, select Got it!.
Now, you have a machine learning model that can discern whether an image contains a polar bear, and you have a URL and API key to invoke the model. The next step is to invoke the model each time an image is uploaded to Blob Storage. Azure Functions is the perfect tool for this job.