Setup

Completed

Prerequisites

In order to follow the hands-on exercises in this module, you'll need the following:

Create a storage account

Note

If you already have an ADLS Gen2 or Azure Storage account in your subscription that you would like to use, you can skip this section.

To provide data for Microsoft Purview to classify, we'll set up an ADLS Gen 2 account with some sample data. Follow the steps below to create the resource.

  1. Sign in to the Azure portal with your Azure account.

  2. Select Create a resource.

    Screenshot of Azure portal with the Create a Resource button selected in the Azure services menu at the top of the window.

  3. Search for and select Storage account.

  4. Select the Storage account plan, and select Create.

    Screenshot of Azure Marketplace, with storage account in the search bar, and the create menu selected under Storage account.

  5. Select an existing resource group, or create a new one.

  6. Create a unique storage account name.

  7. Select Standard performance.

  8. Select Locally-redundant storage.

    Screenshot of the Create a storage account window basics tab with subscription, resource group, storage account name, performance, and redundancy options highlighted.

  9. Select Next: Advanced.

  10. Select Enable hierarchical namespace.

    Screenshot of  the Create a storage account window advanced tab with the Enable hierarchical namespace option highlighted and selected.

  11. Select Review + create.

  12. After validation succeeds, select Create.

    Screenshot of  the Create a storage account window review + create tab, with validation showing as passed and the create button highlighted.

Add data to your storage account

Note

If your storage account already has data you would like to scan, you can skip this section.

  1. Download a copy of the Bing Coronavirus Query Set to your local machine.

    Note

    This data set was originally sourced from Microsoft Research Open Data.

  2. Locate the downloaded zip file via File Explorer and unzip the contents by right-clicking or holding on the file and selecting Extract All...

    Screenshot of the BingCoronavirusQuerySet zip file with the menu open and Extract All highlighted.

  3. Select Extract.

    Screenshot of the extract menu with the Extract button highlighted.

  4. In the Azure portal, locate your storage account, and in the left menu select Storage browser.

    Screenshot of A D L S Gen 2 storage account with Storage browser selected from the left side menu.

  5. Select Upload from the top menu.

    Screenshot of A D L S Gen 2 storage browser, with the upload button selected in the menu at the top.

  6. In the new menu either browse for files, or drag all your files from the query set we extracted earlier. Create a new container, named raw.

    Screenshot of A D L S Gen 2 storage browser upload menu, with **browse for files** highlighted, the container dropdown menu highlighted with a container named raw, and the upload button in the middle of the page highlighted.

  7. Select Upload, and at the bottom of the screen you'll see the upload status of each file as it is added.

  8. Once all your files are uploaded, you can exit the upload menu and select our new raw container.

  9. In the raw container, you should see all the files you uploaded.

    Screenshot of A D L S Gen 2 storage browser with the new raw container opened, and all the uploaded files listed below.