Exercise - Inspect and prepare data

Completed

Your task for this exercise is to create a new process mining project from a dataset that you receive in a CSV format.

In this exercise, you review the data, verify that the required fields are included in the dataset, identify what other columns to use in your process mining, and then load the dataset.

Objectives

Your objectives for this exercise are to:

  • Review and prepare data.

  • Verify that required fields are included in the dataset.

  • Create the process and load data.

  • Map attributes.

  • Save data for analysis.

Required fields

For process mining to analyze your event logs, you're required to have the following fields: Case ID, Activity, and Event Start.

Prepare data and load

You receive a dataset in CSV format. You want to review the data, make sure that all required fields are included in the dataset, and then create the process.

High-level steps

Try to perform the following high-level steps to complete this exercise. If you get stuck, go to the step-by-step section and follow those instructions instead.

  1. Open and check the dataset.

  2. If necessary, convert the data to a format that's easily readable.

  3. Verify that all required columns are included in the data.

  4. Identify the columns to map for process mining.

  5. Save the data in the correct format.

  6. Create the process and load the data.

  7. Map attributes.

Step by step

This section provides more in-depth guidance for the preceding high-level steps.

Task: Inspect the data

Your first task is to inspect the data by following these steps:

  1. Locate and open the AP_EV_LAB_Inspect.csv file.

    The data in the file should resemble the following image.

    Screenshot of the initial C S V data.

    Note

    A pipe or vertical bar (|) separates the data, which makes it difficult to read. Use the Text to Columns feature of Microsoft Excel to make the data easier to read.

  2. Select the Data tab.

  3. All data for each row is in one cell. Select all rows, including the header, and then select the Text to Columns button.

    Screenshot of the Data tab and the Text to Columns button.

  4. Select Delimited and then select Next.

    Screenshot of step 1 of the Convert Text to Columns Wizard.

  5. Select Other, enter a vertical bar (|) character, and then select Next.

    Screenshot of the Delimiters options.

  6. Select Finish.

    The data should now become easier to read.

    Screenshot of the data after converting text to columns.

Task: Identify columns and save the dataset

For Power Automate Process Mining to analyze your event logs, you're required to have the following fields: Case ID, Activity Name, and Timestamp.

  1. Identify the required fields.

    Screenshot of columns that are identified as the required fields.

  2. Identify other fields that you want to map for process mining.

    Screenshot of all remaining columns.

  3. Optionally, you can take notes on what fields to use. After you finish inspecting the data, select File > Save As.

  4. Save the file as CSV and then give it an appropriate name.

    Screenshot of the file being saved as C S V.

  5. Close the file.

Task 3: Create the process

To create the process, follow these steps:

  1. Go to Microsoft Power Automate and select the correct environment.

    Screenshot of the selected environment.

  2. Select Process mining from the left pane and then select + Start here.

    Screenshot of Process mining selected and the Start here button.

  3. Enter My first mining for the Process name, select Import data for the Data source, select Dataflow, and then select Continue.

    Screenshot of the Create a new process wizard with Import data and Dataflow selected.

  4. Select Skip if the system prompts you to add Power BI workspace.

  5. Select Text/CSV as the data source.

    Screenshot of the selected data source as Text C S V.

  6. Select Upload file and then select the Browse button.

    Screenshot of the Upload file option selected and the Browse button.

  7. Select the CSV file that you created and then select Open.

    Screenshot of the selected C S V file and the Open button.

  8. If you aren't signed in, select the Sign in button and provide your credentials.

  9. Select Next.

    Screenshot of the connection settings.

  10. A preview of the data should display. Select Next.

    Screenshot of the preview data.

  11. The system should advance you to the Transform stage. Leave all other settings as they are and then select Next.

    Screenshot of the Transform data stage.

  12. You should advance to the mapping stage. First, you map the required fields, as shown in the following table.

    Attribute name Attribute type Attribute data type
    Case ID Case ID Text
    Activity Activity Text
    Timestamp Event Start Date/Time

    Screenshot of the required fields mapped.

  13. Map any other fields that you want to include. For this exercise, you map all fields.

  14. Map the rest of the fields as shown in the following table.

    Attribute name Attribute type Attribute data type
    Resource Resource Text
    Resource Name Event Level Attribute Text
    Vendor Case Level Attribute (first event) Text
    Automation Case Level Attribute (first event) Text
    Paid on Time Case Level Attribute (first event) Text
    Reason for Rejection Case Level Attribute (first event) Text
    Cash Discount % Case Level Attribute (first event) Numeric
    Cash Discount Value Financial per Case (first event) Numeric
    Invoice Value Financial per Case (first event) Numeric
    Business Unit Case Level Attribute (first event) Text
    Region Case Level Attribute (first event) Text
  15. After mapping the fields, select Save and analyze.

    Screenshot of the rest of the fields mapped and the Save and analyze button.

  16. Wait for analysis to complete. This process could take a few minutes.

After analysis is complete, the process overview should resemble the following image.

Screenshot of the process map overview.

Task: Review the validation report

Your final task is to review the validation report by following these steps:

  1. In the upper part of your overview, locate the data validation report notification and then select Get data validation report.

    Screenshot of the Get data validation report link in the upper part of the overview.

  2. Open the downloaded data validation report.

  3. By default, the report shows a validation summary. The following image shows a summary that displays the Reason for Rejection field indicating a value of 99% of the records already contained or had values replaced by default value.

    Screenshot of the data validation report summary.

  4. Select Expand all.

    Screenshot of the Expand all toggle switch.

    The report should expand and show all columns. The fields that cause the validation issues should show with a different background color.

    Screenshot of all columns of the data validation report.

  5. After reviewing the validations error, you can decide whether to make corrections and reload the data or not.

  6. Close the data validation report.