Cohort options

This section covers the configuration options for cohort visualizations. For an example, see cohort example.

Columns

To configure column options, click Columns and configure each of the following required settings:

  • Date (bucket): The date that uniquely identifies a cohort. Suppose you’re visualizing monthly user activity by sign-up date. The cohort date for all users who signed up in January 2018 would be January 1, 2018. The cohort date for any user who signed up in February 2018 would be February 1, 2018.
  • Stage: A count of how many stages have elapsed since the cohort date as of this sample. If you group users by sign-up month, the stage is the count of months since those users signed up. In the example above, a measurement of activity in July for users who signed up in January would yield a value of 7, because July is the seventh stage when the sign-up month is counted as the first.
  • Bucket population size: The denominator used to calculate the percentage of a cohort’s target satisfaction for a given stage. Continuing the example above, if 72 users signed up in January, the bucket population size would be 72. When the visualization is rendered, a stage value of 30 would be displayed as 41.67% (30 ÷ 72).
  • Stage value: Your actual measurement of this cohort’s performance in the given stage. In the above example, if 30 users who signed up in January showed activity in July then the stage value would be 30.
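
The four columns are usually computed in the query that feeds the visualization. As a rough illustration of how they relate, the Python sketch below derives them from raw sign-up and activity records. The helper names (month_start, stage_number, cohort_rows), the input shapes, and the output field names are hypothetical and not part of Azure Databricks. The sketch counts the sign-up month as stage 1, which is an assumption chosen to match the January-to-July example above.

```python
from collections import defaultdict
from datetime import date


def month_start(d: date) -> date:
    """Truncate a date to the first day of its month (the cohort date)."""
    return d.replace(day=1)


def stage_number(cohort: date, activity: date) -> int:
    """Months since the cohort date, counting the cohort month as stage 1."""
    return (activity.year - cohort.year) * 12 + (activity.month - cohort.month) + 1


def cohort_rows(signups: dict[str, date], activity: list[tuple[str, date]]):
    """Return one row per (cohort date, stage) with the four column values."""
    # Bucket population size: distinct users per cohort date.
    population = defaultdict(set)
    for user, signed_up in signups.items():
        population[month_start(signed_up)].add(user)

    # Stage value: distinct active users per (cohort date, stage).
    active = defaultdict(set)
    for user, seen in activity:
        if user not in signups:
            continue  # ignore activity from users with no sign-up record
        cohort = month_start(signups[user])
        stage = stage_number(cohort, seen)
        if stage >= 1:  # ignore activity dated before the cohort month
            active[(cohort, stage)].add(user)

    return [
        {
            "date": cohort,
            "stage": stage,
            "population": len(population[cohort]),
            "stage_value": len(users),
        }
        for (cohort, stage), users in sorted(active.items(), key=lambda kv: kv[0])
    ]
```

With 72 sign-ups in January 2018, of whom 30 are active in July 2018, this produces a single row with date 2018-01-01, stage 7, population 72, and stage value 30, which corresponds to 30 ÷ 72 ≈ 41.67%.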

Options

To configure options, click Options and configure each of the following required settings:

  • Time interval: Whether cohorts are defined on a daily, weekly, or monthly basis.
  • Mode: Options are Fill gaps with zeros (default) or Show data as is.

Colors

To configure color options, click Colors and configure the following optional settings:

  • min color
  • max color
  • steps

Appearance

To configure appearance options, click Appearance and configure the following optional settings:

  • Title column title: Override the column name with a different display name.
  • People column title: Override the column name with a different display name.
  • Stage column title: Override the column name with a different display name.
  • Number values format: The format to use for labels for numeric values.
  • Percent values format: The format to use for labels for percentages.
  • No value placeholder: Default is -. Specify a different value if desired.
  • Show tooltips: Tooltips are displayed by default. Clear the checkbox to override.
  • Normalize values to percentage: Values are normalized to percentages by default. Clear the checkbox to override.

Cohort date notes

Even if you define your cohorts by month or week, Azure Databricks expects the values in your Date column to be full date values. If you are grouping by month, 2018-01-18 should be truncated to 2018-01-01 (or replaced with any other full date in January), not shortened to 2018-01.
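
One way to produce such full-date values is to truncate each date to the start of its interval before it reaches the visualization. The sketch below is a minimal illustration, not Azure Databricks code. The truncate_to_interval helper is hypothetical, and treating Monday as the start of the week is an assumption.

```python
from datetime import date, timedelta


def truncate_to_interval(d: date, interval: str) -> date:
    """Return a full date that represents the start of the interval containing d."""
    if interval == "daily":
        return d
    if interval == "weekly":
        # Assumption: weeks start on Monday (date.weekday() returns 0 for Monday).
        return d - timedelta(days=d.weekday())
    if interval == "monthly":
        return d.replace(day=1)
    raise ValueError(f"unsupported interval: {interval!r}")


# The result is still a complete date, for example 2018-01-01 rather than 2018-01.
print(truncate_to_interval(date(2018, 1, 18), "monthly").isoformat())
```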

The cohort visualizer converts all date and time values to GMT before rendering. To avoid rendering issues, adjust the date and time values returned from your database by your local UTC offset.
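
To see why the offset matters, the following sketch converts a local timestamp to UTC and shows the calendar date changing. It is only an illustration using Python's standard library. The to_utc helper and the +2 hour offset are hypothetical, and the direction of the adjustment you need depends on how your database stores and returns timestamps.

```python
from datetime import datetime, timedelta, timezone


def to_utc(local_naive: datetime, utc_offset_hours: float) -> datetime:
    """Interpret a naive timestamp as local time at the given offset and convert it to UTC."""
    local_tz = timezone(timedelta(hours=utc_offset_hours))
    return local_naive.replace(tzinfo=local_tz).astimezone(timezone.utc)


# Midnight on February 1 at UTC+2 is still January 31 in UTC,
# so an unadjusted value could land in the January cohort.
shifted = to_utc(datetime(2018, 2, 1, 0, 0), 2)
print(shifted.isoformat())
```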