Dimensions for Azure Stream Analytics metrics

Azure Stream Analytics provides a serverless, distributed streaming processing service. Jobs can run on one or more distributed streaming nodes, which the service automatically manages. The input data is partitioned and allocated to different streaming nodes for processing.

Stream Analytics has many metrics available to monitor a job's health. To troubleshoot performance problems with your job, you can split and filter metrics by using the following dimensions.

Dimension Definition
Logical Name The input or output name for a Stream Analytics job.
Partition ID The ID of the input data partition from an input source. For example, if the input source is an event hub, the partition ID is the event hub's partition ID. For embarrassingly parallel jobs, Partition ID in the output is the same as it is in the input.
Node Name The identifier of a streaming node that's provisioned when your job runs. A streaming node represents the amount of compute and memory resources allocated to your job.

Screenshot of a chart that shows the area for selecting a dimension for Stream Analytics job metrics.

Screenshot that shows splitting a metric by dimension.

Screenshot that shows filtering a metric by dimension.

Logical Name dimension

Logical Name is the input or output name for a Stream Analytics job. For example, assume that a Stream Analytics job has four inputs and five outputs. You'll see the four individual logical inputs and five individual logical outputs when you split input-related and output-related metrics by this dimension.

Screenshot that shows multiple inputs and outputs in a Stream Analytics job.

Screenshot of a chart that shows splitting the Output Events metric by Logical Name.

The Logical Name dimension is available for filtering and splitting the following metrics:

  • Backlogged Input Events
  • Data Conversion Errors
  • Early Input Events
  • Input Deserialization Errors
  • Input Event Bytes
  • Input Events
  • Input Source Received
  • Late Input Events
  • Out-of-Order Events
  • Output Events
  • Watermark Delay

Node Name dimension

A streaming node represents a set of compute resources that's used to process your input data. Every six streaming units (SUs) translate to one node, which the service automatically manages on your behalf. For more information about the relationship between streaming units and streaming nodes, see Understand and adjust streaming units.

Node Name is a dimension at the streaming node level. It can help you to drill down certain metrics to the specific streaming node level. For example, you can split the CPU % Utilization metric by streaming node level to check the CPU utilization of an individual streaming node.

Screenshot of a chart that shows splitting average CPU utilization by the Node Name dimension.

The Node Name dimension is available for filtering and splitting the following metrics:

  • Backlogged Input Events
  • CPU % Utilization (preview)
  • Input Events
  • Output Events
  • SU (Memory) % Utilization
  • Watermark Delay

Partition ID dimension

When streaming data is ingested into the Azure Stream Analytics service for processing, the input data is distributed to streaming nodes according to the partitions in the input source. The Partition ID dimension is the ID of the input data partition from the input source.

For example, if the input source is an event hub, the partition ID is the event hub's partition ID. Partition ID in the input is the same as it is in the output.

Diagram that shows splitting a watermark delay by the Partition ID dimension.

The Partition ID dimension is available for filtering and splitting the following metrics:

  • Backlogged Input Events
  • Data Conversion Errors
  • Early Input Events
  • Input Deserialization Errors
  • Input Event Bytes
  • Input Events
  • Input Source Received
  • Late Input Events
  • Output Events
  • Watermark Delay

Next steps