RelationalGroupedDataset Class

Definition

A set of methods for aggregations on a DataFrame.

public sealed class RelationalGroupedDataset
type RelationalGroupedDataset = class
Public NotInheritable Class RelationalGroupedDataset
Inheritance
RelationalGroupedDataset

Methods

Agg(Column, Column[])

Compute aggregates by specifying a series of aggregate columns.

Apply(StructType, Func<DataFrame,DataFrame>)

Maps each group of the current DataFrame using a UDF and returns the result as a DataFrame.

The user-defined function should take an DataFrame and return another DataFrame. For each group, all columns are passed together as an DataFrame to the user-function and the returned FxDataFrame are combined as a DataFrame.

The returned DataFrame can be of arbitrary length and its schema must match returnType.

Apply(StructType, Func<RecordBatch,RecordBatch>)

Maps each group of the current DataFrame using a UDF and returns the result as a DataFrame.

The user-defined function should take an Apache Arrow RecordBatch and return another Apache Arrow RecordBatch. For each group, all columns are passed together as a RecordBatch to the user-function and the returned RecordBatch are combined as a DataFrame.

The returned Apache.Arrow.RecordBatch can be of arbitrary length and its schema must match returnType.

Avg(String[])

Compute the average value for each numeric columns for each group.

Count()

Count the number of rows for each group.

Max(String[])

Compute the max value for each numeric columns for each group.

Mean(String[])

Compute the mean value for each numeric columns for each group.

Min(String[])

Compute the min value for each numeric columns for each group.

Pivot(Column)

Pivots a column of the current DataFrame and performs the specified aggregation.

Pivot(Column, IEnumerable<Object>)

Pivots a column of the current DataFrame and performs the specified aggregation.

Pivot(String)

Pivots a column of the current DataFrame and performs the specified aggregation.

Pivot(String, IEnumerable<Object>)

Pivots a column of the current DataFrame and performs the specified aggregation.

Sum(String[])

Compute the sum for each numeric columns for each group.

Applies to