DataFrame.Repartition 方法

参考

定义

命名空间:: Microsoft.Spark.Sql

程序集:: Microsoft.Spark.dll

包:: Microsoft.Spark v1.0.0

重要

一些信息与预发行产品相关，相应产品在发行之前可能会进行重大修改。对于此处提供的信息，Microsoft 不作任何明示或暗示的担保。

重载

Repartition(Int32, Column[])	将给定分区表达式分区的新 `DataFrame` 返回到 `numPartitions`中。生成的 `DataFrame` 是哈希分区。
Repartition(Int32)	返回一个刚`numPartitions`好具有分区的新 `DataFrame` 。
Repartition(Column[])	返回由给定分区表达式分区的新 `DataFrame` ，使用 `spark.sql.shuffle.partitions` 作为分区数。

Repartition(Int32, Column[])

将给定分区表达式分区的新 DataFrame 返回到 numPartitions中。生成的 DataFrame 是哈希分区。

public Microsoft.Spark.Sql.DataFrame Repartition (int numPartitions, params Microsoft.Spark.Sql.Column[] partitionExprs);

member this.Repartition : int * Microsoft.Spark.Sql.Column[] -> Microsoft.Spark.Sql.DataFrame

Public Function Repartition (numPartitions As Integer, ParamArray partitionExprs As Column()) As DataFrame

参数

numPartitions: Int32

分区数

partitionExprs: Column[]

分区表达式

DataFrame

DataFrame 对象

适用于

Repartition(Int32)

返回一个刚numPartitions好具有分区的新 DataFrame 。

public Microsoft.Spark.Sql.DataFrame Repartition (int numPartitions);

member this.Repartition : int -> Microsoft.Spark.Sql.DataFrame

Public Function Repartition (numPartitions As Integer) As DataFrame

参数

numPartitions: Int32

分区数

DataFrame

DataFrame 对象

适用于

Repartition(Column[])

返回由给定分区表达式分区的新 DataFrame ，使用 spark.sql.shuffle.partitions 作为分区数。

public Microsoft.Spark.Sql.DataFrame Repartition (params Microsoft.Spark.Sql.Column[] partitionExprs);

member this.Repartition : Microsoft.Spark.Sql.Column[] -> Microsoft.Spark.Sql.DataFrame

Public Function Repartition (ParamArray partitionExprs As Column()) As DataFrame

参数

partitionExprs: Column[]

分区表达式

DataFrame

DataFrame 对象

适用于

DataFrame.Repartition 方法

定义

重载

Repartition(Int32, Column[])

参数

返回

适用于

Repartition(Int32)

参数

返回

适用于

Repartition(Column[])

参数

返回

适用于

反馈

其他资源