DataFrame.Checkpoint(Boolean) Method

Definition

Returns a checkpointed version of this DataFrame.

public Microsoft.Spark.Sql.DataFrame Checkpoint (bool eager = true);
member this.Checkpoint : bool -> Microsoft.Spark.Sql.DataFrame
Public Function Checkpoint (Optional eager As Boolean = true) As DataFrame

Parameters

eager
Boolean

Whether to checkpoint this DataFrame immediately

Returns

Checkpointed DataFrame

Remarks

Checkpointing can be used to truncate the logical plan of this `DataFrame`, which is especially useful in iterative algorithms where the plan may grow exponentially. It will be saved to files inside the checkpoint directory set with SetCheckpointDir(String).

Applies to