Generates a random column with independent and identically distributed (i.i.d.) samples from the standard normal distribution. Supports Spark Connect.
For the corresponding Databricks SQL function, see randn function.
Syntax
from pyspark.databricks.sql import functions as dbf
dbf.randn(seed=<seed>)
Parameters
| Parameter | Type | Description |
|---|---|---|
| seed | int (default: None) | Seed value for the random generator. |
Returns
pyspark.sql.Column: A column of random values.
Examples
from pyspark.databricks.sql import functions as dbf
spark.range(0, 2, 1, 1).select("*", dbf.randn()).show()
+---+--------------------------+
| id|randn(3968742514375399317)|
+---+--------------------------+
|  0|      -0.47968645355788...|
|  1|       -0.4950952457305...|
+---+--------------------------+
from pyspark.databricks.sql import functions as dbf
spark.range(0, 2, 1, 1).select("*", dbf.randn(seed=42)).show()
+---+------------------+
| id|         randn(42)|
+---+------------------+
|  0| 2.384479054241...|
|  1|0.1920934041293...|
+---+------------------+
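The role of the seed can be illustrated outside Spark. The following is a minimal sketch using only the Python standard library, not the Databricks implementation: `standard_normal_samples` is a hypothetical helper, and its generator is unrelated to the one Spark uses, so the values will not match the output of `dbf.randn`. It only shows the general behavior a seed is meant to provide: the same seed reproduces the same samples, and the samples have a mean near 0 and a standard deviation near 1.

```python
import random
import statistics


def standard_normal_samples(n, seed=None):
    """Hypothetical helper: draw n i.i.d. samples from N(0, 1).

    Uses a private generator so the seed does not affect global state.
    With seed=None, Python picks a seed itself.
    """
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


# Same seed -> identical sequence; different seed -> different sequence.
a = standard_normal_samples(10_000, seed=42)
b = standard_normal_samples(10_000, seed=42)
c = standard_normal_samples(10_000, seed=7)

same_seed_repeats = a == b
different_seed_differs = a != c

# For a standard normal distribution these should be close to 0 and 1.
sample_mean = statistics.fmean(a)
sample_std = statistics.stdev(a)

print(same_seed_repeats, different_seed_differs)
print(round(sample_mean, 2), round(sample_std, 2))
```

When reproducible output matters, for example in tests, pass an explicit `seed`, as in the second example above.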