Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Returns the number of days from start to end.
For the corresponding Databricks SQL function, see date_diff function.
Syntax
from pyspark.databricks.sql import functions as dbf
dbf.date_diff(end=<end>, start=<start>)
Parameters
| Parameter | Type | Description |
|---|---|---|
end |
pyspark.sql.Column or str |
to date column to work on. |
start |
pyspark.sql.Column or str |
from date column to work on. |
Returns
pyspark.sql.Column: difference in days between two dates.
Examples
from pyspark.databricks.sql import functions as dbf
df = spark.createDataFrame([('2015-04-08','2015-05-10')], ['d1', 'd2'])
df.select('*', dbf.date_diff('d1', 'd2')).show()
df.select('*', dbf.date_diff(df.d2, df.d1)).show()