Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Returns a count of the number of times that the Java regex pattern regexp is matched in the string str.
For the corresponding Databricks SQL function, see regexp_count function.
Syntax
from pyspark.databricks.sql import functions as dbf
dbf.regexp_count(str=<str>, regexp=<regexp>)
Parameters
| Parameter | Type | Description |
|---|---|---|
str |
pyspark.sql.Column or str |
target column to work on. |
regexp |
pyspark.sql.Column or str |
regex pattern to apply. |
Examples
from pyspark.databricks.sql import functions as dbf
df = spark.createDataFrame([("1a 2b 14m", r"\d+")], ["str", "regexp"])
df.select('*', dbf.regexp_count('str', dbf.lit(r'\d+'))).show()
df.select('*', dbf.regexp_count('str', dbf.lit(r'mmm'))).show()
df.select('*', dbf.regexp_count("str", dbf.col("regexp"))).show()
df.select('*', dbf.regexp_count(dbf.col('str'), "regexp")).show()