Pat index in databricks

Question


Shambhu Rai 1,411 Reputation points

Hi experts, how do I use PATINDEX in Databricks? Please help me with an example.

PRADEEPCHEEKATLA 90,646 Reputation points Moderator

2024-01-25T05:08:25.61+00:00

@Shambhu Rai - Just checking in to see whether the answer below from @Amira Bedhiafi helped. If it answers your query, please click Accept Answer and Yes for "Was this answer helpful". If you have any further questions, do let us know.


Answer 1

PATINDEX is not available as a built-in function in Databricks, which runs on Apache Spark, but you can get similar behavior with Spark SQL functions. In SQL Server, PATINDEX returns the 1-based starting position of the first occurrence of a pattern in a string, or 0 if the pattern is not found. In Databricks (Spark SQL), instr returns the same kind of 1-based position (0 when there is no match) for a literal substring. For a regular-expression pattern, you can combine regexp_extract, to get the matched text, with instr, to locate that text in the original string. See also this older thread: https://stackoverflow.com/questions/58329209/patindex-in-spark-sql

from pyspark.sql import functions as F

# Example DataFrame
data = [("Hello abc world",), ("abc starts here",), ("no match here",)]
df = spark.createDataFrame(data, ["text"])

# Pattern to search for
pattern = "abc"

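# Add a column with the 1-based starting position of the pattern (0 when there is no match).
# instr searches the original "text" column for the literal substring, similar to PATINDEX('%abc%', text).
df = df.withColumn("pat_index", F.instr(F.col("text"), pattern))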

df.show()
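
If you want to sanity-check the positions on a few sample strings without a cluster, here is a minimal plain-Python sketch of the same convention (1-based position, 0 when nothing matches). The helper name patindex_like is hypothetical and is not a Databricks or Spark function. It uses Python's re module, whose regex syntax is close to, but not identical to, the Java regex syntax Spark uses, so treat it only as a rough reference.

```python
import re


def patindex_like(text, regex):
    """Return the 1-based start of the first regex match in text, or 0 if none.

    Hypothetical helper for illustration only; not part of Spark or Databricks.
    """
    if text is None:
        # Assumption: mirror the usual SQL behavior of returning NULL for NULL input.
        return None
    match = re.search(regex, text)
    # match.start() is 0-based, so add 1 to follow the SQL convention.
    return match.start() + 1 if match else 0


samples = ["Hello abc world", "abc starts here", "no match here"]
positions = [patindex_like(s, "abc") for s in samples]
print(positions)  # [7, 1, 0]
```

These values are what the pat_index column should contain for the same three rows when the literal pattern "abc" is searched in the text column. In a real job the built-in Spark functions are preferable to a Python helper like this one.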

Shambhu Rai 1,411 Reputation points

2024-01-25T05:44:18.51+00:00

How do I use this in Delta SQL? df = df.withColumn("pat_index", F.instr(F.regexp_extract("text", pattern, 0), pattern)) df.show()
