How can we convert these functions from Scala to PySpark in Azure Databricks?

manish verma 516 Reputation points
2022-06-09T11:14:19.227+00:00

Hi All,

I have the below code that works fine in Scala, but due to a requirement we need to convert it to PySpark. We are new to PySpark and are unable to do the conversion ourselves.

Scala code

import org.apache.spark.sql.types.{DataType, StructField, StructType}
import org.apache.spark.sql.catalyst.parser.CatalystSqlParser

def toSparkType(inputType: String): DataType = CatalystSqlParser.parseDataType(inputType)

def getSchema(inputType: String): StructType =
  StructType(inputType.split(",")
    .map(line => line.split(" "))
    .map(field => StructField(field(0), toSparkType(field(1)), true)))

val NewFileSchema = getSchema(Source_Schema_V)
print(" newfileschema print " + NewFileSchema)

val NewFileDF = spark.read.option("header", "true").schema(NewFileSchema).csv(Source_File)

Value of Source_Schema_V = "ID int,FirstName string,LastName string,Department string,Created_Date string,LastUpdated_Date string"
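For reference, a minimal PySpark sketch of the same logic (an untested port, assuming Spark 2.3+ and that spark, Source_Schema_V, and Source_File are defined as above; note that _parse_datatype_string is an internal PySpark helper, not public API, which delegates to the same CatalystSqlParser on the JVM side):

PySpark sketch

from pyspark.sql.types import StructField, StructType, _parse_datatype_string

# Port of the Scala toSparkType: let Catalyst parse the type name ("int", "string", ...).
def to_spark_type(input_type):
    return _parse_datatype_string(input_type)

# Port of the Scala getSchema: "name type,name type,..." -> StructType.
def get_schema(input_str):
    fields = [f.strip().split(" ") for f in input_str.split(",")]
    return StructType([StructField(name, to_spark_type(dtype), True)
                       for name, dtype in fields])

NewFileSchema = get_schema(Source_Schema_V)
print(" newfileschema print ", NewFileSchema)

NewFileDF = spark.read.option("header", "true").schema(NewFileSchema).csv(Source_File)

Note that since Spark 2.3, DataFrameReader.schema() also accepts a DDL-formatted string directly, so the helper functions may not be needed at all:

NewFileDF = spark.read.option("header", "true").schema(Source_Schema_V).csv(Source_File)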

Please help with this.
