How to read multiple csv files into one data frame using azure synapse notebooks

Question

SaiSekhar, MahasivaRavi (Philadelphia) 140

Hi Team,

Could you please help us read multiple CSV files from different subfolders into a single DataFrame in an Azure Synapse notebook using PySpark?

E.g.: 'abfss://@testsalesdatalake.dfs.core.windows.net/Bronze/properties/2024/01/26/test1.csv', 'abfss://@testsalesdatalake.dfs.core.windows.net/Bronze/properties/2024/02/02/test1.csv', 'abfss://******@testsalesdatalake.dfs.core.windows.net/Bronze/properties/2024/02/03/test2.csv'

We need to read the files above in Azure Synapse notebooks. Please share your thoughts.

Bhargava-MSFT 31,261 Reputation points Microsoft Employee Moderator

2024-04-04T20:57:38.1+00:00

Hello SaiSekhar, MahasivaRavi (Philadelphia),

I am checking to see if you had a chance to look into my earlier response.

1 answer

Answer 1

Hello SaiSekhar, MahasivaRavi (Philadelphia),

Reading multiple CSV files with PySpark is discussed here: https://sparkbyexamples.com/spark/spark-read-multiple-csv-files/

You can try the code below and let me know how it goes.

```python
# `spark` is the SparkSession that Synapse notebooks provide by default.

# Define the file paths (the container name before "@" is masked in the original post)
file_paths = [
    'abfss://@testsalesdatalake.dfs.core.windows.net/Bronze/properties/2024/01/26/test1.csv',
    'abfss://@testsalesdatalake.dfs.core.windows.net/Bronze/properties/2024/02/02/test1.csv',
    'abfss://******@testsalesdatalake.dfs.core.windows.net/Bronze/properties/2024/02/03/test2.csv',
]

# Read the CSV files into a single DataFrame
df = (
    spark.read.format("csv")
    .option("header", "true")
    .option("inferSchema", "true")
    .load(file_paths)
)

# Show the DataFrame
df.show()
```
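If the list of files grows, typing every path by hand gets tedious. Since the folders in the question follow a `yyyy/MM/dd` layout, one option is to generate a glob pattern per day and pass the resulting list to the reader. The sketch below is only an illustration under assumptions: `daily_csv_globs` is a hypothetical helper (not part of PySpark or Synapse), and `<container>` is a placeholder because the container name is masked in the question.

```python
from datetime import date, timedelta

# Placeholder base folder: "<container>" is an assumption, since the real
# container name is masked in the question.
BASE = "abfss://<container>@testsalesdatalake.dfs.core.windows.net/Bronze/properties"


def daily_csv_globs(base, start, end):
    """Hypothetical helper: return one glob per day in [start, end] (inclusive),
    each matching every CSV file in that day's yyyy/MM/dd folder."""
    n_days = (end - start).days
    return [
        f"{base}/{(start + timedelta(days=n)).strftime('%Y/%m/%d')}/*.csv"
        for n in range(n_days + 1)
    ]


paths = daily_csv_globs(BASE, date(2024, 2, 2), date(2024, 2, 3))

# In a Synapse notebook, where `spark` is already defined, the list can be
# passed to the reader the same way as an explicit list of files:
# df = (
#     spark.read.format("csv")
#     .option("header", "true")
#     .option("inferSchema", "true")
#     .load(paths)
# )
```

Note that Spark generally raises a "path does not exist" error when a pattern matches nothing, so days without a folder would need to be filtered out first. If every subfolder under a month or year should be read, a single wildcard path such as `.../Bronze/properties/2024/*/*/*.csv` may be simpler than generating the list.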
