replace the column delimiter ¶ with another delimiter supported by synapse

Question

replace the column delimiter ¶ with another delimiter supported by synapse

Sara Fadaei | b.telligent 46

Hello everybody,

I'm extracting data in .csv format, which has a paragraph mark (¶) as the field delimiter. As I see this format is not defined in the list of column separators in the synapse. I've tried different ways to replace the delimiter, for example, by creating an external table but was not successful. Any Idea?

User's image

PRADEEPCHEEKATLA 90,651 Reputation points Moderator

2023-03-21T10:43:53.13+00:00

@Sara Fadaei | b.telligent - Glad to know that the below response was helpful. Thank you for marking this answer as accepted. This will help others find useful answers faster.

Please feel free to take a survey on the relevant answer. You can help us improve by leaving feedback verbatim.

Have a good day!

Regards,

PRADEEPCHEEKATLA-MSFT

Accepted answer

0 additional answers

Your answer

PRADEEPCHEEKATLA 90,651 Reputation points Moderator

2023-03-21T10:43:53.13+00:00

@Sara Fadaei | b.telligent - Glad to know that the below response was helpful. Thank you for marking this answer as accepted. This will help others find useful answers faster.

Please feel free to take a survey on the relevant answer. You can help us improve by leaving feedback verbatim.

Have a good day!

Regards,

PRADEEPCHEEKATLA-MSFT

Answer 1

AnnuKumari-MSFT 34,556 Microsoft Employee Moderator

@Sara Fadaei | b.telligent ,

Thankyou for using Microsoft Q&A platform and thanks for posting your query here.

As I understand your question, you want to replace the paragraph mark to some character which is supported as column delimiter by synapse. Please let me know if that is not the ask here.

You can use the replace function in derived column transformation of mapping data flow to replace the paragraph mark (¶) with a different character that is supported as a column separator Check here for more details: replace function , regex replace function in mapping dataflow.
You can also leverage replace()/regexp_replace() function in PySpark to write a notebook in synapse to achieve this requirement Reference: Replace , Regexp_replace in spark.

Hope it helps. Kindly accept the answer and mark it as helpful. Thankyou

Sara Fadaei | b.telligent 46 Reputation points

2023-03-16T14:25:37.87+00:00

What I have done instead of using data flow is utilize the vs code to identify the Encoding which is Windows-1552 in my case. When transferring data in Parquet format, the delimiter (¶) and encoding (Windows-1252) should be specified in the source of copy activity to load the data into a legible table
AnnuKumari-MSFT 34,556 Reputation points Microsoft Employee Moderator

2023-03-16T14:26:28.88+00:00

Sara Fadaei | b.telligent ,

Thanks for sharing your approach with the community. Glad to know the issue has been resolved.

Share via

replace the column delimiter ¶ with another delimiter supported by synapse

0 additional answers

Your answer