replace the column delimiter ¶ with another delimiter supported by synapse

Sara Fadaei | b.telligent 46 Reputation points
2023-03-13T14:40:15.0333333+00:00

Hello everybody,

I'm extracting data in .csv format, which has a paragraph mark (¶) as the field delimiter. As I see this format is not defined in the list of column separators in the synapse. I've tried different ways to replace the delimiter, for example, by creating an external table but was not successful.  Any Idea?

User's image

Azure Synapse Analytics
Azure Synapse Analytics
An Azure analytics service that brings together data integration, enterprise data warehousing, and big data analytics. Previously known as Azure SQL Data Warehouse.
5,379 questions
{count} votes

Accepted answer
  1. AnnuKumari-MSFT 34,556 Reputation points Microsoft Employee Moderator
    2023-03-14T09:15:58.9433333+00:00

    @Sara Fadaei | b.telligent ,

    Thankyou for using Microsoft Q&A platform and thanks for posting your query here.

    As I understand your question, you want to replace the paragraph mark to some character which is supported as column delimiter by synapse. Please let me know if that is not the ask here.

    • You can use the replace function in derived column transformation of mapping data flow to replace the paragraph mark (¶) with a different character that is supported as a column separator Check here for more details: replace function , regex replace function in mapping dataflow.
    • You can also leverage replace()/regexp_replace() function in PySpark to write a notebook in synapse to achieve this requirement Reference: Replace , Regexp_replace in spark.

    Hope it helps. Kindly accept the answer and mark it as helpful. Thankyou


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.