Process Multi-Line Fixed Width Input Files in ADF

Jon Meyer 21 Reputation points
2021-02-23T14:42:11.47+00:00

We need to process traditional mainframe batch files using ADF. These text files are fixed record length AND multi-line per logical record, meaning all the columns to be extracted span across multiple rows of the input file. Any way to do this in ADF natively without preprocessing the file?

Example:

HGAXX2021-02-23 // Header Row
GAAELEMENT1 ELEMENT2 ELEMENT3 ELEMENT4 // Record 1
GABELEMENT5 ELEMENT6 ELEMENT7 ELEMENT8 // Record 1
GAAELEMENT1 ELEMENT2 ELEMENT3 ELEMENT4 // Record 2
GABELEMENT5 ELEMENT6 ELEMENT7 ELEMENT8 // Record 2
GAAELEMENT1 ELEMENT2 ELEMENT3 ELEMENT4 // Record 3
GABELEMENT5 ELEMENT6 ELEMENT7 ELEMENT8 // Record 3
FGAXX000003 // Footer Row with Count of Records

Thank you!
Jon

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
0 comments No comments
{count} votes

Answer accepted by question author
  1. MarkKromer-MSFT 5,231 Reputation points Microsoft Employee Moderator
    2021-02-24T01:26:14.363+00:00

    Add a Data Flow to your Pipeline. In the Source transformation, select "Multiline rows".

    71331-image.png


0 additional answers

Sort by: Most helpful

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.