How to import dynamically generated csv file as a source in ADF

123ay4h321 21 Reputation points
2020-10-04T13:05:55.647+00:00

In data factory what source setup can be used to get this dynamically generated csv file as a source? I can't seem to get a valid file from this public url which auto creates a file for download (and where we are not providing a filename). I have tried various options but the CSV file I output to blob storage is not correctly created. An example of the csv file is the one created by this URL:

https://api.coronavirus.data.gov.uk/v1/data?filters=areaType=nation;areaName=england&structure={"date":"date","name":"areaName","code":"areaCode","cases":{"daily":"newCasesByPublishDate","cumulative":"cumCasesByPublishDate"},"deaths":{"daily":"newDeathsByDeathDate","cumulative":"cumDeathsByDeathDate"}}format=csv
Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
11,623 questions
0 comments No comments
{count} votes

Accepted answer
  1. MartinJaffer-MSFT 26,236 Reputation points
    2020-10-05T19:38:20.907+00:00

    Hello @123ay4h321 and welcome to MIcrosoft Q&A. Thank you for this question. The example URL helped greatly. Let me share the steps I took to solve your issue.

    I visited the link you shared, with ym browser. It resulted in a download, which I was able to read in a text editor.
    So I then tried using a web activity in ADF on the same URL. The output content was not readable, as shown below. I recalled that attached content is often compressed. So I checked the output again. See the highlighted below.

    30212-image.png

    The response headers included "Content-Encoding" : "gzip" .

    To solve one issue at a time, I tried copying, using HTTP > Binary dataset for source, and BLOB > Binary for sink. When I set the
    source dataset compression to gzip , and the sink dataset compression to none, I was able to read the output in my storage.
    30140-image.png

    30120-image.png

    TLDR: Match the source dataset compression to that in the headers.

    If this answer solved your issue, please mark as accepted. Otherwise let me know what I can do to help better.

    1 person found this answer helpful.
    0 comments No comments

1 additional answer

Sort by: Most helpful
  1. 123ay4h321 21 Reputation points
    2020-10-06T18:24:08.61+00:00

    Many thanks @MartinJaffer-MSFT this solves the issue perfectly, Much appreciated!


Your answer

Answers can be marked as Accepted Answers by the question author, which helps users to know the answer solved the author's problem.