Automatically download csv file from a public website using Azure

Sarvesh Pandey 71 Reputation points
2023-01-03T06:11:14.29+00:00

Hi All,

I need to download a CSV file which is publicly available and this file gets updated regularly.
I have designed a flow as per my requirements but i have to download the file manually.

Is there any way to make a process which download the file automatically and store it in ADLS location?

Which tool i can use for this?

275591-image.png

Azure Data Factory
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.
0 comments No comments
{count} votes

7 answers

Sort by: Most helpful
  1. MartinJaffer-MSFT 26,161 Reputation points
    2023-01-24T22:10:35.5866667+00:00

    @Sarvesh Pandey I shared this conundrum with a coworker, who referred me to this article:

    [https://community.powerbi.com/t5/Power-Query/Downloading-stock-data-from-NSE-India/td-p/1805031

    Of particular interest:

    I also found that there is a python package (nsepy) that scraps the data from NSE site.

    I'm thinking try using this package in either Data Factory Custom Activity, or Azure Synapse Python Notebook. Both should be able to write to Azure Data Lake.

    0 comments No comments

  2. Sarvesh Pandey 71 Reputation points
    2023-01-26T19:12:25.43+00:00

    Hi @MartinJaffer-MSFT ,

    I think i need support help to build the flow.

    Currently i am trying to the read the JSON file in Azure Databricks with Standard compute configuration but it's taking too much time to read.

    I am using below command -

    pd.read_json('[https://www1.nseindia.com/live_market/dynaContent/live_analysis/pre_open/all.json')

    and file size is 436kb.

    I also used Command Prompt got this error message -

      File "C:\Users\sarve\AppData\Local\Programs\Python\Python311\Lib\http\client.py", line 287, in _read_status
        raise RemoteDisconnected("Remote end closed connection without"
    http.client.RemoteDisconnected: Remote end closed connection without response
    
    0 comments No comments

Your answer

Answers can be marked as 'Accepted' by the question author and 'Recommended' by moderators, which helps users know the answer solved the author's problem.