How to copy data/files from ADLS to HDFS?

Kajol Patira 1 Reputation point
2021-05-21T15:31:41.793+00:00

How can I copy data/files from ADLS to HDFS?
I have a few approaches in mind:

  1. Using ADF activity
  2. Using DistCp
  3. Using Python Libraries
    Please explain in detail whether any of these will work.
Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

1 answer

  1. KranthiPakala-MSFT 46,462 Reputation points Microsoft Employee
    2021-05-21T20:16:22.117+00:00

    Hi @Kajol Patira ,

    Welcome to Microsoft Q&A forum and thanks for reaching out.

    You cannot achieve this directly with the Copy activity in ADF, because HDFS is not supported as a sink. Currently, HDFS is supported only as a source. One way to achieve this is to write your own custom code and run it from an Azure Function activity or a Custom activity in ADF.

    Alternatively, you can try the DistCp tool, as explained in this document: https://learn.microsoft.com/azure/storage/blobs/data-lake-storage-use-distcp.
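
To make the DistCp option more concrete, here is a minimal Python sketch that assembles a `hadoop distcp` command and, optionally, runs it. It rests on several assumptions that are not stated in this thread: the source is ADLS Gen2 reached through an `abfss://` URI, the Hadoop cluster already has the ABFS driver and storage credentials configured, and the script runs on a node where the `hadoop` CLI is on the PATH. The account, container, NameNode host, and paths are placeholders, and `build_distcp_command` is a hypothetical helper name, not part of any library.

```python
import shlex
import subprocess


def build_distcp_command(source_uri, target_uri, max_maps=None, update=False):
    """Return the argument list for a `hadoop distcp` copy (hypothetical helper)."""
    cmd = ["hadoop", "distcp"]
    if update:
        # -update copies only files that are missing or differ at the target.
        cmd.append("-update")
    if max_maps is not None:
        # -m caps the number of simultaneous map tasks used for the copy.
        cmd += ["-m", str(max_maps)]
    cmd += [source_uri, target_uri]
    return cmd


# Placeholder URIs (assumptions): replace with your own account, container,
# NameNode address, and paths.
SOURCE = "abfss://mycontainer@myaccount.dfs.core.windows.net/data/input"
TARGET = "hdfs://namenode:8020/data/input"

if __name__ == "__main__":
    command = build_distcp_command(SOURCE, TARGET, max_maps=20)
    # Print the command so it can be reviewed before anything is copied.
    print(shlex.join(command))
    # Uncomment to launch the copy on a cluster node with the hadoop CLI:
    # subprocess.run(command, check=True)
```

Building the command as a list rather than a single string avoids shell-quoting problems when a path contains spaces, and printing it first gives you a chance to check the source and target before starting a large copy.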

    Hope this info helps.

    ----------

    Please don’t forget to Accept Answer and Up-Vote wherever the information provided helps you; this can be beneficial to other community members.

