How to move files from one resource group to another using Data Factory

Vieira Dias, Leonor 20 Reputation points
2023-07-19T13:51:25.33+00:00

Hi there,

I would like to move (copy, then delete) files from container A1 in storage account A2 in resource group A3 to container B1 in storage account B2 in resource group B3, all under the same subscription. I am aware of the "Move" function, but that has to be done manually each time. I wanted a more systematic process so that I can apply it to other resource groups, etc. From what I have read, ADF seems to be the best option, but correct me if not. What should I do to connect both storage accounts using Data Factory and move files between them?

Thank you for your help in advance.

Azure Data Factory
An Azure service for ingesting, preparing, and transforming data at scale.

Accepted answer
  1. Rishikesh Singh 470 Reputation points
    2023-07-19T14:41:05.4866667+00:00

    Hello @Vieira Dias, Leonor

    Please find the steps below.

    To connect both storage accounts using Azure Data Factory and move files between them, follow these steps:
    
        Set up your Azure Data Factory: If you haven't already, create an Azure Data Factory instance in your Azure subscription. You can do this through the Azure portal by searching for "Data Factory" and following the steps to create a new instance.
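    If you prefer to script this step rather than clicking through the portal, here is a minimal sketch using the azure-mgmt-datafactory Python SDK; the subscription ID, resource group, factory name, and region are placeholders:

```python
# Minimal sketch: create the Data Factory instance programmatically.
# Assumes azure-identity and azure-mgmt-datafactory are installed and the
# caller has rights to create resources in the subscription.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import Factory

subscription_id = "<subscription-id>"  # placeholder
adf = DataFactoryManagementClient(DefaultAzureCredential(), subscription_id)

# The factory only needs to be in the same subscription; it can live in its
# own resource group, separate from A3 and B3.
adf.factories.create_or_update("adf-rg", "move-files-adf", Factory(location="westeurope"))
```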
    
        Create linked services: In your Data Factory instance, create linked services to represent your source (storage account A2 in resource group A3) and destination (storage account B2 in resource group B3) storage accounts. Linked services define the connection information and credentials required to connect to the storage accounts.
            For each storage account, go to the "Author & Monitor" section of your Data Factory instance in the Azure portal.
            Click on "Connections" and then "New Linked Service."
            Select the appropriate storage account type (e.g., Azure Blob Storage) and provide the necessary details, such as the storage account name and access key.
            Repeat this process for both storage accounts.
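    As a scripted alternative to the portal steps above, here is a minimal sketch that registers both linked services with the azure-mgmt-datafactory Python SDK, authenticating with the storage account keys. The account names, keys, and linked service names (SourceBlobLS / SinkBlobLS) are placeholders:

```python
# Minimal sketch: one Azure Blob Storage linked service per storage account.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    AzureBlobStorageLinkedService,
    LinkedServiceResource,
)

adf = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
factory_rg, factory_name = "adf-rg", "move-files-adf"  # placeholders


def blob_linked_service(account_name: str, account_key: str) -> LinkedServiceResource:
    """Build a linked service that authenticates with the storage account key."""
    conn = (
        "DefaultEndpointsProtocol=https;"
        f"AccountName={account_name};AccountKey={account_key};"
        "EndpointSuffix=core.windows.net"
    )
    return LinkedServiceResource(
        properties=AzureBlobStorageLinkedService(connection_string=conn)
    )


# Source: storage account A2 (resource group A3); sink: storage account B2 (resource group B3).
adf.linked_services.create_or_update(
    factory_rg, factory_name, "SourceBlobLS", blob_linked_service("a2account", "<a2-key>")
)
adf.linked_services.create_or_update(
    factory_rg, factory_name, "SinkBlobLS", blob_linked_service("b2account", "<b2-key>")
)
```

    The keys are shown inline only to keep the sketch short; in practice they are better kept in Azure Key Vault and referenced from the linked service.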
    
        Create datasets: In Data Factory, datasets represent the data structures you'll be working with. Create two datasets—one for each storage account—to define the structure and location of the files.
            For each storage account, go to the "Author & Monitor" section of your Data Factory instance in the Azure portal.
            Click on "Author" and then "New dataset."
            Select the appropriate dataset type (e.g., Azure Blob Storage) and configure the dataset properties, such as the storage account, container, and file format.
            Repeat this process for both storage accounts.
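    The same step can be sketched in code. The example below assumes the SourceBlobLS / SinkBlobLS linked services from the previous snippet and uses Binary datasets so the files are copied as-is, regardless of format; all names are placeholders:

```python
# Minimal sketch: one Binary dataset per container.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    AzureBlobStorageLocation,
    BinaryDataset,
    DatasetResource,
    LinkedServiceReference,
)

adf = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
factory_rg, factory_name = "adf-rg", "move-files-adf"  # placeholders


def blob_dataset(linked_service: str, container: str, folder: str = "") -> DatasetResource:
    """A Binary dataset keeps the copy format-agnostic (files are moved as-is)."""
    return DatasetResource(
        properties=BinaryDataset(
            linked_service_name=LinkedServiceReference(
                type="LinkedServiceReference", reference_name=linked_service
            ),
            location=AzureBlobStorageLocation(container=container, folder_path=folder),
        )
    )


# Container A1 in storage account A2, container B1 in storage account B2.
adf.datasets.create_or_update(factory_rg, factory_name, "SourceFiles",
                              blob_dataset("SourceBlobLS", "a1"))
adf.datasets.create_or_update(factory_rg, factory_name, "SinkFiles",
                              blob_dataset("SinkBlobLS", "b1"))
```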
    
        Create a pipeline: Pipelines in ADF define the workflow and activities to be performed. Create a pipeline to move the files from the source storage account to the destination storage account.
            In the Data Factory Authoring UI, click on "Author" and then "New pipeline."
            Give your pipeline a name and add activities to it.
            Add a "Copy Data" activity to the pipeline, specifying the source and destination datasets you created earlier.
            Configure the copy activity settings, such as file filters, mappings, and performance options.
            Save and publish your pipeline.
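    Below is a minimal sketch of the same pipeline built with the azure-mgmt-datafactory SDK. Because the question asks for a move (copy, then delete), it chains a Delete activity after the Copy activity; the dataset and pipeline names are the placeholders from the earlier snippets:

```python
# Minimal sketch: Copy activity followed by a Delete activity on the source dataset.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    ActivityDependency,
    BinarySink,
    BinarySource,
    CopyActivity,
    DatasetReference,
    DeleteActivity,
    PipelineResource,
)

adf = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
factory_rg, factory_name = "adf-rg", "move-files-adf"  # placeholders

source_ref = DatasetReference(type="DatasetReference", reference_name="SourceFiles")
sink_ref = DatasetReference(type="DatasetReference", reference_name="SinkFiles")

copy_files = CopyActivity(
    name="CopyFiles",
    inputs=[source_ref],
    outputs=[sink_ref],
    source=BinarySource(),
    sink=BinarySink(),
)

# Delete the source files only after the copy succeeded.
delete_source = DeleteActivity(
    name="DeleteSourceFiles",
    dataset=source_ref,
    depends_on=[ActivityDependency(activity="CopyFiles", dependency_conditions=["Succeeded"])],
)

adf.pipelines.create_or_update(
    factory_rg, factory_name, "MoveFilesPipeline",
    PipelineResource(activities=[copy_files, delete_source]),
)
```

    The dependency condition means the source files are removed only after the copy reports success, which is what turns the copy into a move.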
    
        Trigger the pipeline: Once your pipeline is published, you can trigger it manually or schedule it to run at specific intervals using triggers. Triggers can be set up to automate the movement of files between the storage accounts.
            In the Data Factory Authoring UI, go to the "Trigger" section.
            Create a new trigger and define its properties, such as the start time, recurrence, and execution settings.
            Associate your pipeline with the trigger.
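    Here is a minimal sketch of both options, a one-off manual run and a daily schedule trigger, again using the azure-mgmt-datafactory SDK with the placeholder names from the previous snippets (begin_start is the method name in recent SDK versions):

```python
# Minimal sketch: run the pipeline once, then attach and start a daily schedule trigger.
from datetime import datetime, timedelta, timezone

from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    PipelineReference,
    ScheduleTrigger,
    ScheduleTriggerRecurrence,
    TriggerPipelineReference,
    TriggerResource,
)

adf = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
factory_rg, factory_name = "adf-rg", "move-files-adf"  # placeholders

# Manual (one-off) run.
run = adf.pipelines.create_run(factory_rg, factory_name, "MoveFilesPipeline")
print("Started run:", run.run_id)

# Daily schedule, starting tomorrow (UTC).
schedule = ScheduleTrigger(
    recurrence=ScheduleTriggerRecurrence(
        frequency="Day",
        interval=1,
        start_time=datetime.now(timezone.utc) + timedelta(days=1),
        time_zone="UTC",
    ),
    pipelines=[
        TriggerPipelineReference(
            pipeline_reference=PipelineReference(
                type="PipelineReference", reference_name="MoveFilesPipeline"
            )
        )
    ],
)
adf.triggers.create_or_update(
    factory_rg, factory_name, "DailyMoveTrigger", TriggerResource(properties=schedule)
)
adf.triggers.begin_start(factory_rg, factory_name, "DailyMoveTrigger").result()
```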
    
    
    

    Please accept the answer if this helped


2 additional answers

  1. ShaikMaheer-MSFT 38,546 Reputation points Microsoft Employee Moderator
    2023-07-20T13:58:03.2+00:00

    Hi Vieira Dias, Leonor,

    Thank you for posting query in Microsoft Q&A Platform.

    You can achieve this using the Copy activity in Azure Data Factory. In the Copy activity, under the Source tab, use a dataset pointing to the files in container A1 in storage account A2 in resource group A3; under the Sink tab, use a dataset pointing to the target location in container B1 in storage account B2 in resource group B3.
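    If you drive the pipeline programmatically, you can also check the outcome of the Copy activity from code. This is a minimal sketch, assuming the placeholder factory and pipeline names used in the accepted answer's snippets; in practice you would poll until the run finishes:

```python
# Minimal sketch: start a run and inspect pipeline / activity run status.
from datetime import datetime, timedelta, timezone

from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import RunFilterParameters

adf = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")
factory_rg, factory_name = "adf-rg", "move-files-adf"  # placeholders

run = adf.pipelines.create_run(factory_rg, factory_name, "MoveFilesPipeline")

# Overall pipeline run status (Queued / InProgress / Succeeded / Failed).
status = adf.pipeline_runs.get(factory_rg, factory_name, run.run_id).status
print("Pipeline run status:", status)

# Per-activity details, e.g. files read and written by the copy step.
now = datetime.now(timezone.utc)
activity_runs = adf.activity_runs.query_by_pipeline_run(
    factory_rg, factory_name, run.run_id,
    RunFilterParameters(last_updated_after=now - timedelta(hours=1),
                        last_updated_before=now + timedelta(hours=1)),
)
for act in activity_runs.value:
    print(act.activity_name, act.status, act.output)
```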

    Please consider checking the below video to understand the Copy activity.

    Copy Data Activity in Azure Data Factory

    If you want to point to files and locations dynamically, then you can consider parameterizing datasets and linked services (see the sketch after the links below).

    Parameterize Datasets in Azure Data Factory

    Parameterize Linked Services in Azure Data Factory
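
    As an illustration, here is a minimal sketch of a parameterized Binary dataset created with the azure-mgmt-datafactory Python SDK; the parameter names (containerName, folderPath) and all resource names are placeholders:

```python
# Minimal sketch: a single dataset that can point at any container/folder at runtime.
from azure.identity import DefaultAzureCredential
from azure.mgmt.datafactory import DataFactoryManagementClient
from azure.mgmt.datafactory.models import (
    AzureBlobStorageLocation,
    BinaryDataset,
    DatasetResource,
    LinkedServiceReference,
    ParameterSpecification,
)

adf = DataFactoryManagementClient(DefaultAzureCredential(), "<subscription-id>")

dataset = DatasetResource(
    properties=BinaryDataset(
        linked_service_name=LinkedServiceReference(
            type="LinkedServiceReference", reference_name="SourceBlobLS"
        ),
        parameters={
            "containerName": ParameterSpecification(type="String"),
            "folderPath": ParameterSpecification(type="String", default_value=""),
        },
        location=AzureBlobStorageLocation(
            # Expression objects are resolved per run from the dataset parameters
            # supplied by the calling pipeline activity.
            container={"value": "@dataset().containerName", "type": "Expression"},
            folder_path={"value": "@dataset().folderPath", "type": "Expression"},
        ),
    )
)
adf.datasets.create_or_update("adf-rg", "move-files-adf", "ParameterizedBlobFiles", dataset)
```

    A Copy activity would then pass concrete values for containerName and folderPath through its DatasetReference parameters.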

    Hope this helps. Please let me know if any further queries.


    Please consider hitting the "Accept Answer" button. Accepted answers help the community as well.


  2. Nandan Hegde 36,146 Reputation points MVP Volunteer Moderator
    2023-07-24T11:48:55.6933333+00:00

    Hey,

    While creating the linked service to Azure Blob Storage in ADF, please use the below configuration:

    An access key will work for both storage accounts, irrespective of the tenant in which the ADF and the storage account are present.

    [Screenshot: Azure Blob Storage linked service configured with account key authentication]

    To get the access key, sign in to the storage account and navigate to "Access keys":

    [Screenshot: the storage account's "Access keys" blade]
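
    Before wiring the key into the linked service, you can sanity-check it with a quick listing of the source container using the azure-storage-blob SDK; the account name, key, and container below are placeholders:

```python
# Minimal sketch: verify the account key works by listing blobs in the source container.
from azure.storage.blob import BlobServiceClient

conn = (
    "DefaultEndpointsProtocol=https;"
    "AccountName=a2account;AccountKey=<a2-key>;"
    "EndpointSuffix=core.windows.net"
)

client = BlobServiceClient.from_connection_string(conn)
for blob in client.get_container_client("a1").list_blobs():
    print(blob.name)
```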

    Then use the Copy activity to copy data from one location to the other.

    Just for reference:

    https://blog.devgenius.io/azure-copy-data-activity-%EF%B8%8F-from-one-blob-container-to-another-96586916efc8

