Scenario:
Azure and Google Cloud Platform (GCP) are connected through a VPN, and there is a requirement to copy data from Google Cloud Storage (GCS) to Azure Blob Storage in batch mode. One approach being considered is using Azure Data Factory (ADF) for the data transfer.
Question:
While setting up a new linked service in Azure Data Factory to connect to Google Cloud Storage, the configuration prompts to use the Google Cloud Storage API, which typically routes traffic over the public internet. How can we ensure that ADF copies data from GCS to Azure Blob Storage over the private network (VPN or Interconnect) instead of the public internet? Is there a way to configure ADF to utilize the private connection for this regular data transfer?
You can use a self-hosted integration runtime (SHIR) in the linked service instead of the Azure IR.
https://learn.microsoft.com/en-us/azure/data-factory/concepts-integration-runtime
Hi @Nikunj Patel
Thanks for the question and for using the Microsoft Q&A platform.
The self-hosted IR (Integration Runtime) does not need to be deployed in either GCP or Azure. It can be deployed on a machine in your on-premises network that has a private connection (VPN or ExpressRoute) to Azure.
The self-hosted IR acts as a bridge between Azure Data Factory and your on-premises data sources. It securely transfers data between your on-premises storage and Azure storage without going over the public internet.
To route the copy through the self-hosted IR, create a Google Cloud Storage linked service in Azure Data Factory and specify the self-hosted IR as the integration runtime for that linked service. When the data transfer activity is triggered in Azure Data Factory, it runs on the self-hosted IR machine in your on-premises network. The self-hosted IR then uses that machine's private connection (VPN or ExpressRoute) to connect to Google Cloud Storage and transfer the data.
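As a rough illustration of the linked service configuration described above, here is a minimal Python sketch that builds the JSON definition with a `connectVia` reference to a self-hosted IR. The linked service name, the IR name `MySelfHostedIR`, and the helper function are hypothetical placeholders, and the credentials are left as placeholders too. Note that this definition only selects which integration runtime runs the copy; whether traffic to the `serviceUrl` actually stays on the private connection depends on the DNS and routing of the SHIR machine, which is assumed here and not controlled by this JSON.

```python
import json


def gcs_linked_service(name: str, shir_name: str) -> dict:
    """Build a Google Cloud Storage linked service definition bound to a self-hosted IR.

    `name` and `shir_name` are illustrative; use the names from your own factory.
    """
    return {
        "name": name,
        "properties": {
            "type": "GoogleCloudStorage",
            "typeProperties": {
                # Placeholders: supply your own HMAC key pair for the GCS service account.
                "accessKeyId": "<access key id>",
                "secretAccessKey": {
                    "type": "SecureString",
                    "value": "<secret access key>",
                },
                "serviceUrl": "https://storage.googleapis.com",
            },
            # connectVia is what points the linked service at the self-hosted IR
            # instead of the default Azure IR.
            "connectVia": {
                "referenceName": shir_name,
                "type": "IntegrationRuntimeReference",
            },
        },
    }


# Hypothetical names, for illustration only.
definition = gcs_linked_service("GoogleCloudStorageLinkedService", "MySelfHostedIR")
print(json.dumps(definition, indent=2))
```

The printed JSON can be pasted into the linked service's code view in ADF Studio, after replacing the placeholders with real values.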
For more details refer to this: https://learn.microsoft.com/en-us/azure/data-factory/connector-google-cloud-storage?tabs=data-factory#copy-data-from-google-cloud-storage-using-azure-data-factory-or-synapse-analytics
Hope this helps. Do let us know if you have any further queries.