Load data into storage environments for analytics
The Team Data Science Process requires that data be ingested or loaded into the most appropriate way in each stage. Data destinations can include Azure Blob Storage, SQL Azure databases, SQL Server on Azure VM, HDInsight (Hadoop), Azure Synapse Analytics, and Azure Machine Learning.
The following articles describe how to ingest data into various target environments where the data is stored and processed.
- To/From Azure Blob Storage
- To SQL Server on Azure VM
- To Azure SQL Database
- To Hive tables
- To SQL partitioned tables
- From On-premises SQL Server
Technical and business needs, as well as the initial location, format, and size of your data will determine the best data ingestion plan. It is not uncommon for a best plan to have several steps. This sequence of tasks can include, for example, data exploration, pre-processing, cleaning, down-sampling, and model training. Azure Data Factory is a recommended Azure resource to orchestrate data movement and transformation.
Contributors
This article is maintained by Microsoft. It was originally written by the following contributors.
Principal author:
- Mark Tabladillo | Senior Cloud Solution Architect
To see non-public LinkedIn profiles, sign in to LinkedIn.
Feedback
Submit and view feedback for