Episode

StreamSets on Azure HDInsight

Azure HDInsight is a fully-managed cloud service that makes it easy, fast, and cost-effective to process massive amounts of data. Use the most popular open-source frameworks such as Hadoop, Spark, Hive, LLAP, Kafka, Storm, R & more. Azure HDInsight enables a broad range of scenarios such as ETL, Data Warehousing, Machine Learning, IoT and more.   

StreamSets Data Collector deploys on top of Azure HDInsight application. It provides a full-featured integrated development environment (IDE) that lets you design, test, deploy, and manage any-to-any ingest pipelines that mesh stream and batch data, and include a variety of in-stream transformations - all without having to write custom code. In this video we will learn on how you can install StreamSets, ingest data from multiple sources and monitor your data pipelines

Azure HDInsight application platform: Install solutions built for the Apache Hadoop ecosystem

Install custom HDInsight applications

Try SDC from StreamSets on HDInsight

Streamsets website