Paige's blog
SSH tunnel to endpoints in Azure VNet from Windows
When you deploy virtual machines on Azure, a good practice is to set up Azure Network Security...
Date: 01/17/2017
Backup and Restore Cassandra on Azure
When you run Cassandra on virtual machines on Azure, one way to back up and restore data is to rsync...
Date: 10/07/2016
Backup Cloudera data to Azure Storage
Azure Blob Storage supports an HDFS interface which can be accessed by HDFS clients using the syntax...
Date: 06/19/2016
Run Jupyter Notebook on Cloudera
In a previous blog, we demonstrated how to enable Hue Spark notebook with Livy on CDH. Here we will...
Date: 06/19/2016
Enable Kerberos on Cloudera with Azure AD Domain Service
In this previous blog series we documented how to integrate Active Directory deployed in virtual...
Date: 06/18/2016
Run Hue Spark Notebook on Cloudera
When you deploy a CDH cluster using Cloudera Manager, you can use Hue web UI to run, for example,...
Date: 06/18/2016
Integrating Cloudera cluster with Active Directory (Part 3/3)
In Part 1 and Part 2 of this blog, we covered the first 5 steps; here we will describe the remaining...
Date: 01/02/2016
Integrating Cloudera cluster with Active Directory (Part 2/3)
In Part 1 of this blog, we covered the first 4 steps; here we will describe how to join the Linux...
Date: 01/02/2016
Integrating Cloudera cluster with Active Directory (Part 1/3)
[Update 8/2017: With Cloudera Director support on Azure, you can now automate this whole process of...
Date: 01/02/2016
Real Time Analytics with Azure Event Hubs, Cloudera, and Azure SQL
In this blog post, I will demonstrate how to ingest data from Azure Event Hubs to Spark Streaming...
Date: 11/18/2015
Connect Cloudera to Azure ML Hive Reader
Azure Machine Learning supports Hive as a data source using WebHCat API. In this post, I will show...
Date: 07/03/2015