Avkash Chauhan's Blog
Windows Azure, Windows 8, Cloud Computing, Big Data and Hadoop: All together at one place.. One problem, One solution at One time...
Windows Azure CDN and Referrer Header
The Windows Azure Azure CDN, like any other CDNs, attempts to be a transparent caching layer. The...
Author: Avkash Chauhan - MSFT Date: 02/27/2012
What is ignoreRoleInstanceStatus setting in Windows Azure?
ignoreRoleInstanceStatus is described in WebRole and WorkerRole Schema as below...
Author: Avkash Chauhan - MSFT Date: 02/23/2012
Windows Azure Blob Upload Scenarios
Windows Azure Blob storage API provided following upload scenarios to upload a blob: Scenario [1]:...
Author: Avkash Chauhan - MSFT Date: 02/21/2012
Keys to understand relationship between MapReduce and HDFS
Map Task (HDFS data localization): The unit of input for a map task is an HDFS data block of the...
Author: Avkash Chauhan - MSFT Date: 02/15/2012
Solving SSL Certificate expiration problem with an existing Windows Azure Application
Recently I was working on an issue where the SSL certificate was expired and due to it, the user...
Author: Avkash Chauhan - MSFT Date: 02/14/2012
Hadoop Performance: How storage disk types in individual node will impact the job performance?
As you may have already know that Hadoop Cluster is network and disk, IO intensive. Recently I was...
Author: Avkash Chauhan - MSFT Date: 02/14/2012
How many copies of your blob is stored in Windows Azure Blob Storage?
I was recently asked about if someone store their content at Windows Azure Storage, how secure is it...
Author: Avkash Chauhan - MSFT Date: 02/09/2012
Internals of Hadoop Pig Operators as MapReduce Job
I was recently asked to show that Pig scripts are actually MapReduce jobs so to explain it in very...
Author: Avkash Chauhan - MSFT Date: 02/08/2012
Which one to choose between Pig and Hive?
Technically they both will do the job, you are looking from "either hive or Pig" perspective, means...
Author: Avkash Chauhan - MSFT Date: 02/07/2012
Handling two known issues with Windows Azure node.js SDK 0.5.2
My recent development work with Windows Azure Node.js SDK 0.5.2 helped me to found two issues. Until...
Author: Avkash Chauhan - MSFT Date: 01/31/2012
NougakuDoCompanion: A “Ruby on Rails” companion for Windows Azure
I was recently informed about a great “Ruby on Rails” companion package to run your Ruby...
Author: Avkash Chauhan - MSFT Date: 01/31/2012
Customizing your Hadoop cluster running on your own Windows Azure Subscription
In this article we will learn how to configure the same Hadoop cluster to do some customization. To...
Author: Avkash Chauhan - MSFT Date: 01/29/2012
Creating your own Hadoop cluster on Windows Azure by using your own Windows Azure Subscription account
[As of now this functionality is not available with Hadoop on Windows Azure. These instruction are...
Author: Avkash Chauhan - MSFT Date: 01/28/2012
Windows Azure Troubleshooting - Taking specific Windows Azure Instance offline
When you have lots of instances running on Windows Azure, you may need to investigate issues on a...
Author: Avkash Chauhan - MSFT Date: 01/27/2012
Setting Amazon S3 Storage as data source (s3n://) in Hadoop on Azure (hadooponazure.com) portal
To get your Amazon S3 account setup with Apache Hadoop cluster on Windows Azure you just need you...
Author: Avkash Chauhan - MSFT Date: 01/27/2012
Real World Windows Azure Guidance: Troubleshooting Best Practices for Developing Windows Azure Applications
One of my teammate William Bellamy has written a great article “Troubleshooting Best Practices...
Author: Avkash Chauhan - MSFT Date: 01/26/2012
Node.js and Windows Azure: Creating a blog application using Node.JS and Windows Azure Table & Blob Storage Part 1
In this example I will create a node.js based blob application which will storage all the blog...
Author: Avkash Chauhan - MSFT Date: 01/25/2012
Understanding Map/Reduce job in Apache Hadoop on Windows Azure (A Reverse Approach)
When you run Map/Reduce job in Hadoop cluster on Windows Azure you will get an aggregated progress...
Author: Avkash Chauhan - MSFT Date: 01/20/2012
Windows Azure application VM and (virtual) IP Address
Time to time, I get involved with our Windows Azure partners to discussion IP address configuration...
Author: Avkash Chauhan - MSFT Date: 01/18/2012
Setting Windows Azure Blob Storage (asv) as data source directly from Portal at Hadoop on Azure
After you log into your Hadoop Portal and configured your cluster, you can select “Manage...
Author: Avkash Chauhan - MSFT Date: 01/13/2012
Using Symbolic links with local storage to store large amount of data in Windows Azure Application Drive
By default the application drive in Windows Azure VM has maximum size limitation of 1GB with any...
Author: Avkash Chauhan - MSFT Date: 01/13/2012
With Azure SDK 1.6, Azure Diagnostics is enabled by default can cause thousands of daily transections to Azure Storage
In Windows Azure VM diagnostics is enabled by default so basic log can be collected and transferred...
Author: Avkash Chauhan - MSFT Date: 01/11/2012
Running Apache Pig (Pig Latin) at Apache Hadoop on Windows Azure
Microsoft Distribution of Apache Hadoop comes with Pig Support along with an Interactive JavaScript...
Author: Avkash Chauhan - MSFT Date: 01/10/2012
Accessing Windows Azure blob Storage as Network Drive
If you want to access Windows Azure Blob Storage same as a drive on your development machine you...
Author: Avkash Chauhan - MSFT Date: 01/10/2012
Apache Hadoop on Windows Azure: Few tips and tricks to manage your Hadoop cluster in Windows Azure
In Hadoop cluster, namenode communicate with all the other nodes. Apache Hadoop on Windows Azure...
Author: Avkash Chauhan - MSFT Date: 01/06/2012
Apache Hadoop on Windows Azure: Connecting to Windows Azure Storage from Hadoop Cluster
Microsoft distribution to Apache Hadoop comes by direct connectivity to cloud storage i.e. Windows...
Author: Avkash Chauhan - MSFT Date: 01/05/2012
Apache Hadoop on Windows Azure: How Hadoop cluster was setup on Windows Azure
Once your provide following information to setup your Hadoop cluster in Azure: Cluster DNS Name Type...
Author: Avkash Chauhan - MSFT Date: 01/04/2012
Assess your SQL Server to SQL Azure migration using SQL Azure Compatibility Assessment Tool by SQL Azure Labs
SQL Azure team announced today about the release of a new experimental cloud service, "SQL Azure...
Author: Avkash Chauhan - MSFT Date: 01/04/2012
Apache Hadoop on Windows Azure Part 10 - Running a JavaScript Map/Reduce Job from Interactive JavaScript Console
Microsoft distribution of Apache Hadoop on Windows Azure, let you run JavaScript Map/Reduce jobs...
Author: Avkash Chauhan - MSFT Date: 01/03/2012
Apache Hadoop on Windows Azure Part 9 – Using Interactive JavaScript for Data Visualization
Apache Hadoop on Windows Azure is integrated with a web-based interactive JavaScript console, which...
Author: Avkash Chauhan - MSFT Date: 01/02/2012
Apache Hadoop on Windows Azure Part 8 – Hadoop Map/Reduce Administration from command line in Cluster
After you created your Hadoop cluster in Windows Azure, you can remote into it to start the...
Author: Avkash Chauhan - MSFT Date: 01/01/2012
Wish you all a very Happy New Year 2012!!
Dear Readers, Thank you so much for all your support and comments!! You are the inspiration behind...
Author: Avkash Chauhan - MSFT Date: 01/01/2012
Apache Hadoop on Windows Azure Part 7 – Writing your very own WordCount Hadoop Job in Java and deploying to Windows Azure Cluster
In this article, I will help you writing your own WordCount Hadoop Job and then deploy it to Windows...
Author: Avkash Chauhan - MSFT Date: 12/31/2011
Apache Hadoop on Windows Azure Part 6 - Running 10GB Sort Hadoop Job with TeraSort Option and understanding MapReduce Job administration
In this section we will run the same 10GB sorting Hadoop job with TERASORT option. With TeraSort...
Author: Avkash Chauhan - MSFT Date: 12/30/2011
Apache Hadoop on Windows Azure Part 5 - Running 10GB Sort Hadoop Job with Teragen, TeraSort and TeraValidate Options
This example consists of the 3 map/reduce applications that Owen O'Malley and Arun Murthy used win...
Author: Avkash Chauhan - MSFT Date: 12/30/2011
Apache Hadoop on Windows Azure Part 4- Remote Login to Hadoop node for MapReduce Job and HDFS administration
When you are running Apache Hadoop job in Windows Azure, you have ability to remote into the main...
Author: Avkash Chauhan - MSFT Date: 12/29/2011
Apache Hadoop on Windows Azure Part 3 - Creating a Word Count Hadoop Job with a few twists
In this example I am starting a new Hadoop Job with few intentional errors to understand the...
Author: Avkash Chauhan - MSFT Date: 12/29/2011
Apache Hadoop on Windows Azure Part 2 - Creating a Pi Estimator Hadoop Job
Once you have created a cluster in Windows Azure, you will have a few prebuilt samples provided in...
Author: Avkash Chauhan - MSFT Date: 12/29/2011
Apache Hadoop on Windows Azure Part 1- Creating a new Windows Azure Cluster for Hadoop Job
Once you have applied for Apache Hadoop on Windows Azure CTP account you can create a new cluster...
Author: Avkash Chauhan - MSFT Date: 12/28/2011
Top 12 Articles on Cloud Services and Big Data on Windows Azure in December
Windows Azure Cloud Services Newly Designed Windows Azure Developer Center Article Tutorial: Running...
Author: Avkash Chauhan - MSFT Date: 12/28/2011
How to Modify Registry keys in Windows Azure Virtual Machine from a web or worker role?
If you have a requirement and decided to modify VM registry keys, you have two options: Do it from a...
Author: Avkash Chauhan - MSFT Date: 12/23/2011
Windows Azure: Hands on Lab for Moving Applications to the Cloud
Windows Azure team created a detailed hands on lab to help everyone who wants to move their...
Author: Avkash Chauhan - MSFT Date: 12/19/2011
Windows Azure Resource: A Guide to Claims-Based Identity and Access Control, Second Edition - eBook Download
Map of the book: What is this book about: An Introduction to Claims explains what a claim is and...
Author: Avkash Chauhan - MSFT Date: 12/19/2011
Resources to write .Net based MapReduce jobs for Hadoop using F#
What is Hadoop Streaming: Start learning about Hadoop Streaming:...
Author: Avkash Chauhan - MSFT Date: 12/18/2011
Windows Azure Plugin for Eclipse with Java, December 2011 CTP is now available
Windows Azure Plugin for Eclipse with Java, December 2011 CTP is now available for public download...
Author: Avkash Chauhan - MSFT Date: 12/18/2011