Carl's Blog
Carl Nolan's ramblings on development and data processing
Pushing Hadoop Cluster Configuration Changes using PowerShell
In my previous post I talked about Implementing and Deploying Rack Awareness using PowerShell....
Date: 04/02/2014
Deploying Hadoop Rack Awareness with PowerShell
In a previous post I talked about Implementing Hadoop Rack Awareness with PowerShell. One thing I...
Date: 03/19/2014
Implementing Hadoop Rack Awareness with PowerShell
This post walks through building a PowerShell script for enabling Rack Awareness in Hadoop. While...
Date: 03/14/2014
Managing Your HDInsight Cluster using PowerShell – Update
Since writing my last post Managing Your HDInsight Cluster and .Net Job Submissions using...
Date: 12/16/2013
Managing Your HDInsight Cluster and .Net Job Submissions using PowerShell
This post explains how best to manage an HDInsight cluster using a management console and Windows...
Date: 12/02/2013
Implementing LOB Storage in Memory Optimized Tables
Memory optimized tables do not have off-row or large object (LOB) storage, and the row size is...
Date: 09/16/2013
Managing Your HDInsight Cluster with PowerShell
An updated version of this post can be found here. This blog post provides a mechanism for managing...
Date: 06/07/2013
Hadoop .Net HDFS File Access
Provided with the Microsoft Distribution of Hadoop, HDInsight, is a C library for HDFS file access....
Date: 02/08/2013
Submitting Hadoop MapReduce Jobs using PowerShell
As always here is a link to the “Generics based Framework for .Net Hadoop MapReduce Job Submission”...
Date: 02/05/2013
Try F# 3.0 Released
From the F# Team Blog: If you're familiar with the site Try F#, which lets you run F# right in your...
Date: 01/23/2013
Hive and XML File Processing
When I put together the “Generics based Framework for .Net Hadoop MapReduce Job Submission” code one...
Date: 12/13/2012
Co-occurrence Approach to an Item Based Recommender Update
In a previous post I talked about a Co-occurrence Approach to an Item Based Recommender, that...
Date: 12/05/2012
Implementing a MapReduce Join with Hadoop and the .Net Framework
I have often been asked how one implements a Join whilst writing MapReduce code. As such, I...
Date: 11/13/2012
Framework for .Net Hadoop MapReduce Job Submission V1.0 Release
It has been a few months since I have made a change to the “Generics based Framework for .Net...
Date: 10/10/2012
Execution Time Based Heuristic Custom Task Scheduler
If you follow the samples for Parallel Programming with the .Net Framework, you may have come across...
Date: 10/02/2012
Framework for .Net Hadoop MapReduce Job Submission TextOutput Type
Some recent changes made to the “Generics based Framework for .Net Hadoop MapReduce Job Submission”...
Date: 07/10/2012
C# MapReduce Based Co-occurrence Item Based Recommender
As promised, to conclude the Co-occurrence Approach to an Item Based Recommender posts I wanted to...
Date: 07/09/2012
MapReduce Based Co-occurrence Approach to an Item Based Recommender
In a previous post I covered the basics for a Co-occurrence Approach to an Item Based Recommender....
Date: 07/07/2012
Framework for .Net Hadoop MapReduce Job Submission Json Serialization
A while back one of the changes made to the “Generics based Framework for .Net Hadoop...
Date: 07/01/2012
Framework for .Net Hadoop MapReduce Job Submission configuration update
To better support configuring the Stream environment whilst running .Net Streaming jobs I have made...
Date: 06/25/2012
Co-occurrence Approach to an Item Based Recommender
For a while I thought I would tackle the problem of creating an item-based recommender. Firstly I...
Date: 06/23/2012
Framework for .Net Hadoop MapReduce Job Submission Binary Output
To end the week I decided to make a minor change to the “Generics based Framework for .Net...
Date: 06/15/2012
Framework for .Net Hadoop MapReduce Job Submission libjars update
If you have been using the “Generics based Framework for .Net Hadoop MapReduce Job...
Date: 06/11/2012
Hadoop .Net HDFS File Access (Revisited Archived)
Updated post can be found here:...
Date: 06/02/2012
.Net Implementation of a Priority Queue (aka Heap)
I thought I would take a break for a while from Hadoop and put together an F# .Net implementation of...
Date: 05/03/2012
Generics based Framework for .Net Hadoop MapReduce Job Submission
Over the past month I have been working on a framework to allow composition and submission of...
Date: 04/29/2012
.Net Hadoop MapReduce Job Framework - Revisited (Archived)
An updated version of this post can be found at:...
Date: 04/25/2012
Framework for Composing and Submitting .Net Hadoop MapReduce Jobs (Archived)
An updated version of this post can be found at:...
Date: 04/10/2012
Hadoop .Net HDFS File Access (Archived)
Updated post can be found here:...
Date: 02/29/2012
Hadoop Streaming in F# and MapReduce (summary)
With all my recent posts around Hadoop Streaming I thought it would be useful to summarize them into...
Date: 02/22/2012
FSharpChart for Valentines Day
As I have not posted anything about FSharpChart for a while I thought I would do a quick post, one...
Date: 02/14/2012
Hadoop XML Streaming and F# MapReduce
So, to round out the Hadoop Streaming samples I thought I would put together an XML Streaming...
Date: 01/21/2012
Hadoop Streaming and Windows Azure Blob Storage
One of the cool features of the Microsoft Distribution of Hadoop (MDH) is the native support for...
Date: 01/07/2012
Hadoop Streaming and Reporting
If, like me, you are a .Net developer and have written some Streaming jobs, it is not immediately...
Date: 01/06/2012
A lazy evaluation of F# Seq.groupBy for sorted sequences
In doing some recent work with Hadoop I needed to process a sequence which was grouped by a...
Date: 01/02/2012
Hadoop Binary Streaming and PDF File Inclusion
In a previous post I talked about Hadoop Binary Streaming for the processing of Microsoft Office...
Date: 01/01/2012
Hadoop Binary Streaming and F# MapReduce
As mentioned in my previous post Hadoop Streaming not only supports text streaming, but it also...
Date: 12/30/2011
MapReduce Tester: A Quick Word
In my previous post I talked a little about testing the Hadoop Streaming F# MapReduce code; but it...
Date: 12/29/2011
Using Hadoop on Azure JS Console for Data Visualizations
In my last post I demonstrated an F# MapReduce program based on Hadoop Streaming. One thing that...
Date: 12/18/2011
Hadoop Streaming and F# MapReduce
And now for something completely different. As you may know Microsoft has recently announced plans...
Date: 12/16/2011
F# Project and Item Templates (revised)
Over the past few months I have posted up a few Project and Item Templates for F#. Here is a quick...
Date: 12/11/2011
F# WPF Project Template
This template is for a basic F# Windows Presentation Foundation (WPF) project. This template can be...
Date: 12/11/2011
SQL Server Query Performance Analysis using DMVs
Continuing the TSQL theme I thought it would be worthwhile sharing some TSQL scripts that I have...
Date: 11/23/2011
Adventures in TSQL: Using CTEs when performing DELETE/UPDATE operations
It seems in quite a few applications of late there has been the requirement to batch process, in a...
Date: 11/15/2011
Adventures in TSQL: Adding date and time values
With the addition of the SQL Server date and time types, I have often found myself needing to create...
Date: 11/15/2011
Adventures in TSQL: Comma separated string from column values
It seems that several times now I have had the requirement to create a comma separated string from...
Date: 11/15/2011
Creating a Partitioned View in the BAM Archiving Database
When you run the BAM data maintenance package (BAM_DM_<activity name>) BAM copies each...
Date: 11/15/2011
F# Dynamic Lookup Operator and Regex
In a recent post I talked about using String.forall for string matching over the use of Regex:...
Date: 11/11/2011
F# Parallel Execution and Future Tasks
In a previous post I talked about some options around performing parallel executions by providing...
Date: 11/07/2011