Improve the write and read performance of HBase clusters


You can improve the read and write performance of HDInsight HBase by using the Accelerated Writes feature. Let's look at how performance differs with and without Accelerated Writes.

Without Accelerated Writes Enabled

The Accelerated Writes feature in Azure HDInsight attaches a premium SSD managed disk to every HBase region server (worker node) during cluster deployment. Write-ahead logs (WALs) are then written to the Hadoop Distributed File System (HDFS) mounted on the premium SSD managed disks instead of to Azure Page Blobs.

Premium SSD managed disks offer far better write performance than Page Blobs and improve cluster I/O performance while retaining fault tolerance.

Accelerated Writes feature within HBase.

Customer requirements for low-latency writes and reads keep evolving. While the separation of compute and storage in HDInsight HBase offers agility, scalability, reliability, and economies of scale, it also adds network latency to write and read operations. This latency might be acceptable for most customers, but those who need ultra-low-latency writes and reads can take advantage of the HDInsight HBase Accelerated Writes feature.

With Accelerated Writes Enabled

Read performance

By default, HDInsight HBase provisions a storage account of type Storage (general purpose v1). To improve read performance, you can use Premium Block Blob Storage instead.

Create a Premium Block Blob Storage account before the cluster creation process.

Create a storage account.
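If you prefer to script this step rather than use the portal, the account can also be created with the Azure CLI. The following is a minimal sketch, assuming the Azure CLI (`az`) is installed and signed in. The helper function name, account name, resource group, and region are hypothetical placeholders, not values from this module. The sketch only builds and prints the command. Running it is left as a commented-out step.

```python
import shlex


def premium_block_blob_account_cmd(name, resource_group, location):
    """Return Azure CLI arguments for creating a Premium Block Blob account.

    Hypothetical helper for illustration; all three arguments are placeholders
    that you supply yourself.
    """
    return [
        "az", "storage", "account", "create",
        "--name", name,
        "--resource-group", resource_group,
        "--location", location,
        "--sku", "Premium_LRS",        # premium performance tier
        "--kind", "BlockBlobStorage",  # block blob account kind
    ]


if __name__ == "__main__":
    # Placeholder values; substitute your own before running.
    cmd = premium_block_blob_account_cmd("mypremiumstore", "my-hbase-rg", "eastus")
    print(shlex.join(cmd))
    # To create the account for real (requires a signed-in Azure CLI):
    # import subprocess
    # subprocess.run(cmd, check=True)
```

Building the argument list separately from executing it makes it easy to review the exact command before anything is created in your subscription.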

During cluster creation, assign the Premium Block Blob Storage account as the primary storage account for the cluster, as shown below. Premium Block Blob accounts appear as an option only if Enable Accelerated Writes is selected.

Mapping the storage account in an HDInsight cluster.

To learn best practices around HDInsight HBase performance management, go to the Troubleshooting HBase performance issues page.

Write performance

In the standard HDInsight HBase offering, HBase write-ahead logs (WALs) are stored in Azure Page Blobs. This works well for most business requirements, but for mission-critical applications that require consistently low latency and high-throughput I/O, inconsistent latencies in WALs backed by Page Blobs can become a bottleneck for HBase write performance.

HBase Write Ahead Logs (WAL) stored in Azure Page Blobs.
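To see why the storage behind the WAL limits write latency, consider the following toy model of write-ahead logging. It is a simplified sketch, not HBase's actual implementation: the class `ToyRegionServer`, its methods, and the tab-separated log format are all hypothetical. The point it illustrates is that each write is acknowledged only after the log entry reaches durable storage, so the log's latency sits on the critical path of every write.

```python
import os
import tempfile


class ToyRegionServer:
    """Toy model of write-ahead logging; NOT HBase's real implementation."""

    def __init__(self, wal_path):
        self.wal_path = wal_path
        self.memstore = {}  # in-memory state, lost on a crash
        self.wal = open(wal_path, "a", encoding="utf-8")

    def put(self, row, value):
        # 1. Append the edit to the WAL and force it to durable storage.
        #    The latency of this step is set by the disk behind the WAL.
        self.wal.write(f"{row}\t{value}\n")
        self.wal.flush()
        os.fsync(self.wal.fileno())
        # 2. Only then apply the edit in memory and acknowledge the write.
        self.memstore[row] = value

    def close(self):
        self.wal.close()

    @classmethod
    def recover(cls, wal_path):
        """Rebuild in-memory state by replaying the WAL after a crash."""
        server = cls(wal_path)
        with open(wal_path, encoding="utf-8") as log:
            for line in log:
                row, value = line.rstrip("\n").split("\t", 1)
                server.memstore[row] = value
        return server


def demo():
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "wal.log")
        server = ToyRegionServer(path)
        server.put("row1", "a")
        server.put("row2", "b")
        server.put("row1", "c")
        server.close()  # simulate a crash: the memstore is discarded
        recovered = ToyRegionServer.recover(path)
        state = dict(recovered.memstore)
        recovered.close()
        return state


if __name__ == "__main__":
    print(demo())
```

Because step 1 must finish before step 2, a slow or inconsistent log device delays every `put`, which is the bottleneck that moving the WAL to premium SSD managed disks is meant to relieve.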