Case studies: Distributed file systems

Beginner
Developer
Student
Azure

Discover how distributed file systems work, then learn about Hadoop and Ceph.

Learning objectives

In this module, you will:

  • Review the design goals and architectural characteristics of Hadoop distributed file system (HDFS)
  • Review the design goals and architectural characteristics of the Ceph file system (Ceph FS)
  • Compare and contrast HDFS and the Ceph file system

In partnership with Dr. Majd Sakr and Carnegie Mellon University.

Prerequisites

  • Understand what cloud computing is, including cloud service models, and common cloud providers
  • Know the technologies that enable cloud computing
  • Understand how cloud service providers pay for and bill for the cloud
  • Know what datacenters are and why they exist
  • Know how datacenters are set up, powered, and provisioned
  • Understand how cloud resources are provisioned and metered
  • Be familiar with the concept of virtualization
  • Know what the different types of virtualization are
  • Understand CPU virtualization
  • Understand memory virtualization
  • Understand I/O virtualization
  • Know about the different types of data and how they are stored