Apache Hadoop Ozone

Ozone is a scalable, distributed object store for Hadoop. Applications like Apache Spark, Hive and YARN, can run against Ozone without any modifications. Ozone comes with a Java client library, a S3 and a command line interface which makes it easy to use Ozone.

Ozone consists of volumes, buckets, and Keys:

  • Volumes are similar to user accounts. Only administrators can create or delete volumes.
  • Buckets are similar to directories. A bucket can contain any number of keys, but buckets cannot contain other buckets.
  • Keys are similar to files. A bucket can contain any number of keys.