What is Apache Hadoop Ozone?

Ozone is a new subproject of Apache Hadoop. It provides an object store semantic for Hadoop.

It uses Hadoop Distributed Data Storage (HDDS) for storage layer. HDDS is another new subporoject of Apache Hadoop.

Scalable

Ozone is designed to scale to tens of billions of files and blocks and, in the future, even more.

Small files or huge number of datanodes are no longer a limitation.

Consistent

Storage Layer uses RAFT protocol for consistentency.

Cloud-native

Hadoop Ozone is designed to work well in containerized environments like YARN and Kubernetes.

Questions and Answers

How can I contribute

The contribution process is the same as for any other Hadoop subproject. The Ozone Contribution Guidelines is available from the wiki.

Does it mean a new Hadoop server component?

Both Ozone and HDDS requires one additional master component - Ozone Manager and Storage Container Manger, respectively. The worker parts of Ozone/HDDS can be started as a HDFS Datanode plugin or standalone.

How can I try it out?

There is a katacoda example which can be used to try out Ozone without download.

You can also download it and try it out