Apache Hadoop 2.10.0
Apache Hadoop 2.10.0 is a minor release in the 2.x.y release line, building upon the previous stable release 2.9.2.
Here is a short overview of the major features and improvements.
-
HDFS
- Consistent reads from standby node. This allows better load balancing on HDFS clusters, especially those with a high read/write operation ratio. See HDFS-12943 and HDFS-14204 for more details.
- Improvements related to rolling upgrade support from 2.x to 3.x.
- NameNode port based selective encryption. This allows enforcing different security constraints for different types of traffic. See HDFS-13541 for more details.
- Cost-based fair call queue. This improves fairness and stability in a large, multi-tenant HDFS cluster. See HDFS-14403 for more details.
-
YARN
- Native support for GPUs as a schedulable resource type. This builds on the above, and was also ported from the 3.x.y release line. See YARN-6223 and YARN-8200 for more information.
Getting Started
The Hadoop documentation includes the information you need to get started using Hadoop. Begin with the Single Node Setup which shows you how to set up a single-node Hadoop installation. Then move on to the Cluster Setup to learn how to set up a multi-node Hadoop installation.