Apache Hadoop
The Apache™ Hadoop® project develops open-source software for reliable, scalable, distributed computing.
The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models. It is designed to scale up from single servers to thousands of machines, each offering local computation and storage. Rather than rely on hardware to deliver high-availability, the library itself is designed to detect and handle failures at the application layer, so delivering a highly-available service on top of a cluster of computers, each of which may be prone to failures.
This is a release of Apache Hadoop 3.3 line.
Key changes include
Users of Apache Hadoop 3.3.4 and earlier should upgrade to this release.
All users are encouraged to read the overview of major changes since release 3.3.4.
For details of bug fixes, improvements, and other enhancements since the previous 3.3.4 release, please check release notes and changelog.
The ABFS connector has a critical bug fix https://issues.apache.org/jira/browse/HADOOP-18546: ABFS. Disable purging list of in-progress reads in abfs stream close().
All users of the abfs connector in hadoop releases 3.3.2+ MUST either upgrade
to this release or disable prefetching by setting
fs.azure.readaheadqueue.depth
to 0
.
This is a release of Apache Hadoop 3.3 line.
It contains a small number security and critical integration fixes since 3.3.3.
Users of Apache Hadoop 3.3.3 should upgrade to this release.
Users of hadoop 2.x and hadoop 3.2 should also upgrade to the 3.3.x line. As well as feature enhancements, this is the sole branch currently receiving fixes for anything other than critical security/data integrity issues.
Users are encouraged to read the overview of major changes since release 3.3.3. For details of bug fixes, improvements, and other enhancements since the previous 3.3.3 release, please check release notes and changelog.
This is the third stable release of Apache Hadoop 3.2 line.
It contains 153 bug fixes, improvements and enhancements since 3.2.3.
Users are encouraged to read the overview of major changes since 3.2.3. For details of 153 bug fixes, improvements, and other enhancements since the previous 3.2.3 release, please check release notes and changelog.
This is the second stable release of Apache Hadoop 2.10 line.
It contains 211 bug fixes, improvements and enhancements since 2.10.1.
Users are encouraged to read the overview of major changes since 2.10.1. For details of 211 bug fixes, improvements, and other enhancements since the previous 2.10.1 release, please check release notes and changelog.
This is the third stable release of the Apache Hadoop 3.3 line.
It contains 23 bug fixes, improvements and enhancements since 3.3.2.
This is primarily a security update; for this reason, upgrading is strongly advised.
Users are encouraged to read the overview of major changes since 3.3.2. For details of bug fixes, improvements, and other enhancements since the previous 3.3.2 release, please check release notes and changelog.
The project includes these modules:
A wide variety of companies and organizations use Hadoop for both research and production. Users are encouraged to add themselves to the Hadoop PoweredBy wiki page.
Other Hadoop-related projects at Apache include: