Welcome to Hadoop!
What Is Hadoop?
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing, including:
- Hadoop Core, our flagship sub-project, provides a distributed filesystem (HDFS) and support for the MapReduce distributed computing metaphor.
- HBase builds on Hadoop Core to provide a scalable, distributed database.
- ZooKeeper is a highly available and reliable coordination system. Distributed applications use ZooKeeper to store and mediate updates for critical shared state.
Who uses Hadoop?
A wide variety of companies and organizations use Hadoop for both research and production. Users are encouraged to add themselves to the Hadoop users wiki page.
News
2 July 2008 - Hadoop Wins Terabyte Sort Benchmark
One of Yahoo's Hadoop clusters sorted 1 terabyte of data in 209 seconds, beating the previous record of 297 seconds in the annual general-purpose (Daytona) terabyte sort benchmark. This is the first time that either a Java or an open-source program has won.
23 January 2008 - Hadoop at ApacheCon Europe
Hadoop will be well represented at ApacheCon Europe in Amsterdam this year. Please join us at one or more of the following sessions:
- April 10: Hadoop BOF
- April 11: A tour of Apache Hadoop by Tom White
- April 11: Programming with Hadoop's MapReduce by Owen O'Malley
- April 11: Deploying Grid Services using Apache Hadoop by Allen Wittenauer
23 January 2008 - Hadoop promoted to Top Level Project
The Lucene PMC and the ASF Board have voted to promote Hadoop to a top-level project. Hadoop can now be found at http://hadoop.apache.org/



