Apache Hadoop Changelog

Release 0.2.0 - 2006-05-05

NEW FEATURES:

JIRA Summary Priority Component Reporter Contributor
HADOOP-51 per-file replication counts Major . Doug Cutting Konstantin Shvachko
HADOOP-148 add a failure count to task trackers Major . Owen O’Malley Owen O’Malley
HADOOP-132 An API for reporting performance metrics Major . David Bowen
HADOOP-189 Add job jar lib, classes, etc. to CLASSPATH when in standalone mode Major . stack Doug Cutting
HADOOP-65 add a record I/O framework to hadoop Minor io, ipc Sameer Paranjpye
HADOOP-191 add hadoopStreaming to src/contrib Major . Michel Tourn Doug Cutting

IMPROVEMENTS:

JIRA Summary Priority Component Reporter Contributor
HADOOP-116 cleaning up /tmp/hadoop/mapred/system Major . raghavendra prabhu Doug Cutting
HADOOP-131 Separate start/stop-dfs.sh and start/stop-mapred.sh scripts Minor . Chris A. Mattmann Doug Cutting
HADOOP-129 FileSystem should not name files with java.io.File Major fs Doug Cutting Doug Cutting
HADOOP-144 the dfs client id isn’t relatable to the map/reduce task ids Major . Owen O’Malley Owen O’Malley
HADOOP-114 Non-informative error message Trivial . Rod Taylor Doug Cutting
HADOOP-142 failed tasks should be rescheduled on different hosts after other jobs Major . Owen O’Malley Owen O’Malley
HADOOP-138 stop all tasks Trivial . Stefan Groschupf Doug Cutting
HADOOP-150 tip and task names should reflect the job name Major . Owen O’Malley Owen O’Malley
HADOOP-166 IPC is unable to invoke methods that use interfaces as parameter Minor ipc Stefan Groschupf Doug Cutting
HADOOP-170 setReplication and related bug fixes Major fs Konstantin Shvachko Konstantin Shvachko
HADOOP-173 optimize allocation of tasks w/ local data Major . Doug Cutting Doug Cutting
HADOOP-167 reducing the number of Configuration & JobConf objects created Major conf Owen O’Malley Owen O’Malley
HADOOP-177 improvement to browse through the map/reduce tasks Minor . Mahadev konar Mahadev konar
HADOOP-178 piggyback block work requests to heartbeats and move block replication/deletion startup delay from datanodes to namenode Major . Hairong Kuang Hairong Kuang
HADOOP-198 adding owen’s examples to exampledriver Minor . Mahadev konar Mahadev konar
HADOOP-96 name server should log decisions that affect data: block creation, removal, replication Critical . Yoram Arnon Hairong Kuang

BUG FIXES:

JIRA Summary Priority Component Reporter Contributor
HADOOP-117 mapred temporary files not deleted Blocker . raghavendra prabhu Doug Cutting
HADOOP-125 LocalFileSystem.makeAbsolute bug on Windows Minor fs p sutter Doug Cutting
HADOOP-126 “hadoop dfs -cp” does not copy crc files Major . Konstantin Shvachko Konstantin Shvachko
HADOOP-92 Error Reporting/logging in MapReduce Minor . Mahadev konar Mahadev konar
HADOOP-128 Failure to replicate dfs block kills client Major . Owen O’Malley Owen O’Malley
HADOOP-118 Namenode does not always clean up pendingCreates Critical . Owen O’Malley Owen O’Malley
HADOOP-139 Deadlock in LocalFileSystem lock/release Major fs Igor Bolotin Doug Cutting
HADOOP-134 JobTracker trapped in a loop if it fails to localize a task Major . Runping Qi Owen O’Malley
HADOOP-133 the TaskTracker.Child.ping thread calls exit Major . Owen O’Malley Owen O’Malley
HADOOP-143 exception call stacks are word wrapped in webapp Major . Owen O’Malley Owen O’Malley
HADOOP-151 RPC code has socket leak? Major ipc p sutter Doug Cutting
HADOOP-69 Unchecked lookup value causes NPE in FSNamesystemgetDatanodeHints Major . Bryan Pendleton
HADOOP-68 “Cannot abandon block during write to <file>” and “Cannot obtain additional block for file <file>” errors during dfs write test Major . Owen O’Malley Owen O’Malley
HADOOP-157 job fails because pendingCreates is not cleaned up after a task fails Major . Owen O’Malley Owen O’Malley
HADOOP-162 concurrent modification exception in FSNamesystem.Lease.releaseLocks Major . Owen O’Malley Owen O’Malley
HADOOP-160 sleeping with locks held Major . Owen O’Malley Owen O’Malley
HADOOP-168 JobSubmissionProtocol and InterTrackerProtocol don’t include “throws IOException” on all methods Minor . Owen O’Malley Owen O’Malley
HADOOP-169 a single failure from locateMapOutputs kills the entire job Critical . Owen O’Malley Owen O’Malley
HADOOP-172 rpc doesn’t handle returning null for a String[] Blocker ipc Owen O’Malley Owen O’Malley
HADOOP-174 jobclient kills job for one timeout Major . Owen O’Malley Owen O’Malley
HADOOP-183 adjust file replication factor when loading image and edits according to replication.min and replication.max Minor . Hairong Kuang Hairong Kuang
HADOOP-137 Different TaskTrackers may get the same task tracker id, thus cause many problems. Critical . Runping Qi Owen O’Malley
HADOOP-186 communication problems in the task tracker cause long latency Major . Owen O’Malley Owen O’Malley
HADOOP-188 more unprotected RPC calls in JobClient.runJob allow loss of job due to timeout Major . Owen O’Malley Owen O’Malley
HADOOP-185 tasks are lost during pollForNewTask Critical . Owen O’Malley Owen O’Malley
HADOOP-190 Job fails though task succeeded if we fail to exit Major . stack
HADOOP-192 Trivial JRE 1.5 versus 1.4 bug Blocker . David Bowen
HADOOP-182 lost task trackers should not update status of completed jobs Major . Owen O’Malley Owen O’Malley
HADOOP-63 problem with webapp when start a jobtracker Minor . Hairong Kuang Hairong Kuang

TESTS:

JIRA Summary Priority Component Reporter Contributor
HADOOP-187 simple distributed dfs random data writer & sort example applications Major . Owen O’Malley Owen O’Malley
HADOOP-184 hadoop nightly build and regression test on a cluster Minor . Mahadev konar Mahadev konar
HADOOP-193 DFS i/o benchmark. Major fs Konstantin Shvachko Konstantin Shvachko
HADOOP-194 Distributed checkup of the file system consistency. Major . Konstantin Shvachko Konstantin Shvachko