Apache Hadoop 0.23.3 Release Notes

These release notes cover new developer and user-facing incompatibilities, important issues, features, and major improvements.


distcp skips CRC on 0 byte files.


FsShell’s “mkdir” no longer implicitly creates all non-existent parent directories. The command adopts the posix compliant behavior of requiring the “-p” flag to auto-create parent directories.


Resolve sporadic distcp issue due to having two DistCp classes (v1 & v2) in the classpath.


WARNING: No release note provided for this incompatible change.


WARNING: No release note provided for this incompatible change.


-Djava.library.path in mapred.child.java.opts can cause issues with native libraries. LD_LIBRARY_PATH through mapred.child.env should be used instead.


The Job Summary log may contain commas in values that are escaped by a ’' character. This was true before, but is more likely to be exposed now.


ContainerTokens now have an expiry interval so that stale tokens cannot be used for launching containers.


Removes two sets of previously available config properties:

  1. ( yarn.scheduler.fifo.minimum-allocation-mb and yarn.scheduler.fifo.maximum-allocation-mb ) and,
  2. ( yarn.scheduler.capacity.minimum-allocation-mb and yarn.scheduler.capacity.maximum-allocation-mb )

In favor of two new, generically named properties:

  1. yarn.scheduler.minimum-allocation-mb - This acts as the floor value of memory resource requests for containers.
  2. yarn.scheduler.maximum-allocation-mb - This acts as the ceiling value of memory resource requests for containers.

Both these properties need to be set at the ResourceManager (RM) to take effect, as the RM is where the scheduler resides.

Also changes the default minimum and maximums to 128 MB and 10 GB respectively.


Note that to apply this you should first run the script - ./MAPREDUCE-3543v3.sh svn, then apply the patch.

If this is merged to more then trunk, the version inside of hadoop-tools/hadoop-gridmix/pom.xml will need to be udpated accordingly.


Fixed a bug in MR client to redirect to JobHistoryServer correctly when RM forgets the app.