Skip navigation links

Package org.apache.hadoop.hdfs

A distributed implementation of FileSystem.

See: Description

Package org.apache.hadoop.hdfs Description

A distributed implementation of FileSystem. This is loosely modelled after Google's GFS.

The most important difference is that unlike GFS, Hadoop DFS files have strictly one writer at any one time. Bytes are always appended to the end of the writer's stream. There is no notion of "record appends" or "mutations" that are then checked or reordered. Writers simply emit a byte stream. That byte stream is guaranteed to be stored in the order written.

Skip navigation links

Copyright © 2023 Apache Software Foundation. All rights reserved.