java.lang.Object
  org.apache.hadoop.io.SequenceFile.Sorter
public static class SequenceFile.Sorter
Sorts key/value pairs in a sequence-format file.
For best performance, applications should make sure that the Writable.readFields(DataInput) implementation of their keys is very efficient. In particular, it should avoid allocating memory.
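As a rough illustration of the advice above, the sketch below shows a key whose readFields(DataInput) reads into a buffer it already owns and grows that buffer only when a record does not fit. ReusableBytesKey and all of its members are hypothetical names, not part of Hadoop. A real key would implement WritableComparable; that interface is left out here as an assumption so the example compiles with only the JDK.

```java
import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;

// Hypothetical key type, for illustration only. In a real application it
// would implement org.apache.hadoop.io.WritableComparable.
final class ReusableBytesKey {
    private byte[] buf = new byte[16]; // backing buffer, reused across reads
    private int length;                // number of valid bytes in buf

    void set(byte[] src) {
        ensureCapacity(src.length);
        System.arraycopy(src, 0, buf, 0, src.length);
        length = src.length;
    }

    // Same shape as Writable.write(DataOutput): length prefix, then payload.
    void write(DataOutput out) throws IOException {
        out.writeInt(length);
        out.write(buf, 0, length);
    }

    // Same shape as Writable.readFields(DataInput). Reads into the existing
    // buffer and allocates only when the record is larger than the buffer.
    void readFields(DataInput in) throws IOException {
        length = in.readInt();
        ensureCapacity(length);
        in.readFully(buf, 0, length);
    }

    private void ensureCapacity(int needed) {
        if (needed > buf.length) {
            buf = new byte[Math.max(needed, buf.length * 2)];
        }
    }

    int getLength() { return length; }

    byte[] getBuffer() { return buf; }
}
```

Because the same instance is refilled for every record, callers must copy the bytes if they need to keep a key after the next readFields call.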
Nested Class Summary | |
---|---|
static interface | SequenceFile.Sorter.RawKeyValueIterator - The interface to iterate over raw keys/values of SequenceFiles. |
class | SequenceFile.Sorter.SegmentDescriptor - This class defines a merge segment. |
Constructor Summary | |
---|---|
SequenceFile.Sorter(FileSystem fs, Class<? extends WritableComparable> keyClass, Class valClass, Configuration conf) | Sort and merge files containing the named classes. |
SequenceFile.Sorter(FileSystem fs, RawComparator comparator, Class keyClass, Class valClass, Configuration conf) | Sort and merge using an arbitrary RawComparator. |
Method Summary | |
---|---|
SequenceFile.Writer | cloneFileAttributes(Path inputFile, Path outputFile, Progressable prog) - Clones the attributes (such as compression) of the input file and creates a corresponding Writer. |
int | getFactor() - Get the number of streams to merge at once. |
int | getMemory() - Get the total amount of buffer memory, in bytes. |
SequenceFile.Sorter.RawKeyValueIterator | merge(List<SequenceFile.Sorter.SegmentDescriptor> segments, Path tmpDir) - Merges the list of segments of type SegmentDescriptor. |
SequenceFile.Sorter.RawKeyValueIterator | merge(Path[] inNames, boolean deleteInputs, int factor, Path tmpDir) - Merges the contents of files passed in Path[]. |
SequenceFile.Sorter.RawKeyValueIterator | merge(Path[] inNames, boolean deleteInputs, Path tmpDir) - Merges the contents of files passed in Path[] using a max factor value that is already set. |
void | merge(Path[] inFiles, Path outFile) - Merge the provided files. |
SequenceFile.Sorter.RawKeyValueIterator | merge(Path[] inNames, Path tempDir, boolean deleteInputs) - Merges the contents of files passed in Path[]. |
void | setFactor(int factor) - Set the number of streams to merge at once. |
void | setMemory(int memory) - Set the total amount of buffer memory, in bytes. |
void | setProgressable(Progressable progressable) - Set the progressable object in order to report progress. |
void | sort(Path[] inFiles, Path outFile, boolean deleteInput) - Perform a file sort from a set of input files into an output file. |
void | sort(Path inFile, Path outFile) - The backwards compatible interface to sort. |
SequenceFile.Sorter.RawKeyValueIterator | sortAndIterate(Path[] inFiles, Path tempDir, boolean deleteInput) - Perform a file sort from a set of input files and return an iterator. |
void | writeFile(SequenceFile.Sorter.RawKeyValueIterator records, SequenceFile.Writer writer) - Writes records from RawKeyValueIterator into a file represented by the passed writer. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public SequenceFile.Sorter(FileSystem fs, Class<? extends WritableComparable> keyClass, Class valClass, Configuration conf)
Sort and merge files containing the named classes.
public SequenceFile.Sorter(FileSystem fs, RawComparator comparator, Class keyClass, Class valClass, Configuration conf)
Sort and merge using an arbitrary RawComparator.
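A RawComparator orders keys by their serialized bytes, so the sorter does not have to deserialize each key to compare it. The sketch below shows one possible ordering, an unsigned lexicographic byte comparison, written as a static method with the same parameter shape as RawComparator.compare(byte[], int, int, byte[], int, int). RawBytesOrder is a hypothetical name, and the class deliberately does not implement the Hadoop interface so that it compiles with only the JDK.

```java
// Hypothetical helper, for illustration only. The parameters mirror
// RawComparator.compare: buffer, start offset, and length for each key.
final class RawBytesOrder {
    private RawBytesOrder() {}

    static int compare(byte[] b1, int s1, int l1, byte[] b2, int s2, int l2) {
        int common = Math.min(l1, l2);
        for (int i = 0; i < common; i++) {
            // Mask to 0..255 so bytes compare as unsigned values.
            int x = b1[s1 + i] & 0xff;
            int y = b2[s2 + i] & 0xff;
            if (x != y) {
                return x - y;
            }
        }
        // All shared bytes are equal: the shorter key sorts first.
        return l1 - l2;
    }
}
```

This ordering only matches the logical key order when the serialized form is designed for it, for example fixed-width big-endian unsigned values or plain byte strings.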
Method Detail |
---|
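public void setFactor(int factor)
Set the number of streams to merge at once.
public int getFactor()
Get the number of streams to merge at once.
public void setMemory(int memory)
Set the total amount of buffer memory, in bytes.
public int getMemory()
Get the total amount of buffer memory, in bytes.
public void setProgressable(Progressable progressable)
Set the progressable object in order to report progress.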
public void sort(Path[] inFiles, Path outFile, boolean deleteInput) throws IOException
Perform a file sort from a set of input files into an output file.
Parameters:
inFiles - the files to be sorted
outFile - the sorted output file
deleteInput - whether the input files should be deleted as they are read
Throws:
IOException
public SequenceFile.Sorter.RawKeyValueIterator sortAndIterate(Path[] inFiles, Path tempDir, boolean deleteInput) throws IOException
Perform a file sort from a set of input files and return an iterator.
Parameters:
inFiles - the files to be sorted
tempDir - the directory where temp files are created during sort
deleteInput - whether the input files should be deleted as they are read
Throws:
IOException
public void sort(Path inFile, Path outFile) throws IOException
The backwards compatible interface to sort.
Parameters:
inFile - the input file to sort
outFile - the sorted output file
Throws:
IOException
public SequenceFile.Sorter.RawKeyValueIterator merge(List<SequenceFile.Sorter.SegmentDescriptor> segments, Path tmpDir) throws IOException
Merges the list of segments of type SegmentDescriptor.
Parameters:
segments - the list of SegmentDescriptors
tmpDir - the directory to write temporary files into
Throws:
IOException
public SequenceFile.Sorter.RawKeyValueIterator merge(Path[] inNames, boolean deleteInputs, Path tmpDir) throws IOException
Merges the contents of files passed in Path[] using a max factor value that is already set.
Parameters:
inNames - the array of path names
deleteInputs - true if the input files should be deleted when they are no longer needed
tmpDir - the directory to write temporary files into
Throws:
IOException
public SequenceFile.Sorter.RawKeyValueIterator merge(Path[] inNames, boolean deleteInputs, int factor, Path tmpDir) throws IOException
Merges the contents of files passed in Path[].
Parameters:
inNames - the array of path names
deleteInputs - true if the input files should be deleted when they are no longer needed
factor - the factor that will be used as the maximum merge fan-in
tmpDir - the directory to write temporary files into
Throws:
IOException
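To make the idea of a maximum merge fan-in concrete, the sketch below merges sorted in-memory runs in passes, combining at most factor runs per merge until one run remains. It is a simplified illustration using only the JDK, not the algorithm SequenceFile.Sorter actually uses; FanInMerge, mergeAll, mergeOnce, and passes are hypothetical names, and integer lists stand in for on-disk segments.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.PriorityQueue;

// Hypothetical illustration of multi-pass merging with a bounded fan-in.
final class FanInMerge {
    private FanInMerge() {}

    // Repeatedly merges groups of at most `factor` runs until one run is left.
    static List<Integer> mergeAll(List<List<Integer>> runs, int factor) {
        if (factor < 2) {
            throw new IllegalArgumentException("factor must be at least 2");
        }
        List<List<Integer>> current = new ArrayList<>(runs);
        while (current.size() > 1) {
            List<List<Integer>> next = new ArrayList<>();
            for (int i = 0; i < current.size(); i += factor) {
                int end = Math.min(i + factor, current.size());
                next.add(mergeOnce(current.subList(i, end)));
            }
            current = next;
        }
        return current.isEmpty() ? new ArrayList<Integer>() : current.get(0);
    }

    // One k-way merge using a min-heap of {value, runIndex, positionInRun}.
    static List<Integer> mergeOnce(List<List<Integer>> runs) {
        PriorityQueue<int[]> heap =
            new PriorityQueue<>((a, b) -> Integer.compare(a[0], b[0]));
        for (int r = 0; r < runs.size(); r++) {
            if (!runs.get(r).isEmpty()) {
                heap.add(new int[] {runs.get(r).get(0), r, 0});
            }
        }
        List<Integer> out = new ArrayList<>();
        while (!heap.isEmpty()) {
            int[] top = heap.poll();
            out.add(top[0]);
            List<Integer> run = runs.get(top[1]);
            int nextPos = top[2] + 1;
            if (nextPos < run.size()) {
                heap.add(new int[] {run.get(nextPos), top[1], nextPos});
            }
        }
        return out;
    }

    // Number of passes this simple scheme needs for a given run count.
    static int passes(int runs, int factor) {
        if (factor < 2) {
            throw new IllegalArgumentException("factor must be at least 2");
        }
        int count = 0;
        while (runs > 1) {
            runs = (runs + factor - 1) / factor; // ceiling division
            count++;
        }
        return count;
    }
}
```

In this scheme a larger factor means fewer passes over the data but more inputs open at once, which is the trade-off a fan-in limit controls.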
public SequenceFile.Sorter.RawKeyValueIterator merge(Path[] inNames, Path tempDir, boolean deleteInputs) throws IOException
Merges the contents of files passed in Path[].
Parameters:
inNames - the array of path names
tempDir - the directory for creating temp files during merge
deleteInputs - true if the input files should be deleted when they are no longer needed
Throws:
IOException
public SequenceFile.Writer cloneFileAttributes(Path inputFile, Path outputFile, Progressable prog) throws IOException
Clones the attributes (such as compression) of the input file and creates a corresponding Writer.
Parameters:
inputFile - the path of the input file whose attributes should be cloned
outputFile - the path of the output file
prog - the Progressable to report status during the file write
Throws:
IOException
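public void writeFile(SequenceFile.Sorter.RawKeyValueIterator records, SequenceFile.Writer writer) throws IOException
Writes records from RawKeyValueIterator into a file represented by the passed writer.
Parameters:
records - the RawKeyValueIterator
writer - the Writer created earlier
Throws:
IOException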
public void merge(Path[] inFiles, Path outFile) throws IOException
Merge the provided files.
Parameters:
inFiles - the array of input path names
outFile - the final output file
Throws:
IOException