Package org.apache.hadoop.mapreduce.lib.input

Interface Summary
SequenceFileInputFilter.Filter filter interface

Class Summary
CombineFileInputFormat<K,V> An abstract InputFormat that returns CombineFileSplits from its InputFormat.getSplits(JobContext) method.
CombineFileRecordReader<K,V> A generic RecordReader that can hand out different recordReaders for each chunk in a CombineFileSplit.
CombineFileSplit A sub-collection of input files.
DelegatingInputFormat<K,V> An InputFormat that delegates behavior of paths to multiple other InputFormats.
DelegatingMapper<K1,V1,K2,V2> A Mapper that delegates behavior of paths to multiple other mappers.
DelegatingRecordReader<K,V> A delegating RecordReader that passes all functionality to the underlying record reader in TaggedInputSplit.
FileInputFormat<K,V> A base class for file-based InputFormats.
FileSplit A section of an input file.
KeyValueLineRecordReader This class treats a line in the input as a key/value pair separated by a separator character.
KeyValueTextInputFormat An InputFormat for plain text files.
LineRecordReader Treats the key as the offset in the file and the value as the line.
MultipleInputs Supports MapReduce jobs that have multiple input paths, with a different InputFormat and Mapper for each path.
NLineInputFormat An InputFormat that treats every N lines of input as one split.
SequenceFileAsBinaryInputFormat InputFormat reading keys, values from SequenceFiles in binary (raw) format.
SequenceFileAsBinaryInputFormat.SequenceFileAsBinaryRecordReader Read records from a SequenceFile as binary (raw) bytes.
SequenceFileAsTextInputFormat This class is similar to SequenceFileInputFormat, except that it generates a SequenceFileAsTextRecordReader, which converts the input keys and values to their String forms by calling their toString() method.
SequenceFileAsTextRecordReader This class converts the input keys and values to their String forms by calling their toString() method.
SequenceFileInputFilter<K,V> A class that allows a MapReduce job to work on a sample of sequence files.
SequenceFileInputFilter.FilterBase Base class for filters.
SequenceFileInputFilter.MD5Filter This class returns a set of records by examining the MD5 digest of its key against a filtering frequency f.
SequenceFileInputFilter.PercentFilter This class returns a percentage of records. The percentage is determined by a filtering frequency f, using the criterion record# % f == 0.
SequenceFileInputFilter.RegexFilter Filters records by matching the key against a regex.
SequenceFileInputFormat<K,V> An InputFormat for SequenceFiles.
SequenceFileRecordReader<K,V> A RecordReader for SequenceFiles.
TextInputFormat An InputFormat for plain text files.
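
The sampling criterion given for SequenceFileInputFilter.PercentFilter above (record# % f == 0) can be illustrated with a small, self-contained sketch. This is plain Java and does not use the Hadoop API: the class PercentFilterSketch and its sample method are hypothetical names chosen for illustration, and the sketch assumes that records are numbered from zero.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class PercentFilterSketch {

    // Hypothetical helper, not part of Hadoop: keeps every record whose
    // zero-based record number satisfies recordNumber % frequency == 0.
    // Zero-based numbering is an assumption of this sketch.
    static <T> List<T> sample(List<T> records, int frequency) {
        if (frequency <= 0) {
            throw new IllegalArgumentException("frequency must be positive");
        }
        List<T> kept = new ArrayList<>();
        for (int recordNumber = 0; recordNumber < records.size(); recordNumber++) {
            if (recordNumber % frequency == 0) {
                kept.add(records.get(recordNumber));
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> records =
                Arrays.asList("r0", "r1", "r2", "r3", "r4", "r5", "r6");
        // With f = 3, records 0, 3 and 6 pass the criterion.
        System.out.println(sample(records, 3)); // prints [r0, r3, r6]
    }
}
```

Under this criterion a frequency of f keeps roughly one record in every f, so f = 1 keeps every record and larger values keep proportionally fewer.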

Enum Summary

Exception Summary
InvalidInputException This class wraps a list of problems with the input, so that the user can get a list of problems together instead of finding and fixing them one by one.

Copyright © 2009 The Apache Software Foundation