org.apache.hadoop.mapred
Class MultiFileSplit
java.lang.Object
org.apache.hadoop.mapreduce.InputSplit
org.apache.hadoop.mapreduce.lib.input.CombineFileSplit
org.apache.hadoop.mapred.lib.CombineFileSplit
org.apache.hadoop.mapred.MultiFileSplit
- All Implemented Interfaces:
- Writable, InputSplit
@InterfaceAudience.Public
@InterfaceStability.Stable
public class MultiFileSplit
- extends CombineFileSplit
A sub-collection of input files. Unlike FileSplit
, MultiFileSplit
class does not represent a split of a file, but a split of input files
into smaller sets. The atomic unit of split is a file.
MultiFileSplit can be used to implement RecordReader
's, with
reading one record per file.
- See Also:
FileSplit
,
MultiFileInputFormat
MultiFileSplit
public MultiFileSplit(JobConf job,
Path[] files,
long[] lengths)
getLocations
public String[] getLocations()
throws IOException
- Description copied from class:
CombineFileSplit
- Returns all the Paths where this input-split resides
- Specified by:
getLocations
in interface InputSplit
- Overrides:
getLocations
in class CombineFileSplit
- Returns:
- a new array of the node nodes.
- Throws:
IOException
toString
public String toString()
- Overrides:
toString
in class CombineFileSplit
Copyright © 2014 Apache Software Foundation. All Rights Reserved.