org.apache.hadoop.mapreduce.lib.partition
Class InputSampler<K,V>
java.lang.Object
   org.apache.hadoop.conf.Configured
       org.apache.hadoop.mapreduce.lib.partition.InputSampler<K,V>
- All Implemented Interfaces: 
- Configurable, Tool
@InterfaceAudience.Public
@InterfaceStability.Stable
public class InputSampler<K,V>
extends Configured
implements Tool
Utility for collecting samples and writing a partition file for
 TotalOrderPartitioner.
 
 
 
 
| Methods inherited from class java.lang.Object | 
| clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait | 
 
 
InputSampler
public InputSampler(Configuration conf)
writePartitionFile
public static <K,V> void writePartitionFile(Job job,
                                            InputSampler.Sampler<K,V> sampler)
                               throws IOException,
                                      ClassNotFoundException,
                                      InterruptedException
- Write a partition file for the given job, using the Sampler provided.
 Queries the sampler for a sample keyset, sorts by the output key
 comparator, selects the keys for each rank, and writes to the destination
 returned from TotalOrderPartitioner.getPartitionFile(org.apache.hadoop.conf.Configuration).
 
- Throws:
- IOException
- ClassNotFoundException
- InterruptedException
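
The sort-and-select step described above can be illustrated without a cluster. The following is a minimal sketch in plain Java, not the Hadoop implementation: the class SplitPointSketch and the method selectSplitPoints are hypothetical names, the evenly spaced rank rule is an assumption, and handling of duplicate keys and the actual write to the partition file are left out.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration only; this class is not part of Hadoop.
public class SplitPointSketch {

    /**
     * Sorts a copy of the sampled keys with the given comparator and picks
     * numPartitions - 1 evenly spaced keys to act as partition boundaries.
     */
    public static <K> List<K> selectSplitPoints(List<K> samples,
                                                Comparator<? super K> comparator,
                                                int numPartitions) {
        if (numPartitions < 1) {
            throw new IllegalArgumentException("numPartitions must be >= 1");
        }
        if (samples.size() < numPartitions) {
            throw new IllegalArgumentException("need at least one sample per partition");
        }
        // Sort a copy so the caller's list is left untouched.
        List<K> sorted = new ArrayList<>(samples);
        sorted.sort(comparator);

        List<K> splits = new ArrayList<>();
        float stepSize = sorted.size() / (float) numPartitions;
        for (int i = 1; i < numPartitions; i++) {
            // Rank of the i-th boundary, clamped to the last valid index.
            int k = Math.min(Math.round(stepSize * i), sorted.size() - 1);
            splits.add(sorted.get(k));
        }
        return splits;
    }

    public static void main(String[] args) {
        List<Integer> samples =
            Arrays.asList(42, 7, 19, 88, 3, 61, 25, 70, 14, 55, 33, 96);
        // Four partitions need three boundary keys.
        List<Integer> splits =
            selectSplitPoints(samples, Comparator.<Integer>naturalOrder(), 4);
        System.out.println(splits); // prints [19, 42, 70]
    }
}
```

In this sketch, keys that sort below the first boundary would go to the first partition, keys between the first and second boundaries to the next, and so on, which is the kind of ordering a total-order partitioner relies on.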
 
run
public int run(String[] args)
        throws Exception
- Driver for InputSampler from the command line.
 Configures a Job instance and calls writePartitionFile(org.apache.hadoop.mapreduce.Job, org.apache.hadoop.mapreduce.lib.partition.InputSampler.Sampler).
 
- Specified by:
- run in interface Tool
 
- Parameters:
- args - command-specific arguments.
- Returns:
- exit code.
- Throws:
- Exception
 
main
public static void main(String[] args)
                 throws Exception
- Throws:
- Exception
 
Copyright © 2009 The Apache Software Foundation