Class InputSampler<K,V>

java.lang.Object
org.apache.hadoop.conf.Configured
org.apache.hadoop.mapreduce.lib.partition.InputSampler<K,V>
All Implemented Interfaces:
Configurable, Tool
Direct Known Subclasses:
org.apache.hadoop.mapred.lib.InputSampler

@Public @Stable public class InputSampler<K,V> extends Configured implements Tool
Utility for collecting samples and writing a partition file for TotalOrderPartitioner.
  • Nested Class Summary

    Nested Classes
    Modifier and Type
    Class
    Description
    static class 
    org.apache.hadoop.mapreduce.lib.partition.InputSampler.IntervalSampler<K,V>
    Sample from s splits at regular intervals.
    static class 
    org.apache.hadoop.mapreduce.lib.partition.InputSampler.RandomSampler<K,V>
    Sample from random points in the input.
    static interface 
    org.apache.hadoop.mapreduce.lib.partition.InputSampler.Sampler<K,V>
    Interface to sample using an InputFormat.
    static class 
    org.apache.hadoop.mapreduce.lib.partition.InputSampler.SplitSampler<K,V>
    Samples the first n records from s splits.
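
The three samplers listed above differ in which records they keep. The following is a minimal, self-contained sketch of those three ideas in plain Java. It is not Hadoop's implementation: the class SamplingSketch, its method names, and the in-memory lists standing in for input splits are all assumptions made for illustration, and the choice of which splits to read is left out entirely.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

// Hypothetical illustration only; none of these names exist in Hadoop.
final class SamplingSketch {

    /** Split-style sampling: take the first n records of each split. */
    static <T> List<T> firstN(List<List<T>> splits, int n) {
        List<T> out = new ArrayList<>();
        for (List<T> split : splits) {
            out.addAll(split.subList(0, Math.min(n, split.size())));
        }
        return out;
    }

    /** Interval-style sampling: keep a record whenever kept/seen is below freq. */
    static <T> List<T> atIntervals(List<List<T>> splits, double freq) {
        List<T> out = new ArrayList<>();
        long seen = 0;
        long kept = 0;
        for (List<T> split : splits) {
            for (T record : split) {
                seen++;
                if ((double) kept / seen < freq) {
                    out.add(record);
                    kept++;
                }
            }
        }
        return out;
    }

    /**
     * Random-style sampling: keep each record with probability freq and
     * stop once maxSamples records are held (a simplification).
     */
    static <T> List<T> atRandom(List<List<T>> splits, double freq,
                                int maxSamples, Random rng) {
        List<T> out = new ArrayList<>();
        for (List<T> split : splits) {
            for (T record : split) {
                if (out.size() >= maxSamples) {
                    return out;
                }
                if (rng.nextDouble() < freq) {
                    out.add(record);
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<List<Integer>> splits =
            List.of(List.of(1, 2, 3), List.of(4), List.of(5, 6, 7, 8));
        System.out.println(firstN(splits, 2));        // prints [1, 2, 4, 5, 6]
        System.out.println(atIntervals(splits, 0.5)); // prints [1, 3, 5, 7]
        System.out.println(atRandom(splits, 0.5, 3, new Random()));
    }
}
```

Reading only the head of each split is the cheapest of the three but can be misleading when the input is already sorted, which is the usual reason to prefer interval or random sampling in that case.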
  • Constructor Summary

    Constructors
    Constructor
    Description
    InputSampler(Configuration conf)
     
  • Method Summary

    Modifier and Type
    Method
    Description
    static void
    main(String[] args)
     
    int
    run(String[] args)
    Driver for InputSampler from the command line.
    static <K, V> void
    writePartitionFile(Job job, org.apache.hadoop.mapreduce.lib.partition.InputSampler.Sampler<K,V> sampler)
    Write a partition file for the given job, using the Sampler provided.
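
To make the purpose of a partition file concrete, here is a hedged sketch of one way split points can be derived from a sample and then used to route keys to partitions. The class SplitPointSketch and the methods splitPoints and partitionFor are hypothetical names, the boundaries are held in memory rather than written to a file, and duplicate boundaries are simply dropped, so this illustrates the general idea rather than what writePartitionFile or TotalOrderPartitioner actually do.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration only; none of these names exist in Hadoop.
final class SplitPointSketch {

    /**
     * Sorts the sample and picks up to (numPartitions - 1) boundary keys
     * at evenly spaced positions, dropping repeated boundaries.
     */
    static <K> List<K> splitPoints(K[] samples, Comparator<? super K> cmp,
                                   int numPartitions) {
        if (numPartitions < 1) {
            throw new IllegalArgumentException("numPartitions must be >= 1");
        }
        K[] sorted = samples.clone();
        Arrays.sort(sorted, cmp);
        List<K> points = new ArrayList<>();
        if (sorted.length == 0) {
            return points;
        }
        float step = sorted.length / (float) numPartitions;
        for (int i = 1; i < numPartitions; i++) {
            int k = Math.min(sorted.length - 1, Math.round(step * i));
            K candidate = sorted[k];
            // Keep boundaries strictly increasing.
            if (points.isEmpty()
                    || cmp.compare(points.get(points.size() - 1), candidate) < 0) {
                points.add(candidate);
            }
        }
        return points;
    }

    /** Partition index = number of boundaries less than or equal to key. */
    static <K> int partitionFor(List<K> points, K key, Comparator<? super K> cmp) {
        int pos = Collections.binarySearch(points, key, cmp);
        return pos >= 0 ? pos + 1 : -(pos + 1);
    }

    public static void main(String[] args) {
        Integer[] samples = {9, 1, 5, 3, 7, 2, 8, 4, 6, 10};
        Comparator<Integer> cmp = Comparator.naturalOrder();
        List<Integer> points = splitPoints(samples, cmp, 4);
        System.out.println(points);                        // prints [4, 6, 9]
        System.out.println(partitionFor(points, 1, cmp));  // prints 0
        System.out.println(partitionFor(points, 4, cmp));  // prints 1
        System.out.println(partitionFor(points, 10, cmp)); // prints 3
    }
}
```

Because every key in partition i sorts before every key in partition i + 1, concatenating the sorted output of each partition in order yields a totally ordered result, and evenly spaced boundaries keep the partitions roughly the same size when the sample is representative.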

    Methods inherited from class org.apache.hadoop.conf.Configured

    getConf, setConf

    Methods inherited from class java.lang.Object

    clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

    Methods inherited from interface org.apache.hadoop.conf.Configurable

    getConf, setConf