Package org.apache.hadoop.streaming


package org.apache.hadoop.streaming
Hadoop Streaming is a utility which allows users to create and run Map-Reduce jobs with any executables (e.g. Unix shell utilities) as the mapper and/or the reducer.
  • Classes
    Class
    Description
    org.apache.hadoop.streaming.AutoInputFormat
    An InputFormat that tries to deduce the types of the input files automatically.
    org.apache.hadoop.streaming.DumpTypedBytes
    Utility program that fetches all files that match a given pattern and dumps their content to stdout as typed bytes.
    org.apache.hadoop.streaming.Environment
    This is a class used to get the current environment on the host machines running the map/reduce.
    org.apache.hadoop.streaming.HadoopStreaming
    The main entry point.
    org.apache.hadoop.streaming.JarBuilder
    This class is the main class for generating job.jar for Hadoop Streaming jobs.
    org.apache.hadoop.streaming.LoadTypedBytes
    Utility program that reads typed bytes from standard input and stores them in a sequence file for which the path is given as an argument.
    org.apache.hadoop.streaming.PathFinder
    Maps a relative pathname to an absolute pathname using the PATH environment.
    org.apache.hadoop.streaming.PipeCombiner
     
    org.apache.hadoop.streaming.PipeMapper
    A generic Mapper bridge.
    org.apache.hadoop.streaming.PipeMapRed
    Shared functionality for PipeMapper, PipeReducer.
    org.apache.hadoop.streaming.PipeMapRunner<K1,V1,K2,V2>
     
    org.apache.hadoop.streaming.PipeReducer
    A generic Reducer bridge.
    org.apache.hadoop.streaming.StreamBaseRecordReader
    Shared functionality for hadoopStreaming formats.
    org.apache.hadoop.streaming.StreamInputFormat
    An input format that selects a RecordReader based on a JobConf property.
    org.apache.hadoop.streaming.StreamJob
    All the client-side work happens here.
    org.apache.hadoop.streaming.StreamKeyValUtil
     
    org.apache.hadoop.streaming.StreamUtil
    Utilities used in streaming
    org.apache.hadoop.streaming.StreamXmlRecordReader
    A way to interpret XML fragments as Mapper input records.