Class PartialFileOutputCommitter
java.lang.Object
org.apache.hadoop.mapreduce.OutputCommitter
org.apache.hadoop.mapreduce.lib.output.PathOutputCommitter
org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter
org.apache.hadoop.mapreduce.lib.output.PartialFileOutputCommitter
- All Implemented Interfaces:
PartialOutputCommitter
@Checkpointable
@Public
@Evolving
public class PartialFileOutputCommitter
extends FileOutputCommitter
implements PartialOutputCommitter
An
OutputCommitter that commits files specified
in job output directory i.e. ${mapreduce.output.fileoutputformat.outputdir}.-
Field Summary
Fields inherited from class org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter
FILEOUTPUTCOMMITTER_ALGORITHM_VERSION, FILEOUTPUTCOMMITTER_ALGORITHM_VERSION_DEFAULT, FILEOUTPUTCOMMITTER_CLEANUP_FAILURES_IGNORED, FILEOUTPUTCOMMITTER_CLEANUP_FAILURES_IGNORED_DEFAULT, FILEOUTPUTCOMMITTER_CLEANUP_SKIPPED, FILEOUTPUTCOMMITTER_CLEANUP_SKIPPED_DEFAULT, FILEOUTPUTCOMMITTER_FAILURE_ATTEMPTS, FILEOUTPUTCOMMITTER_FAILURE_ATTEMPTS_DEFAULT, FILEOUTPUTCOMMITTER_TASK_CLEANUP_ENABLED, FILEOUTPUTCOMMITTER_TASK_CLEANUP_ENABLED_DEFAULT, PENDING_DIR_NAME, SUCCEEDED_FILE_NAME, SUCCESSFUL_JOB_OUTPUT_DIR_MARKER, TEMP_DIR_NAME -
Constructor Summary
ConstructorsConstructorDescriptionPartialFileOutputCommitter(Path outputPath, JobContext context) PartialFileOutputCommitter(Path outputPath, TaskAttemptContext context) -
Method Summary
Modifier and TypeMethodDescriptionvoidRemove all previously committed outputs from prior executions of this task.getCommittedTaskPath(int appAttemptId, TaskAttemptContext context) Compute the path where the output of a committed task is stored until the entire job is committed for a specific application attempt.Methods inherited from class org.apache.hadoop.mapreduce.lib.output.FileOutputCommitter
abortJob, abortTask, abortTask, cleanupJob, commitJob, commitJobInternal, commitTask, commitTask, getCommittedTaskPath, getCommittedTaskPath, getJobAttemptPath, getJobAttemptPath, getJobAttemptPath, getOutputPath, getTaskAttemptPath, getTaskAttemptPath, getWorkPath, isCommitJobRepeatable, isRecoverySupported, needsTaskCommit, needsTaskCommit, recoverTask, setupJob, setupTask, toStringMethods inherited from class org.apache.hadoop.mapreduce.lib.output.PathOutputCommitter
hasOutputPathMethods inherited from class org.apache.hadoop.mapreduce.OutputCommitter
isRecoverySupported
-
Constructor Details
-
PartialFileOutputCommitter
- Throws:
IOException
-
PartialFileOutputCommitter
- Throws:
IOException
-
-
Method Details
-
getCommittedTaskPath
Description copied from class:FileOutputCommitterCompute the path where the output of a committed task is stored until the entire job is committed for a specific application attempt.- Overrides:
getCommittedTaskPathin classFileOutputCommitter- Parameters:
appAttemptId- the id of the application attempt to usecontext- the context of any task.- Returns:
- the path where the output of a committed task is stored.
-
cleanUpPartialOutputForTask
Description copied from interface:PartialOutputCommitterRemove all previously committed outputs from prior executions of this task.- Specified by:
cleanUpPartialOutputForTaskin interfacePartialOutputCommitter- Parameters:
context- Context for cleaning up previously promoted output.- Throws:
IOException- If cleanup fails, then the state of the task my not be well defined.
-