Package | Description |
---|---|
org.apache.hadoop.fs.azurebfs.commit |
Support for manifest committer.
|
org.apache.hadoop.fs.s3a.commit.magic |
This is the "Magic" committer and support.
|
org.apache.hadoop.mapred | |
org.apache.hadoop.mapred.lib | |
org.apache.hadoop.mapreduce | |
org.apache.hadoop.mapreduce.lib.db | |
org.apache.hadoop.mapreduce.lib.input | |
org.apache.hadoop.mapreduce.lib.join | |
org.apache.hadoop.mapreduce.lib.output | |
org.apache.hadoop.mapreduce.lib.output.committer.manifest |
Intermediate manifest committer.
|
Modifier and Type | Method and Description |
---|---|
ManifestCommitter |
AzureManifestCommitterFactory.createOutputCommitter(Path outputPath,
TaskAttemptContext context) |
Modifier and Type | Method and Description |
---|---|
void |
MagicS3GuardCommitter.abortTask(TaskAttemptContext context)
Abort a task.
|
void |
MagicS3GuardCommitter.commitTask(TaskAttemptContext context) |
protected Path |
MagicS3GuardCommitter.getBaseTaskAttemptPath(TaskAttemptContext context) |
Path |
MagicS3GuardCommitter.getTaskAttemptPath(TaskAttemptContext context)
Compute the path where the output of a task attempt is stored until
that task is committed.
|
Path |
MagicS3GuardCommitter.getTempTaskAttemptPath(TaskAttemptContext context)
Get a temporary directory for data.
|
protected org.apache.hadoop.fs.s3a.commit.files.PendingSet |
MagicS3GuardCommitter.loadPendingCommits(TaskAttemptContext context)
Loads pending commits from either memory or from the remote store (S3) based on the config.
|
boolean |
MagicS3GuardCommitter.needsTaskCommit(TaskAttemptContext context)
Did this task write any files in the work directory?
Probes for a task existing by looking to see if the attempt dir exists.
|
Constructor and Description |
---|
MagicS3GuardCommitter(Path outputPath,
TaskAttemptContext context)
Create a task committer.
|
Modifier and Type | Interface and Description |
---|---|
interface |
TaskAttemptContext |
Modifier and Type | Method and Description |
---|---|
void |
OutputCommitter.abortTask(TaskAttemptContext taskContext)
This method implements the new interface by calling the old method.
|
void |
OutputCommitter.commitTask(TaskAttemptContext taskContext)
This method implements the new interface by calling the old method.
|
boolean |
OutputCommitter.needsTaskCommit(TaskAttemptContext taskContext)
This method implements the new interface by calling the old method.
|
void |
OutputCommitter.recoverTask(TaskAttemptContext taskContext)
This method implements the new interface by calling the old method.
|
void |
OutputCommitter.setupTask(TaskAttemptContext taskContext)
This method implements the new interface by calling the old method.
|
Modifier and Type | Method and Description |
---|---|
RecordReader<K,V> |
CombineFileInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context) |
Modifier and Type | Interface and Description |
---|---|
interface |
MapContext<KEYIN,VALUEIN,KEYOUT,VALUEOUT>
The context that is given to the
Mapper . |
interface |
ReduceContext<KEYIN,VALUEIN,KEYOUT,VALUEOUT>
The context passed to the
Reducer . |
interface |
TaskInputOutputContext<KEYIN,VALUEIN,KEYOUT,VALUEOUT>
A context object that allows input and output from the task.
|
Modifier and Type | Method and Description |
---|---|
abstract void |
OutputCommitter.abortTask(TaskAttemptContext taskContext)
Discard the task output.
|
abstract void |
RecordWriter.close(TaskAttemptContext context)
Close this
RecordWriter to future operations. |
abstract void |
OutputCommitter.commitTask(TaskAttemptContext taskContext)
To promote the task's temporary output to final output location.
|
abstract RecordReader<K,V> |
InputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context)
Create a record reader for a given split.
|
abstract OutputCommitter |
OutputFormat.getOutputCommitter(TaskAttemptContext context)
Get the output committer for this output format.
|
abstract RecordWriter<K,V> |
OutputFormat.getRecordWriter(TaskAttemptContext context)
Get the
RecordWriter for the given task. |
abstract void |
RecordReader.initialize(InputSplit split,
TaskAttemptContext context)
Called once at initialization.
|
abstract boolean |
OutputCommitter.needsTaskCommit(TaskAttemptContext taskContext)
Check whether task needs a commit.
|
void |
OutputCommitter.recoverTask(TaskAttemptContext taskContext)
Recover the task output.
|
abstract void |
OutputCommitter.setupTask(TaskAttemptContext taskContext)
Sets up output for the task.
|
Modifier and Type | Method and Description |
---|---|
RecordReader<LongWritable,T> |
DBInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context)
Create a record reader for a given split.
|
OutputCommitter |
DBOutputFormat.getOutputCommitter(TaskAttemptContext context) |
RecordWriter<K,V> |
DBOutputFormat.getRecordWriter(TaskAttemptContext context)
Get the
RecordWriter for the given task. |
void |
DBRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
Modifier and Type | Field and Description |
---|---|
protected TaskAttemptContext |
CombineFileRecordReader.context |
Constructor and Description |
---|
CombineFileRecordReader(CombineFileSplit split,
TaskAttemptContext context,
Class<? extends RecordReader<K,V>> rrClass)
A generic RecordReader that can hand out different recordReaders
for each chunk in the CombineFileSplit.
|
CombineFileRecordReaderWrapper(FileInputFormat<K,V> inputFormat,
CombineFileSplit split,
TaskAttemptContext context,
Integer idx) |
Modifier and Type | Method and Description |
---|---|
RecordReader<K,TupleWritable> |
CompositeInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext taskContext)
Construct a CompositeRecordReader for the children of this InputFormat
as defined in the init expression.
|
abstract ComposableRecordReader<K,V> |
ComposableInputFormat.createRecordReader(InputSplit split,
TaskAttemptContext context) |
void |
WrappedRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
void |
CompositeRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
void |
MultiFilterRecordReader.initialize(InputSplit split,
TaskAttemptContext context) |
Modifier and Type | Method and Description |
---|---|
void |
BindingPathOutputCommitter.abortTask(TaskAttemptContext taskContext) |
void |
FileOutputCommitter.abortTask(TaskAttemptContext context)
Delete the work directory
|
void |
PartialFileOutputCommitter.cleanUpPartialOutputForTask(TaskAttemptContext context) |
void |
PartialOutputCommitter.cleanUpPartialOutputForTask(TaskAttemptContext context)
Remove all previously committed outputs from prior executions of this task.
|
void |
BindingPathOutputCommitter.commitTask(TaskAttemptContext taskContext) |
void |
FileOutputCommitter.commitTask(TaskAttemptContext context)
Move the files from the work directory to the job output directory
|
static PathOutputCommitter |
PathOutputCommitterFactory.createCommitter(Path outputPath,
TaskAttemptContext context)
Create the committer factory for a task attempt and destination, then
create the committer from it.
|
protected PathOutputCommitter |
PathOutputCommitterFactory.createFileOutputCommitter(Path outputPath,
TaskAttemptContext context)
Create an instance of the default committer, a
FileOutputCommitter
for a task. |
PathOutputCommitter |
PathOutputCommitterFactory.createOutputCommitter(Path outputPath,
TaskAttemptContext context)
Create an output committer for a task attempt.
|
Path |
PartialFileOutputCommitter.getCommittedTaskPath(int appAttemptId,
TaskAttemptContext context) |
protected Path |
FileOutputCommitter.getCommittedTaskPath(int appAttemptId,
TaskAttemptContext context)
Compute the path where the output of a committed task is stored until the
entire job is committed for a specific application attempt.
|
Path |
FileOutputCommitter.getCommittedTaskPath(TaskAttemptContext context)
Compute the path where the output of a committed task is stored until
the entire job is committed.
|
static Path |
FileOutputCommitter.getCommittedTaskPath(TaskAttemptContext context,
Path out) |
Path |
FileOutputFormat.getDefaultWorkFile(TaskAttemptContext context,
String extension)
Get the default path and filename for the output format.
|
OutputCommitter |
NullOutputFormat.getOutputCommitter(TaskAttemptContext context) |
OutputCommitter |
FilterOutputFormat.getOutputCommitter(TaskAttemptContext context) |
OutputCommitter |
FileOutputFormat.getOutputCommitter(TaskAttemptContext context) |
OutputCommitter |
LazyOutputFormat.getOutputCommitter(TaskAttemptContext context) |
RecordWriter<K,V> |
NullOutputFormat.getRecordWriter(TaskAttemptContext context) |
RecordWriter<K,V> |
FilterOutputFormat.getRecordWriter(TaskAttemptContext context) |
RecordWriter<K,V> |
SequenceFileOutputFormat.getRecordWriter(TaskAttemptContext context) |
RecordWriter<WritableComparable<?>,Writable> |
MapFileOutputFormat.getRecordWriter(TaskAttemptContext context) |
abstract RecordWriter<K,V> |
FileOutputFormat.getRecordWriter(TaskAttemptContext job) |
RecordWriter<K,V> |
LazyOutputFormat.getRecordWriter(TaskAttemptContext context) |
RecordWriter<BytesWritable,BytesWritable> |
SequenceFileAsBinaryOutputFormat.getRecordWriter(TaskAttemptContext context) |
RecordWriter<K,V> |
TextOutputFormat.getRecordWriter(TaskAttemptContext job) |
protected org.apache.hadoop.io.SequenceFile.Writer |
SequenceFileOutputFormat.getSequenceWriter(TaskAttemptContext context,
Class<?> keyClass,
Class<?> valueClass) |
Path |
FileOutputCommitter.getTaskAttemptPath(TaskAttemptContext context)
Compute the path where the output of a task attempt is stored until
that task is committed.
|
static Path |
FileOutputCommitter.getTaskAttemptPath(TaskAttemptContext context,
Path out)
Compute the path where the output of a task attempt is stored until
that task is committed.
|
static String |
FileOutputFormat.getUniqueFile(TaskAttemptContext context,
String name,
String extension)
Generate a unique filename, based on the task id, name, and extension
|
boolean |
BindingPathOutputCommitter.needsTaskCommit(TaskAttemptContext taskContext) |
boolean |
FileOutputCommitter.needsTaskCommit(TaskAttemptContext context)
Did this task write any files in the work directory?
|
void |
BindingPathOutputCommitter.recoverTask(TaskAttemptContext taskContext) |
void |
FileOutputCommitter.recoverTask(TaskAttemptContext context) |
void |
BindingPathOutputCommitter.setupTask(TaskAttemptContext taskContext) |
void |
FileOutputCommitter.setupTask(TaskAttemptContext context)
No task setup required.
|
Constructor and Description |
---|
BindingPathOutputCommitter(Path outputPath,
TaskAttemptContext context)
Instantiate.
|
FileOutputCommitter(Path outputPath,
TaskAttemptContext context)
Create a file output committer
|
PartialFileOutputCommitter(Path outputPath,
TaskAttemptContext context) |
PathOutputCommitter(Path outputPath,
TaskAttemptContext context)
Constructor for a task attempt.
|
Modifier and Type | Method and Description |
---|---|
void |
ManifestCommitter.abortTask(TaskAttemptContext context)
Abort a task.
|
void |
ManifestCommitter.commitTask(TaskAttemptContext context)
Commit the task.
|
ManifestCommitter |
ManifestCommitterFactory.createOutputCommitter(Path outputPath,
TaskAttemptContext context) |
Path |
ManifestCommitter.getTaskAttemptPath(TaskAttemptContext context)
Compute the path where the output of a task attempt is stored until
that task is committed.
|
Path |
ManifestCommitter.getTaskManifestPath(TaskAttemptContext context)
The path to where the manifest file of a task attempt will be
saved when the task is committed.
|
boolean |
ManifestCommitter.needsTaskCommit(TaskAttemptContext context)
Always return true.
|
void |
ManifestCommitter.recoverTask(TaskAttemptContext taskContext) |
void |
ManifestCommitter.setupTask(TaskAttemptContext context)
Set up a task through a
SetupTaskStage . |
Constructor and Description |
---|
ManifestCommitter(Path outputPath,
TaskAttemptContext context)
Create a committer.
|
Copyright © 2024 Apache Software Foundation. All rights reserved.