@InterfaceStability.Evolving
public interface StreamWriter extends DataSourceWriter

A DataSourceWriter for use with structured streaming. This writer handles commits and aborts relative to an epoch ID determined by the execution engine.

DataWriter implementations generated by a StreamWriter may be reused for multiple epochs, and so must reset any internal state after a successful commit.
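For example, a minimal sketch of a reusable writer, assuming a hypothetical BufferedRowsMessage commit message and a Row-parameterized DataWriter (this is an illustration, not part of the interface):

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

import org.apache.spark.sql.Row;
import org.apache.spark.sql.sources.v2.writer.DataWriter;
import org.apache.spark.sql.sources.v2.writer.WriterCommitMessage;

// Hypothetical commit message carrying the rows buffered during one epoch.
class BufferedRowsMessage implements WriterCommitMessage {
    final List<Row> rows;
    BufferedRowsMessage(List<Row> rows) { this.rows = rows; }
}

// A writer that is reused across epochs: commit() hands off the buffered
// rows in a message and then resets the buffer, so the next epoch starts
// from a clean state as the contract above requires.
class BufferingDataWriter implements DataWriter<Row> {
    private List<Row> buffer = new ArrayList<>();

    @Override
    public void write(Row record) throws IOException {
        buffer.add(record);
    }

    @Override
    public WriterCommitMessage commit() throws IOException {
        WriterCommitMessage message = new BufferedRowsMessage(buffer);
        buffer = new ArrayList<>();  // reset internal state for the next epoch
        return message;
    }

    @Override
    public void abort() throws IOException {
        buffer = new ArrayList<>();  // drop the failed epoch's buffered rows
    }
}
```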
Modifier and Type | Method and Description |
---|---|
void | abort(long epochId, WriterCommitMessage[] messages): Aborts this writing job because some data writers have failed and keep failing when retried, the Spark job failed for an unknown reason, or commit(WriterCommitMessage[]) failed. |
default void | abort(WriterCommitMessage[] messages): Aborts this writing job because some data writers have failed and keep failing when retried, the Spark job failed for an unknown reason, or DataSourceWriter.commit(WriterCommitMessage[]) failed. |
void | commit(long epochId, WriterCommitMessage[] messages): Commits this writing job for the specified epoch with a list of commit messages. |
default void | commit(WriterCommitMessage[] messages): Commits this writing job with a list of commit messages. |
Methods inherited from interface DataSourceWriter: createWriterFactory
void commit(long epochId, WriterCommitMessage[] messages)

Commits this writing job for the specified epoch with a list of commit messages. The commit messages are collected from successful data writers and are produced by DataWriter.commit().

If this method fails (by throwing an exception), this writing job is considered to have failed, and the execution engine will attempt to call abort(WriterCommitMessage[]).

To support exactly-once processing, writer implementations should ensure that this method is idempotent: the execution engine may call commit() multiple times for the same epoch in some circumstances.
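A minimal sketch of such an idempotent commit, assuming a hypothetical EpochLog that the sink exposes (neither EpochLog nor its methods are part of this API):

```java
import org.apache.spark.sql.sources.v2.writer.WriterCommitMessage;
import org.apache.spark.sql.sources.v2.writer.streaming.StreamWriter;

// EpochLog stands in for a sink-side transaction log that can atomically
// publish an epoch's data and remember the highest committed epoch id.
// createWriterFactory() and abort() are left abstract to keep this short.
abstract class IdempotentStreamWriter implements StreamWriter {
    private final EpochLog log;

    IdempotentStreamWriter(EpochLog log) {
        this.log = log;
    }

    @Override
    public void commit(long epochId, WriterCommitMessage[] messages) {
        // The engine may deliver the same epoch twice, e.g. when a retry
        // races with a driver failure; replaying an already-committed
        // epoch must be a no-op, or data will be duplicated.
        if (log.lastCommittedEpoch() >= epochId) {
            return;
        }
        // Publish the epoch's data and record the epoch id in one atomic
        // step, so a crash cannot leave the two out of sync.
        log.publishAtomically(epochId, messages);
    }

    // Hypothetical sink-side log interface.
    interface EpochLog {
        long lastCommittedEpoch();
        void publishAtomically(long epochId, WriterCommitMessage[] messages);
    }
}
```

Recording the epoch id and the data in one atomic operation is what makes the duplicate-commit check reliable across driver restarts.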
void abort(long epochId, WriterCommitMessage[] messages)

Aborts this writing job because some data writers have failed and keep failing when retried, the Spark job failed for an unknown reason, or commit(WriterCommitMessage[]) failed.

If this method fails (by throwing an exception), the underlying data source may require manual cleanup.

Unless the abort is triggered by the failure of commit, the given messages may contain null slots: only some of the data writers may have committed before the abort happened, or some data writers committed but their commit messages hadn't reached the driver when the abort was triggered. So this is just a "best effort" for data sources to clean up the data left by data writers.
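A best-effort cleanup therefore skips the null slots and acts only on the messages that did arrive. A sketch, assuming a hypothetical file-based commit message with a paths() accessor:

```java
import java.util.ArrayList;
import java.util.List;

import org.apache.spark.sql.sources.v2.writer.WriterCommitMessage;

// Collects the staging paths named by the non-null commit messages, so an
// abort() implementation can delete only what was actually reported.
// FileCommitMessage and paths() are hypothetical, not part of this API.
final class AbortCleanup {
    interface FileCommitMessage extends WriterCommitMessage {
        List<String> paths();
    }

    static List<String> stagedPaths(WriterCommitMessage[] messages) {
        List<String> paths = new ArrayList<>();
        for (WriterCommitMessage message : messages) {
            if (message == null) {
                continue;  // this writer never committed; nothing to clean up
            }
            paths.addAll(((FileCommitMessage) message).paths());
        }
        return paths;
    }
}
```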
default void commit(WriterCommitMessage[] messages)

Description copied from interface: DataSourceWriter

Commits this writing job with a list of commit messages. The commit messages are collected from successful data writers and are produced by DataWriter.commit().

If this method fails (by throwing an exception), this writing job is considered to have failed, and DataSourceWriter.abort(WriterCommitMessage[]) will be called. The state of the destination is then undefined, and DataSourceWriter.abort(WriterCommitMessage[]) may not be able to deal with it.

Note that one partition may have multiple committed data writers because of speculative tasks. Spark will pick the first successful one and use its commit message. Implementations should be aware of this and handle it correctly, e.g., have a coordinator to make sure only one data writer can commit, or have a way to clean up the data of already-committed writers.

Specified by: commit in interface DataSourceWriter
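One way to take the cleanup option is to treat the received messages as the source of truth and discard any other staged output. A sketch under that assumption, reusing the hypothetical FileCommitMessage from above; the abstract helpers are placeholders, not part of this API:

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

import org.apache.spark.sql.sources.v2.writer.DataSourceWriter;
import org.apache.spark.sql.sources.v2.writer.WriterCommitMessage;

// The winning attempt's files are named in the commit messages; anything
// else found in the staging directory was written by a losing speculative
// attempt and is removed before publishing.
abstract class SpeculationAwareWriter implements DataSourceWriter {
    interface FileCommitMessage extends WriterCommitMessage {
        List<String> paths();
    }

    @Override
    public void commit(WriterCommitMessage[] messages) {
        Set<String> winning = new HashSet<>();
        for (WriterCommitMessage message : messages) {
            winning.addAll(((FileCommitMessage) message).paths());
        }
        for (String staged : listStagingDir()) {
            if (!winning.contains(staged)) {
                deleteStaged(staged);  // output of a losing speculative attempt
            }
        }
        publish(winning);
    }

    // Hypothetical sink-specific helpers.
    abstract List<String> listStagingDir();
    abstract void deleteStaged(String path);
    abstract void publish(Set<String> paths);
}
```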
default void abort(WriterCommitMessage[] messages)

Description copied from interface: DataSourceWriter

Aborts this writing job because some data writers have failed and keep failing when retried, the Spark job failed for an unknown reason, or DataSourceWriter.commit(WriterCommitMessage[]) failed.

If this method fails (by throwing an exception), the underlying data source may require manual cleanup.

Unless the abort is triggered by the failure of commit, the given messages may contain null slots: only some of the data writers may have committed before the abort happened, or some data writers committed but their commit messages hadn't reached the driver when the abort was triggered. So this is just a "best effort" for data sources to clean up the data left by data writers.

Specified by: abort in interface DataSourceWriter