Interface Batch
@Evolving
public interface Batch
A physical representation of a data source scan for batch queries. This interface is used to
 provide physical information, like how many partitions the scanned data has, and how to read
 records from the partitions.
- Since:
- 3.0.0
- 
Method SummaryModifier and TypeMethodDescriptionReturns a factory to create aPartitionReaderfor eachInputPartition.Returns a list ofinput partitions.
- 
Method Details- 
planInputPartitionsInputPartition[] planInputPartitions()Returns a list ofinput partitions. EachInputPartitionrepresents a data split that can be processed by one Spark task. The number of input partitions returned here is the same as the number of RDD partitions this scan outputs.If the Scansupports filter pushdown, this Batch is likely configured with a filter and is responsible for creating splits for that filter, which is not a full scan.This method will be called only once during a data source scan, to launch one Spark job. 
- 
createReaderFactoryPartitionReaderFactory createReaderFactory()Returns a factory to create aPartitionReaderfor eachInputPartition.
 
-