public class WholeTextFileRDD extends NewHadoopRDD<String,String>
Nested Class Summary

Nested classes/interfaces inherited from class org.apache.spark.rdd.NewHadoopRDD:
NewHadoopRDD.NewHadoopMapPartitionsWithSplitRDD<U,T> (analogous to MapPartitionsRDD, but passes in an InputSplit to the given function rather than the index of the partition), NewHadoopRDD.NewHadoopMapPartitionsWithSplitRDD$
Constructor Summary

WholeTextFileRDD(SparkContext sc,
                 Class<? extends WholeTextFileInputFormat> inputFormatClass,
                 Class<String> keyClass,
                 Class<String> valueClass,
                 org.apache.hadoop.conf.Configuration conf,
                 int minPartitions)
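This constructor is internal to Spark; user code normally obtains a WholeTextFileRDD indirectly through SparkContext.wholeTextFiles, which supplies WholeTextFileInputFormat, the key/value classes, and the Hadoop configuration. A minimal sketch, assuming a local master; the directory path and app name are placeholders:

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Sketch: reading whole files as (path, content) pairs.
// The directory path and app name below are placeholders.
val sc = new SparkContext(
  new SparkConf().setAppName("whole-text-files").setMaster("local[*]"))

// Each file under the directory becomes a single (filename, content)
// record, backed internally by a WholeTextFileRDD.
val files = sc.wholeTextFiles("hdfs:///data/docs", minPartitions = 4)

files.collect().foreach { case (path, content) =>
  println(s"$path -> ${content.length} chars")
}
```

Because each file is materialized as one record, this is intended for many small files rather than a few large ones.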
Method Summary

Modifier and Type | Method and Description
---|---
Partition[] | getPartitions() Implemented by subclasses to return the set of partitions in this RDD.
Methods inherited from class org.apache.spark.rdd.NewHadoopRDD:
compute, getConf, getPreferredLocations, mapPartitionsWithInputSplit
Methods inherited from class org.apache.spark.rdd.RDD:
aggregate, cache, cartesian, checkpoint, checkpointData, coalesce, collect, collect, collectPartitions, computeOrReadCheckpoint, conf, context, count, countApprox, countApproxDistinct, countApproxDistinct, countByValue, countByValueApprox, creationSite, dependencies, distinct, distinct, doCheckpoint, elementClassTag, filter, filterWith, first, flatMap, flatMapWith, fold, foreach, foreachPartition, foreachWith, getCheckpointFile, getCreationSite, getNarrowAncestors, getStorageLevel, glom, groupBy, groupBy, groupBy, id, intersection, intersection, intersection, isCheckpointed, iterator, keyBy, map, mapPartitions, mapPartitionsWithContext, mapPartitionsWithIndex, mapPartitionsWithSplit, mapWith, markCheckpointed, max, min, name, partitioner, partitions, persist, persist, pipe, pipe, pipe, preferredLocations, randomSplit, reduce, repartition, retag, retag, sample, saveAsObjectFile, saveAsTextFile, saveAsTextFile, setName, sortBy, sparkContext, subtract, subtract, subtract, take, takeOrdered, takeSample, toArray, toDebugString, toJavaRDD, toLocalIterator, top, toString, union, unpersist, zip, zipPartitions, zipPartitions, zipPartitions, zipPartitions, zipPartitions, zipPartitions, zipWithIndex, zipWithUniqueId
Methods inherited from interface org.apache.spark.mapreduce.SparkHadoopMapReduceUtil:
firstAvailableClass, newJobContext, newTaskAttemptContext, newTaskAttemptID
Methods inherited from interface org.apache.spark.Logging:
initializeIfNecessary, initializeLogging, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning
Constructor Detail

public WholeTextFileRDD(SparkContext sc,
                        Class<? extends WholeTextFileInputFormat> inputFormatClass,
                        Class<String> keyClass,
                        Class<String> valueClass,
                        org.apache.hadoop.conf.Configuration conf,
                        int minPartitions)
Method Detail

public Partition[] getPartitions()

Description copied from class: RDD
Implemented by subclasses to return the set of partitions in this RDD.

Overrides:
getPartitions in class NewHadoopRDD<String,String>
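getPartitions is not called directly by user code; it backs the RDD.partitions field. Because the input format combines many small files into shared splits, minPartitions acts as a hint rather than a guarantee, as this sketch (with a hypothetical path, on an existing SparkContext sc) illustrates:

```scala
// Sketch: minPartitions is a hint passed down to the input format;
// the actual partition count comes from getPartitions(), exposed via
// RDD.partitions. The path below is hypothetical.
val files = sc.wholeTextFiles("hdfs:///data/docs", minPartitions = 8)

// One partition per combined input split; the count may differ from
// the hint depending on the number and sizes of the input files.
println(files.partitions.length)
```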