spark.rdd

class CoGroupedRDD[K] extends RDD[(K, Seq[Seq[_]])] with Logging
class CoalescedRDD[T] extends RDD[T]

Coalesce the partitions of a parent RDD (prev) into fewer partitions, so that each partition of this RDD computes one or more of the parent ones.
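
The grouping idea can be sketched with plain Scala collections. This is only an illustration, not Spark's implementation: `CoalesceSketch`, `groupPartitions`, and `coalesce` are hypothetical names, and the contiguous, evenly sized grouping is an assumption made for the example.

```scala
object CoalesceSketch {
  // Assign parent partition indices 0 until numParents to numGroups
  // contiguous groups of roughly equal size (assumed grouping policy).
  def groupPartitions(numParents: Int, numGroups: Int): Seq[Seq[Int]] = {
    require(numGroups > 0 && numGroups <= numParents,
      "need between 1 and numParents groups")
    (0 until numGroups).map { i =>
      val start = (i.toLong * numParents / numGroups).toInt
      val end = ((i + 1).toLong * numParents / numGroups).toInt
      start until end
    }
  }

  // Each output partition concatenates the elements of the parent
  // partitions in its group, i.e. it "computes one or more of the parent ones".
  def coalesce[T](parents: Seq[Seq[T]], numGroups: Int): Seq[Seq[T]] =
    groupPartitions(parents.length, numGroups).map { group =>
      group.flatMap(i => parents(i))
    }
}
```

For instance, `CoalesceSketch.coalesce(Seq(Seq(1), Seq(2, 3), Seq(), Seq(4), Seq(5)), 2)` merges the first two parent partitions into one output partition and the remaining three into another.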
class HadoopRDD[K, V] extends RDD[(K, V)]

An RDD that reads a Hadoop dataset as specified by a JobConf.
class NewHadoopRDD[K, V] extends RDD[(K, V)] with HadoopMapReduceUtil
class PipedRDD[T] extends RDD[String]

An RDD that pipes the contents of each parent partition through an external command (printing them one per line) and returns the output as a collection of strings.
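
A rough sketch of the piping idea for a single partition, using `scala.sys.process` from the standard library rather than Spark itself. `PipeSketch` and `pipe` are hypothetical names, and the example assumes a Unix-like system where the chosen command (such as `sort`) is on the `PATH`.

```scala
import java.io.ByteArrayInputStream
import scala.sys.process._

object PipeSketch {
  // Write each element on its own line to the command's stdin and
  // return the command's stdout as a sequence of lines.
  def pipe(partition: Seq[String], command: Seq[String]): Seq[String] = {
    val bytes = partition.map(_ + "\n").mkString.getBytes("UTF-8")
    val stdin = new ByteArrayInputStream(bytes)
    // `!!` runs the process, throws on a non-zero exit code,
    // and returns everything the process printed to stdout.
    val output = (Process(command) #< stdin).!!
    output.linesIterator.toList
  }
}
```

Calling `PipeSketch.pipe(Seq("b", "a", "c"), Seq("sort"))` feeds three lines to `sort` and collects whatever lines it prints.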
class RepartitionShuffledRDD[K, V] extends ShuffledRDD[K, V, V]

Repartition a key-value pair RDD.
class SampledRDD[T] extends RDD[T]
class ShuffledAggregatedRDD[K, V, C] extends ShuffledRDD[K, V, C]

The RDD that results from a shuffle with (hash-based) aggregation.
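
Hash-based aggregation can be illustrated on an in-memory iterator of key-value pairs. This is a minimal sketch, not the class's actual code: `HashAggregateSketch`, `aggregate`, `createCombiner`, and `mergeValue` are names chosen for the example, with type parameters mirroring the `K`, `V`, `C` of the signature above.

```scala
import scala.collection.mutable

object HashAggregateSketch {
  // Fold every value of type V into a per-key combiner of type C,
  // keeping the combiners in a hash map keyed by K.
  def aggregate[K, V, C](records: Iterator[(K, V)],
                         createCombiner: V => C,
                         mergeValue: (C, V) => C): Map[K, C] = {
    val combiners = mutable.HashMap.empty[K, C]
    for ((k, v) <- records) {
      val updated = combiners.get(k) match {
        case Some(c) => mergeValue(c, v)   // key seen before: merge
        case None    => createCombiner(v)  // first value for this key
      }
      combiners(k) = updated
    }
    combiners.toMap
  }
}
```

Summing values per key, for example, would use `createCombiner = (v: Int) => v` and `mergeValue = (c: Int, v: Int) => c + v`.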
abstract class ShuffledRDD[K, V, C] extends RDD[(K, C)]

The RDD that results from a shuffle.
class ShuffledSortedRDD[K, V] extends RepartitionShuffledRDD[K, V]

A sort-based shuffle (that doesn't apply aggregation).
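
One way to picture a shuffle that sorts without aggregating is shown below on plain collections. The listing does not say how records are assigned to partitions, so the range partitioning by ascending upper bounds is an assumption of this sketch, and `SortShuffleSketch`, `partitionFor`, and `sortShuffle` are hypothetical names.

```scala
object SortShuffleSketch {
  // bounds holds ascending upper bounds: a key <= bounds(i) goes to
  // partition i, and keys above every bound go to the last partition.
  def partitionFor(key: Int, bounds: Seq[Int]): Int = {
    val idx = bounds.indexWhere(b => key <= b)
    if (idx == -1) bounds.length else idx
  }

  // Route each record to its partition, then sort each partition by key.
  // Values are carried along unchanged; nothing is combined.
  def sortShuffle[V](records: Seq[(Int, V)], bounds: Seq[Int]): Seq[Seq[(Int, V)]] = {
    val byPartition = records.groupBy { case (k, _) => partitionFor(k, bounds) }
    (0 to bounds.length).map { p =>
      byPartition.getOrElse(p, Seq.empty[(Int, V)]).sortBy(_._1)
    }
  }
}
```

With bounds `Seq(3, 6)`, keys up to 3 land in the first partition, keys from 4 to 6 in the second, and larger keys in the third, so reading the partitions in order yields the records sorted by key.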
class UnionRDD[T] extends RDD[T] with Serializable
