spark.api.java.JavaRDDLike

Abstract Value Members

implicit abstract val classManifest: ClassManifest[T]
abstract def rdd: RDD[T]
abstract def wrapRDD(rdd: RDD[T]): This

Concrete Value Members

final def !=(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def !=(arg0: Any): Boolean

Definition Classes
Any
final def ##(): Int

Definition Classes
AnyRef → Any
final def ==(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def ==(arg0: Any): Boolean

Definition Classes
Any
def aggregate[U](zeroValue: U)(seqOp: Function2[U, T, U], combOp: Function2[U, U, U]): U

Aggregate the elements of each partition, and then the results for all the partitions, using given combine functions and a neutral "zero value". This function can return a different result type, U, than the type of this RDD, T. Thus, we need one operation for merging a T into a U and one operation for merging two U's, as in scala.TraversableOnce. Both of these functions are allowed to modify and return their first argument instead of creating a new U to avoid memory allocation.
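The two-step shape of aggregate can be sketched in plain Java without any Spark dependency. This is only an illustrative model: the class AggregateSketch, its static aggregate method, and the use of nested lists to stand in for partitions are assumptions, not part of the API documented here. The sketch uses an immutable zero value; a mutable one would need a fresh copy per partition if seqOp modifies its first argument.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.BiFunction;

public class AggregateSketch {
    // Hypothetical model of aggregate: partitions are simulated as nested lists.
    public static <T, U> U aggregate(List<List<T>> partitions, U zeroValue,
                                     BiFunction<U, T, U> seqOp,
                                     BiFunction<U, U, U> combOp) {
        U result = zeroValue;
        for (List<T> partition : partitions) {
            U acc = zeroValue;                  // each partition starts from the zero value
            for (T elem : partition) {
                acc = seqOp.apply(acc, elem);   // merge a T into a U
            }
            result = combOp.apply(result, acc); // merge two U's
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<String>> partitions = Arrays.asList(
                Arrays.asList("a", "bb"), Arrays.asList("ccc"));
        // T = String, U = Integer: total character count over all partitions.
        int total = aggregate(partitions, 0, (acc, s) -> acc + s.length(), Integer::sum);
        System.out.println(total); // prints 6
    }
}
```

Here the result type (Integer) differs from the element type (String), which is the case that requires separate seqOp and combOp functions.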
final def asInstanceOf[T0]: T0

Definition Classes
Any
def cartesian[U](other: spark.api.java.JavaRDDLike[U, _]): JavaPairRDD[T, U]

Return the Cartesian product of this RDD and another one, that is, the RDD of all pairs of elements (a, b) where a is in this and b is in other.
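As a rough illustration of the pairing described above, the following plain-Java sketch builds all (a, b) pairs from two in-memory lists. The class CartesianSketch and the use of Map.Entry as a stand-in for a pair type are assumptions made for the example; the real method returns a JavaPairRDD.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;

public class CartesianSketch {
    // Hypothetical model: every element of self is paired with every element of other.
    public static <T, U> List<Map.Entry<T, U>> cartesian(List<T> self, List<U> other) {
        List<Map.Entry<T, U>> pairs = new ArrayList<>();
        for (T a : self) {
            for (U b : other) {
                pairs.add(new SimpleEntry<>(a, b));
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        List<Map.Entry<Integer, String>> pairs =
                cartesian(Arrays.asList(1, 2), Arrays.asList("x", "y", "z"));
        System.out.println(pairs.size()); // prints 6 (2 elements times 3 elements)
    }
}
```

The result size is the product of the two input sizes, so the output grows quickly for large inputs.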
def clone(): AnyRef

Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws()
def collect(): List[T]

Return a list that contains all of the elements in this RDD.
def context: SparkContext

The SparkContext that this RDD was created on.
def count(): Long

Return the number of elements in the RDD.
def countApprox(timeout: Long): PartialResult[BoundedDouble]

(Experimental) Approximate version of count() that returns a potentially incomplete result within a timeout, even if not all tasks have finished.
def countApprox(timeout: Long, confidence: Double): PartialResult[BoundedDouble]

(Experimental) Approximate version of count() that returns a potentially incomplete result within a timeout, even if not all tasks have finished.
def countByValue(): Map[T, Long]

Return the count of each unique value in this RDD as a map of (value, count) pairs. The final combine step happens locally on the master, equivalent to running a single reduce task.
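The (value, count) map can be illustrated with a small plain-Java sketch over an in-memory list. The class CountByValueSketch is hypothetical and does not use Spark; it only shows the shape of the result.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class CountByValueSketch {
    // Hypothetical model: tally how many times each distinct value occurs.
    public static <T> Map<T, Long> countByValue(List<T> elements) {
        Map<T, Long> counts = new HashMap<>();
        for (T e : elements) {
            counts.merge(e, 1L, Long::sum); // insert 1 or add 1 to the existing count
        }
        return counts;
    }

    public static void main(String[] args) {
        Map<String, Long> counts = countByValue(Arrays.asList("a", "b", "a"));
        System.out.println(counts.get("a")); // prints 2
        System.out.println(counts.get("b")); // prints 1
    }
}
```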
def countByValueApprox(timeout: Long): PartialResult[Map[T, BoundedDouble]]

(Experimental) Approximate version of countByValue().
def countByValueApprox(timeout: Long, confidence: Double): PartialResult[Map[T, BoundedDouble]]

(Experimental) Approximate version of countByValue().
final def eq(arg0: AnyRef): Boolean

Definition Classes
AnyRef
def equals(arg0: Any): Boolean

Definition Classes
AnyRef → Any
def finalize(): Unit

Attributes
protected[lang]
Definition Classes
AnyRef
Annotations
@throws()
def first(): T

Return the first element in this RDD.
def flatMap(f: DoubleFlatMapFunction[T]): JavaDoubleRDD

Return a new RDD by first applying a function to all elements of this RDD, and then flattening the results.
def flatMap[U](f: FlatMapFunction[T, U]): JavaRDD[U]

Return a new RDD by first applying a function to all elements of this RDD, and then flattening the results.
def flatMap[K, V](f: PairFlatMapFunction[T, K, V]): JavaPairRDD[K, V]

Definition Classes
PairFlatMapWorkaround
def fold(zeroValue: T)(f: Function2[T, T, T]): T

Aggregate the elements of each partition, and then the results for all the partitions, using a given associative function and a neutral "zero value". The function f(t1, t2) is allowed to modify t1 and return it as its result value to avoid object allocation; however, it should not modify t2.
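Why the zero value should be neutral can be seen in a plain-Java model. The sketch below assumes, following the description above, that the zero value seeds each partition and also the final merge of partition results. The class FoldSketch and the nested-list partitions are hypothetical stand-ins, not the library's implementation.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.BinaryOperator;

public class FoldSketch {
    // Hypothetical model of fold: the zero value is used once per partition
    // and once more when the per-partition results are merged.
    public static <T> T fold(List<List<T>> partitions, T zeroValue, BinaryOperator<T> f) {
        T result = zeroValue;
        for (List<T> partition : partitions) {
            T acc = zeroValue;
            for (T elem : partition) {
                acc = f.apply(acc, elem);
            }
            result = f.apply(result, acc);
        }
        return result;
    }

    public static void main(String[] args) {
        List<List<Integer>> partitions = Arrays.asList(
                Arrays.asList(1, 2), Arrays.asList(3, 4));
        System.out.println(fold(partitions, 0, Integer::sum)); // prints 10
        // A non-neutral zero is counted once per partition plus once in the merge.
        System.out.println(fold(partitions, 1, Integer::sum)); // prints 13
    }
}
```

Under this model a non-neutral zero makes the result depend on the number of partitions, which is why 0 is the appropriate zero for addition.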
def foreach(f: VoidFunction[T]): Unit

Applies a function f to all elements of this RDD.
final def getClass(): java.lang.Class[_]

Definition Classes
AnyRef → Any
def getStorageLevel: StorageLevel

Get the RDD's current storage level, or StorageLevel.NONE if none is set.
def glom(): JavaRDD[List[T]]

Return an RDD created by coalescing all elements within each partition into a list.
def groupBy[K](f: Function[T, K], numSplits: Int): JavaPairRDD[K, List[T]]

Return an RDD of grouped elements. Each group consists of a key and a sequence of elements mapping to that key.
def groupBy[K](f: Function[T, K]): JavaPairRDD[K, List[T]]

Return an RDD of grouped elements. Each group consists of a key and a sequence of elements mapping to that key.
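The grouping behaviour (a key plus the sequence of elements that map to it) can be modelled in plain Java as follows. The class GroupBySketch is hypothetical and works on an in-memory list rather than an RDD; a LinkedHashMap is used only to keep the printed order predictable.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Function;

public class GroupBySketch {
    // Hypothetical model: apply f to each element and collect elements per key.
    public static <T, K> Map<K, List<T>> groupBy(List<T> elements, Function<T, K> f) {
        Map<K, List<T>> groups = new LinkedHashMap<>();
        for (T e : elements) {
            groups.computeIfAbsent(f.apply(e), k -> new ArrayList<>()).add(e);
        }
        return groups;
    }

    public static void main(String[] args) {
        Map<Integer, List<String>> byLength =
                groupBy(Arrays.asList("a", "bb", "c"), String::length);
        System.out.println(byLength); // prints {1=[a, c], 2=[bb]}
    }
}
```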
def hashCode(): Int

Definition Classes
AnyRef → Any
def id: Int

A unique ID for this RDD (within its SparkContext).
final def isInstanceOf[T0]: Boolean

Definition Classes
Any
def iterator(split: Split): Iterator[T]

Internal method to this RDD; will read from cache if applicable, or otherwise compute it. This should not be called by users directly, but is available for implementors of custom subclasses of RDD.
def map[K2, V2](f: PairFunction[T, K2, V2]): JavaPairRDD[K2, V2]

Return a new RDD by applying a function to all elements of this RDD.
def map[R](f: DoubleFunction[T]): JavaDoubleRDD

Return a new RDD by applying a function to all elements of this RDD.
def map[R](f: Function[T, R]): JavaRDD[R]

Return a new RDD by applying a function to all elements of this RDD.
def mapPartitions[K, V](f: PairFlatMapFunction[Iterator[T], K, V]): JavaPairRDD[K, V]

Return a new RDD by applying a function to each partition of this RDD.
def mapPartitions(f: DoubleFlatMapFunction[Iterator[T]]): JavaDoubleRDD

Return a new RDD by applying a function to each partition of this RDD.
def mapPartitions[U](f: FlatMapFunction[Iterator[T], U]): JavaRDD[U]

Return a new RDD by applying a function to each partition of this RDD.
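The difference from map is that the function receives an iterator over a whole partition rather than a single element. A minimal plain-Java model is sketched below; the class MapPartitionsSketch, the partitionSums helper, and the use of java.util.function.Function in place of FlatMapFunction are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.Iterator;
import java.util.List;
import java.util.function.Function;

public class MapPartitionsSketch {
    // Hypothetical model: f is called once per partition with that partition's iterator.
    public static <T, U> List<List<U>> mapPartitions(
            List<List<T>> partitions, Function<Iterator<T>, Iterable<U>> f) {
        List<List<U>> out = new ArrayList<>();
        for (List<T> partition : partitions) {
            List<U> mapped = new ArrayList<>();
            for (U u : f.apply(partition.iterator())) {
                mapped.add(u);
            }
            out.add(mapped);
        }
        return out;
    }

    // Example use: replace each partition by a single element, its sum.
    public static List<List<Integer>> partitionSums(List<List<Integer>> partitions) {
        Function<Iterator<Integer>, Iterable<Integer>> sum = it -> {
            int s = 0;
            while (it.hasNext()) {
                s += it.next();
            }
            return Collections.singletonList(s);
        };
        return mapPartitions(partitions, sum);
    }

    public static void main(String[] args) {
        List<List<Integer>> partitions = Arrays.asList(
                Arrays.asList(1, 2), Arrays.asList(3, 4, 5));
        System.out.println(partitionSums(partitions)); // prints [[3], [12]]
    }
}
```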
final def ne(arg0: AnyRef): Boolean

Definition Classes
AnyRef
final def notify(): Unit

Definition Classes
AnyRef
final def notifyAll(): Unit

Definition Classes
AnyRef
def pipe(command: List[String], env: Map[String, String]): JavaRDD[String]

Return an RDD created by piping elements to a forked external process.
def pipe(command: List[String]): JavaRDD[String]

Return an RDD created by piping elements to a forked external process.
def pipe(command: String): JavaRDD[String]

Return an RDD created by piping elements to a forked external process.
def reduce(f: Function2[T, T, T]): T

Reduces the elements of this RDD using the specified associative binary operator.
def saveAsObjectFile(path: String): Unit

Save this RDD as a SequenceFile of serialized objects.
def saveAsTextFile(path: String): Unit

Save this RDD as a text file, using string representations of elements.
def splits: List[Split]

Set of partitions in this RDD.
final def synchronized[T0](arg0: ⇒ T0): T0

Definition Classes
AnyRef
def take(num: Int): List[T]

Take the first num elements of the RDD. This currently scans the partitions one by one, so it will be slow if many partitions are needed. In that case, use collect() to get the whole RDD instead.
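The partition-by-partition scan can be pictured with a short plain-Java sketch. The class TakeSketch and the nested-list partitions are hypothetical; the point is only that partitions are visited in order and the scan stops as soon as num elements have been gathered.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TakeSketch {
    // Hypothetical model: walk partitions in order, stopping once num elements are collected.
    public static <T> List<T> take(List<List<T>> partitions, int num) {
        List<T> taken = new ArrayList<>();
        for (List<T> partition : partitions) {
            for (T elem : partition) {
                if (taken.size() >= num) {
                    return taken;
                }
                taken.add(elem);
            }
        }
        return taken; // fewer than num elements were available
    }

    public static void main(String[] args) {
        List<List<Integer>> partitions = Arrays.asList(
                Arrays.asList(1, 2), Arrays.asList(3), Arrays.asList(4, 5));
        System.out.println(take(partitions, 4)); // prints [1, 2, 3, 4]
    }
}
```

In this example three partitions must be visited to collect four elements, which illustrates why a large num spread over many partitions is slow.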
def takeSample(withReplacement: Boolean, num: Int, seed: Int): List[T]
def toString(): String

Definition Classes
AnyRef → Any
final def wait(): Unit

Definition Classes
AnyRef
Annotations
@throws()
final def wait(arg0: Long, arg1: Int): Unit

Definition Classes
AnyRef
Annotations
@throws()
final def wait(arg0: Long): Unit

Definition Classes
AnyRef
Annotations
@throws()

JavaRDDLike

trait JavaRDDLike[T, This <: JavaRDDLike[T, This]] extends PairFlatMapWorkaround[T]

Inherited from PairFlatMapWorkaround[T]

Inherited from Serializable

Inherited from AnyRef

Inherited from Any