org.apache.spark.util.random

RandomSampler

trait RandomSampler[T, U] extends Pseudorandom with Cloneable with Serializable

Developer API

A pseudorandom sampler. The sampled item type U may differ from the item type T; for example, a sampler might attach weights for stratified sampling or importance sampling. Implementations should only use transformations that are tied to the sampler and cannot be applied after sampling.

T: item type

U: sampled item type
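
As an illustration of a sampled item type that differs from the item type, the following sketch pairs every kept item with an inverse-probability weight, so U is (T, Double). The trait `SamplerSketch` and the class `WeightedBernoulli` are hypothetical stand-ins that only mirror the signatures documented on this page; they are not Spark's implementation.

```scala
import java.util.Random

object WeightedSamplerSketch {
  // Hypothetical stand-in mirroring the documented signatures; not Spark's trait.
  trait SamplerSketch[T, U] {
    def setSeed(seed: Long): Unit
    def sample(items: Iterator[T]): Iterator[U]
  }

  // Keeps each item with probability `fraction` and pairs it with the
  // inverse-probability weight 1 / fraction, so U = (T, Double).
  class WeightedBernoulli[T](fraction: Double) extends SamplerSketch[T, (T, Double)] {
    require(fraction > 0.0 && fraction <= 1.0, "fraction must be in (0, 1]")
    private val rng = new Random()

    override def setSeed(seed: Long): Unit = rng.setSeed(seed)

    override def sample(items: Iterator[T]): Iterator[(T, Double)] =
      items
        .filter(_ => rng.nextDouble() < fraction)
        .map(item => (item, 1.0 / fraction))
  }

  def main(args: Array[String]): Unit = {
    val sampler = new WeightedBernoulli[Int](0.5)
    sampler.setSeed(42L)
    sampler.sample(Iterator.range(0, 10)).foreach(println)
  }
}
```

The weight depends on the sampler's own `fraction`, which is why this transformation is tied to the sampler rather than applied as a separate step after sampling.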

Annotations
@DeveloperApi()
Source
RandomSampler.scala
Linear Supertypes
scala.Serializable, java.io.Serializable, scala.Cloneable, java.lang.Cloneable, Pseudorandom, AnyRef, Any

Abstract Value Members

  1. abstract def sample(): Int

    Decides whether to sample the next item. Returns the number of times the next item will be sampled, or 0 if it is not sampled.

  2. abstract def setSeed(seed: Long): Unit

    Sets the random seed.

    Definition Classes
    Pseudorandom
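
The two abstract members above can be sketched with a Bernoulli-style sampler whose `sample()` returns 1 when the next item is kept and 0 otherwise. `CountSampler`, `BernoulliCount`, and `draws` are hypothetical names introduced for this example, and the sketch assumes `java.util.Random` as the generator; it does not reproduce any of Spark's concrete samplers.

```scala
import java.util.Random

object SampleCountSketch {
  // Hypothetical stand-in for the two abstract members documented above.
  trait CountSampler {
    def setSeed(seed: Long): Unit
    def sample(): Int
  }

  // Returns 1 with probability `fraction`, otherwise 0.
  class BernoulliCount(fraction: Double) extends CountSampler {
    private val rng = new Random()

    override def setSeed(seed: Long): Unit = rng.setSeed(seed)

    override def sample(): Int = if (rng.nextDouble() < fraction) 1 else 0
  }

  // Seeds a fresh sampler and records `n` consecutive decisions.
  def draws(seed: Long, n: Int): List[Int] = {
    val sampler = new BernoulliCount(0.3)
    sampler.setSeed(seed)
    List.fill(n)(sampler.sample())
  }

  def main(args: Array[String]): Unit = {
    println(draws(7L, 10))
    // Re-seeding with the same value replays the same decisions.
    println(draws(7L, 10) == draws(7L, 10))
  }
}
```

Setting the seed before sampling makes a run repeatable, which is useful when a sample has to be recomputed or compared across runs.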

Concrete Value Members

  1. def clone(): RandomSampler[T, U]

    Returns a copy of the RandomSampler object.

    Definition Classes
    RandomSampler → AnyRef
  2. def sample(items: Iterator[T]): Iterator[U]

    Takes a random sample of the given items.
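
One way the iterator overload could relate to the per-item `sample(): Int` is to emit each item as many times as the returned count. The sketch below assumes that wiring; `ItemSampler` and `UpToTwice` are hypothetical names, U is taken to be the same as T for brevity, and the default `clone()` that throws is an assumption of this example rather than documented behavior.

```scala
import java.util.Random

object SampleItemsSketch {
  // Hypothetical stand-in mirroring the documented members; not Spark's source.
  trait ItemSampler[T] extends Cloneable {
    def setSeed(seed: Long): Unit
    def sample(): Int

    // Assumed wiring: repeat each item sample() times (0 drops the item).
    def sample(items: Iterator[T]): Iterator[T] =
      items.flatMap(item => Iterator.fill(sample())(item))

    override def clone(): ItemSampler[T] =
      throw new UnsupportedOperationException("clone() is not implemented")
  }

  // Picks a count of 0, 1, or 2 uniformly, so an item can appear more than
  // once, as it could when sampling with replacement.
  class UpToTwice[T] extends ItemSampler[T] {
    private val rng = new Random()

    override def setSeed(seed: Long): Unit = rng.setSeed(seed)

    override def sample(): Int = rng.nextInt(3)

    // Each copy gets its own generator, so copies do not share state.
    override def clone(): UpToTwice[T] = new UpToTwice[T]
  }

  def main(args: Array[String]): Unit = {
    val base = new UpToTwice[String]
    val copy = base.clone()
    copy.setSeed(11L)
    println(copy.sample(Iterator("a", "b", "c")).toList)
  }
}
```

Cloning before seeding lets separate consumers, for example one per data partition, draw from independent generators instead of contending for a shared one.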