Interface RandomSampler<T,U>

Type Parameters:
T - item type
U - sampled item type
All Superinterfaces:
Cloneable, Pseudorandom, Serializable
All Known Implementing Classes:
BernoulliCellSampler, BernoulliSampler, PoissonSampler

public interface RandomSampler<T,U> extends Pseudorandom, Cloneable, Serializable
:: DeveloperApi :: A pseudorandom sampler. It is possible to change the sampled item type. For example, we might want to add weights for stratified sampling or importance sampling. Should only use transformations that are tied to the sampler and cannot be applied after sampling.

  • Method Summary

    Modifier and Type
    Method
    Description
    return a copy of the RandomSampler object
    int
    Whether to sample the next item or not.
    scala.collection.Iterator<U>
    sample(scala.collection.Iterator<T> items)
    take a random sample

    Methods inherited from interface org.apache.spark.util.random.Pseudorandom

    setSeed
  • Method Details

    • sample

      scala.collection.Iterator<U> sample(scala.collection.Iterator<T> items)
      take a random sample
    • sample

      int sample()
      Whether to sample the next item or not. Return how many times the next item will be sampled. Return 0 if it is not sampled.
      Returns:
      (undocumented)
    • clone

      RandomSampler<T,U> clone()
      return a copy of the RandomSampler object