Randomly splits this DataFrame with the provided weights.
New in version 1.4.0.
- weights : list
List of doubles as weights with which to split the DataFrame. Weights will be normalized if they don’t sum up to 1.0.
- seed : int, optional
The seed for sampling.
>>> splits = df4.randomSplit([1.0, 2.0], 24)
>>> splits[0].count()
2
>>> splits[1].count()
2
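
The weight normalization described for the weights parameter can be illustrated in plain Python. This is a minimal sketch under stated assumptions, not Spark's own code: the helper names `normalize_weights` and `cumulative_bounds` are hypothetical, and the sketch only shows how weights such as `[1.0, 2.0]` could be turned into fractions and adjacent sampling intervals on [0, 1).

```python
from itertools import accumulate


def normalize_weights(weights):
    """Scale weights so they sum to 1.0 (hypothetical helper, not a Spark API)."""
    total = sum(weights)
    if total <= 0 or any(w < 0 for w in weights):
        raise ValueError("weights must be non-negative with a positive sum")
    return [w / total for w in weights]


def cumulative_bounds(weights):
    """Return one (low, high) interval per split, covering [0, 1) without gaps."""
    normalized = normalize_weights(weights)
    edges = [0.0] + list(accumulate(normalized))
    return list(zip(edges[:-1], edges[1:]))


# Weights [1.0, 2.0] sum to 3.0, so they are rescaled to thirds.
fractions = normalize_weights([1.0, 2.0])
bounds = cumulative_bounds([1.0, 2.0])
print(fractions)
print(bounds)
```

Because the split is random, the fractions describe expected proportions only. On a small DataFrame the row counts of the resulting splits need not match the weight ratio exactly.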