public class FPGrowth
extends Object
implements org.apache.spark.internal.Logging, scala.Serializable
param: minSupport the minimal support level of the frequent pattern, any pattern that appears more than (minSupport * size-of-the-dataset) times will be output param: numPartitions number of partitions used by parallel FP-growth
Modifier and Type | Class and Description |
---|---|
static class |
FPGrowth.FreqItemset<Item>
Frequent itemset.
|
Constructor and Description |
---|
FPGrowth()
Constructs a default instance with default parameters {minSupport:
0.3 , numPartitions: same
as the input data}. |
Modifier and Type | Method and Description |
---|---|
<Item,Basket extends Iterable<Item>> |
run(JavaRDD<Basket> data)
Java-friendly version of
run . |
<Item> FPGrowthModel<Item> |
run(RDD<Object> data,
scala.reflect.ClassTag<Item> evidence$4)
Computes an FP-Growth model that contains frequent itemsets.
|
FPGrowth |
setMinSupport(double minSupport)
Sets the minimal support level (default:
0.3 ). |
FPGrowth |
setNumPartitions(int numPartitions)
Sets the number of partitions used by parallel FP-growth (default: same as input data).
|
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
$init$, initializeForcefully, initializeLogIfNecessary, initializeLogIfNecessary, initializeLogIfNecessary$default$2, initLock, isTraceEnabled, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning, org$apache$spark$internal$Logging$$log__$eq, org$apache$spark$internal$Logging$$log_, uninitialize
public FPGrowth()
0.3
, numPartitions: same
as the input data}.
public FPGrowth setMinSupport(double minSupport)
0.3
).
minSupport
- (undocumented)public FPGrowth setNumPartitions(int numPartitions)
numPartitions
- (undocumented)public <Item> FPGrowthModel<Item> run(RDD<Object> data, scala.reflect.ClassTag<Item> evidence$4)
data
- input data set, each element contains a transactionevidence$4
- (undocumented)FPGrowthModel
public <Item,Basket extends Iterable<Item>> FPGrowthModel<Item> run(JavaRDD<Basket> data)
run
.data
- (undocumented)