public abstract class LDAModel extends Object implements Saveable
This abstraction permits for different underlying representations, including local and distributed data structures.
Modifier and Type | Method and Description |
---|---|
scala.Tuple2<int[],double[]>[] |
describeTopics()
Return the topics described by weighted terms.
|
abstract scala.Tuple2<int[],double[]>[] |
describeTopics(int maxTermsPerTopic)
Return the topics described by weighted terms.
|
abstract Vector |
docConcentration()
Concentration parameter (commonly named "alpha") for the prior placed on documents'
distributions over topics ("theta").
|
abstract int |
k()
Number of topics
|
abstract double |
topicConcentration()
Concentration parameter (commonly named "beta" or "eta") for the prior placed on topics'
distributions over terms.
|
abstract Matrix |
topicsMatrix()
Inferred topics, where each topic is represented by a distribution over terms.
|
abstract int |
vocabSize()
Vocabulary size (number of terms or terms in the vocabulary)
|
public abstract scala.Tuple2<int[],double[]>[] describeTopics(int maxTermsPerTopic)
maxTermsPerTopic
- Maximum number of terms to collect for each topic.public scala.Tuple2<int[],double[]>[] describeTopics()
WARNING: If vocabSize and k are large, this can return a large object!
public abstract Vector docConcentration()
This is the parameter to a Dirichlet distribution.
public abstract int k()
public abstract double topicConcentration()
This is the parameter to a symmetric Dirichlet distribution.
public abstract Matrix topicsMatrix()
public abstract int vocabSize()