COSINE
DistanceMeasure
CREATED_SPARK_VERSION
HiveExternalCatalog
CalendarIntervalType
types
CanonicalRandomVertexCut
PartitionStrategy
Catalog
catalog
CatalystScan
sources
Categorical
FeatureType
CategoricalSplit
tree
CharType
types
CheckRuleBuilder
SparkSessionExtensions
ChiSqSelector
feature feature
ChiSqSelectorModel
feature feature
ChiSqTestResult
test
ChiSquareTest
stat
Classification
Algo
ClassificationModel
classification classification
Classifier
classification
ClusteredDistribution
partitioning
ClusteringEvaluator
evaluation
ClusteringSummary
clustering
CoGroupFunction
function
CoGroupedRDD
rdd
CodegenMetrics
source
CollectionAccumulator
util
Column
sql catalog
ColumnName
sql
ColumnVector
vectorized
ColumnarArray
vectorized
ColumnarBatch
vectorized
ColumnarMap
vectorized
ColumnarRow
vectorized
ComplexFutureAction
spark
CompressionCodec
io
Conf
SVDPlusPlus
ConnectedComponents
lib
ConnectionFactory
JdbcRDD
ConstantInputDStream
dstream
Continuous
FeatureType
ContinuousInputPartition
reader
ContinuousInputPartitionReader
streaming
ContinuousReadSupport
v2
ContinuousReader
streaming
ContinuousSplit
tree
CoordinateMatrix
distributed
Correlation
stat
CountMinSketch
sketch
CountVectorizer
feature
CountVectorizerModel
feature
CreatableRelationProvider
sources
CreateHiveTableAsSelectCommand
execution
CrossValidator
tuning
CrossValidatorModel
tuning
CrossValidatorModelWriter
CrossValidatorModel
cache
JavaDoubleRDD JavaPairRDD JavaRDD Graph EdgeRDDImpl GraphImpl VertexRDDImpl BlockMatrix RDD Dataset JavaDStream JavaPairDStream DStream
cacheSize
SparkExecutorInfo
cacheTable
SQLContext Catalog
calculate
Entropy Gini Impurity Variance
call
CoGroupFunction DoubleFlatMapFunction DoubleFunction FilterFunction FlatMapFunction FlatMapFunction2 FlatMapGroupsFunction FlatMapGroupsWithStateFunction ForeachFunction ForeachPartitionFunction Function Function0 Function2 Function3 Function4 MapFunction MapGroupsFunction MapGroupsWithStateFunction MapPartitionsFunction PairFlatMapFunction PairFunction ReduceFunction VoidFunction VoidFunction2 UDF0 UDF1 UDF10 UDF11 UDF12 UDF13 UDF14 UDF15 UDF16 UDF17 UDF18 UDF19 UDF2 UDF20 UDF21 UDF22 UDF3 UDF4 UDF5 UDF6 UDF7 UDF8 UDF9
callSite
RDDInfo
callUDF
functions
canEqual
ExecutorInfo MutablePair
canHandle
JdbcDialect
canWrite
DataType
cancel
ComplexFutureAction FutureAction SimpleFutureAction
cancelAllJobs
SparkContext JavaSparkContext
cancelJob
SparkContext
cancelJobGroup
SparkContext JavaSparkContext
cancelStage
SparkContext
cartesian
JavaRDDLike RDD
caseSensitive
StopWordsRemover
cast
Column
catalog
SparkSession sql HiveSessionStateBuilder
catalogString
ArrayType DataType MapType StructType
categoricalCols
FeatureHasher
categoricalFeaturesInfo
Strategy
categories
Split
categoryMaps
VectorIndexerModel
categorySizes
OneHotEncoderModel
cause
AnalysisException StreamingQueryException
cbrt
functions
ceil
functions Decimal
changePrecision
Decimal
checkSingleVsMultiColumnParams
ParamValidators
checkpoint
JavaRDDLike Graph EdgeRDDImpl GraphImpl VertexRDDImpl HadoopRDD RDD Dataset StreamingContext JavaDStreamLike JavaStreamingContext DStream
checkpointAppName
Builder
checkpointFile
SparkContext JavaSparkContext
checkpointInterval
HasCheckpointInterval Strategy Builder
chiSqTest
Statistics
child
ScriptTransformationExec Not
className
ExceptionFailure Function
classTag
JavaDoubleRDD JavaPairRDD JavaRDD JavaRDDLike JavaDStream JavaDStreamLike JavaInputDStream JavaPairDStream JavaReceiverInputDStream
classification
ml mllib
classpathEntries
ApplicationEnvironmentInfo
clean
WriteAheadLog
clear
Params ExecutionListenerManager
clearActive
SQLContext
clearActiveSession
SparkSession
clearCache
SQLContext Catalog
clearCallSite
SparkContext JavaSparkContext
clearDefaultSession
SparkSession
clearDependencies
CoGroupedRDD RDD ShuffledRDD UnionRDD
clearJobGroup
SparkContext JavaSparkContext
clearThreshold
LogisticRegressionModel SVMModel
clone
SparkConf ExperimentalMethods Decimal ExecutionListenerManager StorageLevel BernoulliCellSampler BernoulliSampler PoissonSampler RandomSampler
cloneComplement
BernoulliCellSampler
close
JavaSparkContext NioBufferedFileInputStream ReadAheadInputStream DeserializationStream SerializationStream ForeachWriter SparkSession HiveOutputWriter ArrowColumnVector ColumnVector ColumnarBatch TimeTrackingOutputStream JavaStreamingContext WriteAheadLog InMemoryStore LevelDB
closeWriter
HadoopWriteConfigUtil
closeableIterator
KVStoreView
closureSerializer
SparkEnv
cloudWatchCredentials
Builder
cls
ObjectType
clsTag
Encoder
cluster
ClusteringSummary Assignment scheduler
clusterCenters
BisectingKMeansModel KMeansModel BisectingKMeansModel KMeansModel StreamingKMeansModel
clusterSizes
ClusteringSummary
clusterWeights
StreamingKMeansModel
clustering
ml mllib
coalesce
JavaDoubleRDD JavaPairRDD JavaRDD PartitionCoalescer RDD Dataset functions
coefficientMatrix
LogisticRegressionModel
coefficientStandardErrors
GeneralizedLinearRegressionTrainingSummary LinearRegressionSummary
coefficients
LinearSVCModel LogisticRegressionModel AFTSurvivalRegressionModel GeneralizedLinearRegressionModel LinearRegressionModel
cogroup
JavaPairRDD PairRDDFunctions KeyValueGroupedDataset JavaPairDStream PairDStreamFunctions
col
Dataset functions
colIter
DenseMatrix Matrix SparseMatrix DenseMatrix Matrix SparseMatrix
colPtrs
SparseMatrix SparseMatrix
colRegex
Dataset
colStats
Statistics
collect
JavaRDDLike EdgeRDDImpl RDD Dataset
collectAsList
Dataset
collectAsMap
JavaPairRDD PairRDDFunctions
collectAsync
JavaRDDLike AsyncRDDActions
collectEdges
GraphOps
collectNeighborIds
GraphOps
collectNeighbors
GraphOps
collectPartitions
JavaRDDLike
collectSubModels
HasCollectSubModels
collect_list
functions
collect_set
functions
collectionAccumulator
SparkContext
colsPerBlock
BlockMatrix
column
functions ColumnarBatch
columnSchema
ImageSchema
columnSimilarities
IndexedRowMatrix RowMatrix
columns
Dataset
combineByKey
JavaPairRDD PairRDDFunctions JavaPairDStream PairDStreamFunctions
combineByKeyWithClassTag
PairRDDFunctions
combineCombinersByKey
Aggregator
combineValuesByKey
Aggregator
commit
ContinuousReader MicroBatchReader DataSourceWriter DataWriter StreamWriter
commitJob
FileCommitProtocol HadoopMapReduceCommitProtocol
commitTask
FileCommitProtocol HadoopMapReduceCommitProtocol SparkHadoopMapRedUtil
compare
Decimal RDDInfo
compileValue
JdbcDialect
completed
ApplicationAttemptInfo
completedTasks
ExecutorSummary
completionTime
StageInfo JobData StageData
compressed
Matrix Vector Vector
compressedColMajor
Matrix
compressedInputStream
CompressionCodec LZ4CompressionCodec LZFCompressionCodec SnappyCompressionCodec ZStdCompressionCodec
compressedOutputStream
CompressionCodec LZ4CompressionCodec LZFCompressionCodec SnappyCompressionCodec ZStdCompressionCodec
compressedRowMajor
Matrix
compute
EdgeRDD VertexRDD Gradient HingeGradient L1Updater LeastSquaresGradient LogisticGradient SimpleUpdater SquaredL2Updater Updater CoGroupedRDD HadoopRDD JdbcRDD NewHadoopRDD PartitionPruningRDD RDD ShuffledRDD UnionRDD JavaDStream JavaPairDStream ConstantInputDStream DStream ReceiverInputDStream
computeColumnSummaryStatistics
RowMatrix
computeCost
BisectingKMeansModel KMeansModel BisectingKMeansModel KMeansModel
computeCovariance
RowMatrix
computeError
Loss
computeGramianMatrix
IndexedRowMatrix RowMatrix
computeInitialPredictionAndError
GradientBoostedTreesModel
computePreferredLocations
InputFormatInfo
computePrincipalComponents
RowMatrix
computePrincipalComponentsAndExplainedVariance
RowMatrix
computeSVD
IndexedRowMatrix RowMatrix
concat
functions
concat_ws
functions
conf
SparkEnv SparkSession RelationConversions
confidence
Rule BoundedDouble CountMinSketch
config
Builder
configuration
tree InputFormatInfo
confusionMatrix
MulticlassMetrics
connectedComponents
GraphOps
consequent
Rule
contains
SparkConf ParamMap Column RuntimeConfig Metadata
containsDelimiters
HiveOptions
containsNull
ArrayType
context
InterruptibleIterator JavaRDDLike GeneralMLWriter MLReader MLWriter RDD JavaDStreamLike DStream
conv
functions
convertMatrixColumnsFromML
MLUtils
convertMatrixColumnsToML
MLUtils
convertToCanonicalEdges
GraphOps
convertVectorColumnsFromML
MLUtils
convertVectorColumnsToML
MLUtils
copy
Estimator Model Pipeline PipelineModel PipelineStage Predictor Transformer UnaryTransformer DecisionTreeClassificationModel DecisionTreeClassifier GBTClassificationModel GBTClassifier LinearSVC LinearSVCModel LogisticRegression LogisticRegressionModel MultilayerPerceptronClassificationModel MultilayerPerceptronClassifier NaiveBayes NaiveBayesModel OneVsRest OneVsRestModel RandomForestClassificationModel RandomForestClassifier BisectingKMeans BisectingKMeansModel DistributedLDAModel GaussianMixture GaussianMixtureModel KMeans KMeansModel LDA LocalLDAModel PowerIterationClustering BinaryClassificationEvaluator ClusteringEvaluator Evaluator MulticlassClassificationEvaluator RegressionEvaluator Binarizer BucketedRandomProjectionLSH BucketedRandomProjectionLSHModel Bucketizer ChiSqSelector ChiSqSelectorModel CountVectorizer CountVectorizerModel FeatureHasher HashingTF IDF IDFModel Imputer ImputerModel IndexToString Interaction MaxAbsScaler MaxAbsScalerModel MinHashLSH MinHashLSHModel MinMaxScaler MinMaxScalerModel OneHotEncoder OneHotEncoderEstimator OneHotEncoderModel PCA PCAModel PolynomialExpansion QuantileDiscretizer RFormula RFormulaModel RegexTokenizer SQLTransformer StandardScaler StandardScalerModel StopWordsRemover StringIndexer StringIndexerModel Tokenizer VectorAssembler VectorIndexer VectorIndexerModel VectorSizeHint VectorSlicer Word2Vec Word2VecModel FPGrowth FPGrowthModel PrefixSpan DenseMatrix DenseVector Matrix SparseMatrix SparseVector Vector ParamMap Params ALS ALSModel AFTSurvivalRegression AFTSurvivalRegressionModel DecisionTreeRegressionModel DecisionTreeRegressor GBTRegressionModel GBTRegressor GeneralizedLinearRegression GeneralizedLinearRegressionModel IsotonicRegression IsotonicRegressionModel LinearRegression LinearRegressionModel RandomForestRegressionModel RandomForestRegressor CrossValidator CrossValidatorModel TrainValidationSplit TrainValidationSplitModel DenseMatrix DenseVector Matrix SparseMatrix SparseVector Vector ExponentialGenerator GammaGenerator LogNormalGenerator PoissonGenerator RandomDataGenerator StandardNormalGenerator UniformGenerator WeibullGenerator Strategy Row ColumnarArray ColumnarMap ColumnarRow AccumulatorV2 CollectionAccumulator DoubleAccumulator LegacyAccumulatorWrapper LongAccumulator StatCounter KVIndex
copyAndReset
AccumulatorV2 CollectionAccumulator
copyValues
Params
coresGranted
ApplicationInfo
coresPerExecutor
ApplicationInfo
corr
Correlation Statistics DataFrameStatFunctions functions
cos
functions
cosh
functions
count
JavaRDDLike EdgeRDDImpl VertexRDDImpl Summarizer MultivariateOnlineSummarizer MultivariateStatisticalSummary RDD Dataset KeyValueGroupedDataset RelationalGroupedDataset typed functions JavaDStreamLike DStream DoubleAccumulator LongAccumulator StatCounter InMemoryStore KVStore LevelDB
countApprox
JavaRDDLike RDD
countApproxDistinct
JavaRDDLike RDD
countApproxDistinctByKey
JavaPairRDD PairRDDFunctions
countAsync
JavaRDDLike AsyncRDDActions
countByKey
JavaPairRDD PairRDDFunctions
countByKeyApprox
JavaPairRDD PairRDDFunctions
countByValue
JavaRDDLike RDD JavaDStreamLike DStream
countByValueAndWindow
JavaDStreamLike DStream
countByValueApprox
JavaRDDLike RDD
countByWindow
JavaDStreamLike DStream
countDistinct
functions
countMinSketch
DataFrameStatFunctions
countTowardsTaskFailures
ExecutorLostFailure FetchFailed TaskCommitDenied TaskFailedReason TaskKilled
cov
MultivariateGaussian DataFrameStatFunctions
covar_pop
functions
covar_samp
functions
crc32
functions
create
JdbcRDD PartitionPruningRDD ProcessingTime RateEstimator
createCombiner
Aggregator
createCommitter
HadoopWriteConfigUtil
createContinuousReader
ContinuousReadSupport ContinuousInputPartition
createDataFrame
SQLContext SparkSession
createDataWriter
DataWriterFactory
createDataset
SQLContext SparkSession
createExternalTable
SQLContext Catalog
createGlobalTempView
Dataset
createJobContext
HadoopWriteConfigUtil
createMicroBatchReader
MicroBatchReadSupport
createModel
LogisticRegressionWithLBFGS LogisticRegressionWithSGD SVMWithSGD GeneralizedLinearAlgorithm LassoWithSGD LinearRegressionWithSGD RidgeRegressionWithSGD
createOrReplaceGlobalTempView
Dataset
createOrReplaceTempView
Dataset
createPartitionReader
InputPartition
createRDDWithLocalProperties
DStream
createRawLSHModel
BucketedRandomProjectionLSH MinHashLSH
createReader
ReadSupport
createRelation
CreatableRelationProvider RelationProvider SchemaRelationProvider
createSink
StreamSinkProvider
createSource
StreamSourceProvider
createStream
KinesisUtils
createStreamWriter
StreamWriteSupport
createTable
Catalog
createTaskAttemptContext
HadoopWriteConfigUtil
createTempView
Dataset
createTransformFunc
UnaryTransformer DCT ElementwiseProduct NGram Normalizer PolynomialExpansion RegexTokenizer Tokenizer
createUnsafe
Decimal
createWriter
WriteSupport
createWriterFactory
DataSourceWriter
crossJoin
Dataset
crosstab
DataFrameStatFunctions
csv
DataFrameReader DataFrameWriter DataStreamReader
cube
Dataset
cume_dist
functions
currentAttemptId
SparkStageInfo
currentDatabase
Catalog
currentRow
Window functions
current_date
functions
current_timestamp
functions
customMetrics
StateOperatorProgress