3.5.4
Overview
Programming Guides
Quick Start
RDDs, Accumulators, Broadcasts Vars
SQL, DataFrames, and Datasets
Structured Streaming
Spark Streaming (DStreams)
MLlib (Machine Learning)
GraphX (Graph Processing)
SparkR (R on Spark)
PySpark (Python on Spark)
API Docs
Scala
Java
Python
R
SQL, Built-in Functions
Deploying
Overview
Submitting Applications
Spark Standalone
Mesos
YARN
Kubernetes
More
Configuration
Monitoring
Tuning Guide
Job Scheduling
Security
Hardware Provisioning
Migration Guide
Building Spark
Contributing to Spark
Third Party Projects
MLlib: Main Guide
Basic statistics
Data sources
Pipelines
Extracting, transforming and selecting features
Classification and Regression
Clustering
Collaborative filtering
Frequent Pattern Mining
Model selection and tuning
Advanced topics
MLlib: RDD-based API Guide
Data types
Basic statistics
Classification and regression
Collaborative filtering
Clustering
Dimensionality reduction
Feature extraction and transformation
Frequent pattern mining
Evaluation metrics
PMML model export
Optimization (developer)
Tree ensemble methods
This section has been moved into the
classification and regression section
.