Packages 
Package Description
org.apache.hadoop.hive.ql.io.orc  
org.apache.spark
Core Spark classes in Scala.
org.apache.spark.api.java
Spark Java programming APIs.
org.apache.spark.api.java.function
Set of interfaces to represent functions in Spark's Java API.
org.apache.spark.api.r  
org.apache.spark.broadcast
Spark's broadcast variables, used to broadcast immutable datasets to all nodes.
org.apache.spark.graphx
ALPHA COMPONENT GraphX is a graph processing framework built on top of Spark.
org.apache.spark.graphx.impl  
org.apache.spark.graphx.lib
Various analytics functions for graphs.
org.apache.spark.graphx.util
Collections of utilities used by graphx.
org.apache.spark.input  
org.apache.spark.internal.config  
org.apache.spark.io
IO codecs used for compression.
org.apache.spark.launcher
Library for launching Spark applications.
org.apache.spark.mapred  
org.apache.spark.ml
Spark ML is a component that adds a new set of machine learning APIs to let users quickly assemble and configure practical machine learning pipelines.
org.apache.spark.ml.ann  
org.apache.spark.ml.attribute
ML attributes
org.apache.spark.ml.classification  
org.apache.spark.ml.clustering  
org.apache.spark.ml.evaluation  
org.apache.spark.ml.feature
Feature transformers The `ml.feature` package provides common feature transformers that help convert raw data or features into more suitable forms for model fitting.
org.apache.spark.ml.impl  
org.apache.spark.ml.linalg  
org.apache.spark.ml.param  
org.apache.spark.ml.param.shared  
org.apache.spark.ml.r  
org.apache.spark.ml.recommendation  
org.apache.spark.ml.regression  
org.apache.spark.ml.source.libsvm  
org.apache.spark.ml.stat.distribution  
org.apache.spark.ml.tree  
org.apache.spark.ml.tree.impl  
org.apache.spark.ml.tuning  
org.apache.spark.ml.util  
org.apache.spark.mllib.classification  
org.apache.spark.mllib.classification.impl  
org.apache.spark.mllib.clustering  
org.apache.spark.mllib.evaluation  
org.apache.spark.mllib.evaluation.binary  
org.apache.spark.mllib.feature  
org.apache.spark.mllib.fpm  
org.apache.spark.mllib.linalg  
org.apache.spark.mllib.linalg.distributed  
org.apache.spark.mllib.optimization  
org.apache.spark.mllib.pmml  
org.apache.spark.mllib.pmml.export  
org.apache.spark.mllib.random  
org.apache.spark.mllib.rdd  
org.apache.spark.mllib.recommendation  
org.apache.spark.mllib.regression  
org.apache.spark.mllib.regression.impl  
org.apache.spark.mllib.stat  
org.apache.spark.mllib.stat.correlation  
org.apache.spark.mllib.stat.distribution  
org.apache.spark.mllib.stat.test  
org.apache.spark.mllib.tree  
org.apache.spark.mllib.tree.configuration  
org.apache.spark.mllib.tree.impurity  
org.apache.spark.mllib.tree.loss  
org.apache.spark.mllib.tree.model  
org.apache.spark.mllib.util  
org.apache.spark.partial  
org.apache.spark.rdd
Provides implementation's of various RDDs.
org.apache.spark.rpc.netty  
org.apache.spark.scheduler
Spark's DAG scheduler.
org.apache.spark.scheduler.cluster  
org.apache.spark.scheduler.cluster.mesos  
org.apache.spark.scheduler.local  
org.apache.spark.security  
org.apache.spark.serializer
Pluggable serializers for RDD and shuffle data.
org.apache.spark.sql  
org.apache.spark.sql.api.java
Allows the execution of relational queries, including those expressed in SQL using Spark.
org.apache.spark.sql.api.r  
org.apache.spark.sql.catalog  
org.apache.spark.sql.expressions  
org.apache.spark.sql.expressions.javalang  
org.apache.spark.sql.expressions.scalalang  
org.apache.spark.sql.hive  
org.apache.spark.sql.hive.execution  
org.apache.spark.sql.hive.orc  
org.apache.spark.sql.internal
All classes in this package are considered an internal API to Spark and are subject to change between minor releases.
org.apache.spark.sql.jdbc  
org.apache.spark.sql.sources  
org.apache.spark.sql.types  
org.apache.spark.sql.util  
org.apache.spark.status.api.v1  
org.apache.spark.storage  
org.apache.spark.storage.memory  
org.apache.spark.streaming  
org.apache.spark.streaming.api.java
Java APIs for spark streaming.
org.apache.spark.streaming.dstream
Various implementations of DStreams.
org.apache.spark.streaming.flume
Spark streaming receiver for Flume.
org.apache.spark.streaming.kafka
Kafka receiver for spark streaming.
org.apache.spark.streaming.kinesis  
org.apache.spark.streaming.receiver  
org.apache.spark.streaming.scheduler  
org.apache.spark.streaming.ui  
org.apache.spark.streaming.util  
org.apache.spark.ui  
org.apache.spark.ui.env  
org.apache.spark.ui.exec  
org.apache.spark.ui.jobs  
org.apache.spark.ui.storage  
org.apache.spark.util
Spark utilities.
org.apache.spark.util.io  
org.apache.spark.util.random
Utilities for random number generation.
org.apache.spark.util.sketch