
  • package spark

    Core Spark functionality. org.apache.spark.SparkContext serves as the main entry point to Spark, while org.apache.spark.rdd.RDD is the data type representing a distributed collection, and provides most parallel operations.

    In addition, org.apache.spark.rdd.PairRDDFunctions contains operations available only on RDDs of key-value pairs, such as groupByKey and join; org.apache.spark.rdd.DoubleRDDFunctions contains operations available only on RDDs of Doubles; and org.apache.spark.rdd.SequenceFileRDDFunctions contains operations available on RDDs that can be saved as SequenceFiles. These operations are automatically available on any RDD of the right type (e.g. RDD[(Int, Int)]) through implicit conversions.
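As a rough illustration of the implicit conversion described above, the following sketch calls groupByKey and mapValues on an RDD of pairs without any explicit wrapping. The object name, the sample data, the "local[2]" master and the application name are assumptions made for the example, not part of the documented API.

```scala
import org.apache.spark.{SparkConf, SparkContext}

object PairRddSketch {
  // Sums values per key using groupByKey and mapValues. Both methods live on
  // PairRDDFunctions and are reachable here only through the implicit conversion.
  def totalsByKey(sc: SparkContext): Map[String, Int] = {
    val pairs = sc.parallelize(Seq(("a", 1), ("b", 2), ("a", 3))) // RDD[(String, Int)]
    pairs.groupByKey().mapValues(_.sum).collect().toMap
  }

  def main(args: Array[String]): Unit = {
    // Master URL and application name are illustrative choices.
    val conf = new SparkConf().setMaster("local[2]").setAppName("pair-rdd-sketch")
    val sc = new SparkContext(conf)
    try println(totalsByKey(sc))
    finally sc.stop()
  }
}
```

The same call on an RDD whose elements are not pairs should fail to compile, since no conversion to PairRDDFunctions applies.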

    Java programmers should reference the package for Spark programming APIs in Java.

    Classes and methods marked with Experimental are user-facing features which have not been officially adopted by the Spark project. These are subject to change or removal in minor releases.

    Classes and methods marked with Developer API are intended for advanced users who want to extend Spark through lower-level interfaces. These are subject to change or removal in minor releases.

  • package sql

    Allows the execution of relational queries, including those expressed in SQL using Spark.

  • object SparkSession extends Logging with Serializable
  • Builder
  • LogStringContext

class Builder extends Logging

Builder for SparkSession.

Linear Supertypes
Logging, AnyRef, Any

Instance Constructors

  1. new Builder()

Type Members

  1. implicit class LogStringContext extends AnyRef

Value Members

  1. def appName(name: String): Builder

    Sets a name for the application, which will be shown in the Spark web UI. If no application name is set, a randomly generated name will be used.



  2. def config(conf: SparkConf): Builder

    Sets a list of config options based on the given SparkConf.



  3. def config(map: Map[String, Any]): Builder

    Sets a config option. Options set using this method are automatically propagated to both SparkConf and SparkSession's own configuration.



  4. def config(map: Map[String, Any]): Builder

    Sets a config option. Options set using this method are automatically propagated to both SparkConf and SparkSession's own configuration.



  5. def config(key: String, value: Boolean): Builder

    Sets a config option. Options set using this method are automatically propagated to both SparkConf and SparkSession's own configuration.



  6. def config(key: String, value: Double): Builder

    Sets a config option. Options set using this method are automatically propagated to both SparkConf and SparkSession's own configuration.



  7. def config(key: String, value: Long): Builder

    Sets a config option. Options set using this method are automatically propagated to both SparkConf and SparkSession's own configuration.



  8. def config(key: String, value: String): Builder

    Sets a config option. Options set using this method are automatically propagated to both SparkConf and SparkSession's own configuration.
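The config overloads listed above can be chained on one builder. The sketch below is a minimal illustration under a few assumptions: the keys starting with "spark.example." are hypothetical placeholders rather than real Spark settings, the master URL and application name are arbitrary, and the Map overload is only present in Spark versions whose Builder actually declares it.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.sql.SparkSession

object ConfigOverloadsSketch {
  def build(): SparkSession = {
    // Options carried over from an existing SparkConf.
    val base = new SparkConf().set("spark.ui.enabled", "false")

    SparkSession.builder()
      .master("local[1]")                             // illustrative master URL
      .appName("config-overloads-sketch")
      .config(base)                                   // config(conf: SparkConf)
      .config("spark.sql.shuffle.partitions", 8L)     // config(key, Long)
      .config("spark.sql.adaptive.enabled", true)     // config(key, Boolean)
      .config("spark.example.ratio", 0.5)             // config(key, Double), hypothetical key
      .config("spark.example.label", "demo")          // config(key, String), hypothetical key
      .config(Map[String, Any]("spark.example.retries" -> 3L)) // config(map), hypothetical key
      .getOrCreate()
  }

  def main(args: Array[String]): Unit = {
    val spark = build()
    // Values are stored as strings, so they are read back as strings.
    println(spark.conf.get("spark.sql.shuffle.partitions"))
    println(spark.conf.get("spark.example.label"))
    spark.stop()
  }
}
```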



  9. def enableHiveSupport(): Builder

    Enables Hive support, including connectivity to a persistent Hive metastore, support for Hive serdes, and Hive user-defined functions.



  10. def getOrCreate(): SparkSession

    Gets an existing SparkSession or, if there is no existing one, creates a new one based on the options set in this builder.

    This method first checks whether there is a valid thread-local SparkSession and, if so, returns it. Otherwise, it checks whether there is a valid global default SparkSession and, if so, returns that one. If no valid global default SparkSession exists, the method creates a new SparkSession and assigns it as the global default.

    In case an existing SparkSession is returned, the non-static config options specified in this builder will be applied to the existing SparkSession.
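The reuse behavior described above can be sketched as follows. This is an illustration rather than a specification: it assumes a fresh JVM with no other session running, and the object name, application name, and the choice of spark.sql.shuffle.partitions as the non-static option are all picked for the example.

```scala
import org.apache.spark.sql.SparkSession

object GetOrCreateSketch {
  // Returns whether the second getOrCreate() call handed back the first session,
  // together with the option value visible on that session afterwards.
  def reuse(): (Boolean, String) = {
    val first = SparkSession.builder()
      .master("local[1]")                  // illustrative master URL
      .appName("get-or-create-sketch")
      .getOrCreate()

    // No master is given here: an existing session is expected to be found first.
    val second = SparkSession.builder()
      .config("spark.sql.shuffle.partitions", 3L)
      .getOrCreate()

    val result = (first eq second, first.conf.get("spark.sql.shuffle.partitions"))
    first.stop()
    result
  }

  def main(args: Array[String]): Unit = println(reuse())
}
```

Because the second builder's option is applied to the session that already exists, code holding a reference to the first session observes the new value too.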



  11. def master(master: String): Builder

    Sets the Spark master URL to connect to, such as "local" to run locally, "local[4]" to run locally with 4 cores, or "spark://master:7077" to run on a Spark standalone cluster.
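A minimal sketch of the "local[4]" form mentioned above: it starts a local session and reads back the default parallelism the scheduler reports. The object and application names are assumptions, and the result presumes a fresh JVM in which no session exists yet and spark.default.parallelism is not set.

```scala
import org.apache.spark.sql.SparkSession

object MasterUrlSketch {
  // Starts a session on a local master with 4 worker threads and reports
  // the default parallelism derived from that master URL.
  def localParallelism(): Int = {
    val spark = SparkSession.builder()
      .appName("master-url-sketch")
      .master("local[4]")
      .getOrCreate()
    try spark.sparkContext.defaultParallelism
    finally spark.stop()
  }

  def main(args: Array[String]): Unit = println(localParallelism())
}
```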



  12. def withExtensions(f: (SparkSessionExtensions) => Unit): Builder

    Inject extensions into the SparkSession. This allows a user to add Analyzer rules, Optimizer rules, Planning Strategies or a customized parser.
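One way to use withExtensions is to register an optimizer rule through SparkSessionExtensions.injectOptimizerRule. The sketch below registers a hypothetical PassThroughRule that returns every plan unchanged, so it only demonstrates the wiring. The rule, the object name, the master URL and the application name are assumptions for illustration, and the catalyst classes it imports are lower-level interfaces that may change between releases.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.catalyst.plans.logical.LogicalPlan
import org.apache.spark.sql.catalyst.rules.Rule

object ExtensionsSketch {
  // Hypothetical optimizer rule: leaves every logical plan as it is.
  case class PassThroughRule(session: SparkSession) extends Rule[LogicalPlan] {
    override def apply(plan: LogicalPlan): LogicalPlan = plan
  }

  def build(): SparkSession =
    SparkSession.builder()
      .master("local[1]")                  // illustrative master URL
      .appName("extensions-sketch")
      // The function receives the SparkSessionExtensions instance to register on.
      .withExtensions(ext => ext.injectOptimizerRule(session => PassThroughRule(session)))
      .getOrCreate()

  def main(args: Array[String]): Unit = {
    val spark = build()
    println(spark.range(3).count())
    spark.stop()
  }
}
```

Since getOrCreate() may return a session that already exists, extensions are best registered on the builder that creates the first session in the application.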

