pyspark.sql.SparkSession.sql

SparkSession.sql(sqlQuery)[source]

Returns a DataFrame representing the result of the given query.

New in version 2.0.0.

Returns
DataFrame

Examples

>>> df.createOrReplaceTempView("table1")
>>> df2 = spark.sql("SELECT field1 AS f1, field2 as f2 from table1")
>>> df2.collect()
[Row(f1=1, f2='row1'), Row(f1=2, f2='row2'), Row(f1=3, f2='row3')]