pyspark.sql.DataFrame.cache

DataFrame.cache()[source]

Persists the DataFrame with the default storage level (MEMORY_AND_DISK).

New in version 1.3.0.

Notes

The default storage level has changed to MEMORY_AND_DISK to match Scala in 2.0.