pyspark.SparkContext.setCheckpointDir

SparkContext.setCheckpointDir(dirName)[source]

Set the directory under which RDDs are going to be checkpointed. The directory must be an HDFS path if running on a cluster.