pyspark.sql.DataFrame.tail

DataFrame.tail(num)

Returns the last num rows as a list of Row.

Running tail collects the requested rows into the application’s driver process, so a very large num can crash the driver with an OutOfMemoryError.

New in version 3.0.0.

Examples

>>> df.tail(1)
[Row(age=5, name='Bob')]
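
Because tail brings every requested row into the driver, it can help to cap num before calling it. The following is a minimal sketch, not part of the PySpark API: the helper name bounded_tail and its max_rows parameter (with an arbitrary default of 10,000) are hypothetical and chosen for illustration only. It relies solely on the object exposing a tail(num) method, as DataFrame does.

```python
from typing import Any, List


def bounded_tail(df: Any, num: int, max_rows: int = 10_000) -> List[Any]:
    """Hypothetical helper (not a PySpark function).

    Calls df.tail(num) only when num does not exceed max_rows, so an
    accidental very large num fails fast instead of pulling that many
    rows into the driver process.
    """
    if num > max_rows:
        raise ValueError(
            f"num={num} exceeds max_rows={max_rows}; "
            "refusing to collect that many rows into the driver"
        )
    # Delegate to the object's own tail method (DataFrame.tail in PySpark).
    return df.tail(num)
```

With the df from the example above, bounded_tail(df, 1) would simply return whatever df.tail(1) returns, while a call such as bounded_tail(df, 10**9) would raise ValueError before any data is moved. A suitable max_rows depends on row width and driver memory, so the default here is only a placeholder.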