pyspark.pandas.Series.unstack

Series.unstack(level: int = - 1) → pyspark.pandas.frame.DataFrame[source]

Unstack, a.k.a. pivot, Series with MultiIndex to produce DataFrame. The level involved will automatically get sorted.

Parameters
levelint, str, or list of these, default last level

Level(s) to unstack, can pass level name.

Returns
DataFrame

Unstacked Series.

Notes

Unlike pandas, pandas-on-Spark doesn’t check whether an index is duplicated or not because the checking of duplicated index requires scanning whole data which can be quite expensive.

Examples

>>> s = ps.Series([1, 2, 3, 4],
...               index=pd.MultiIndex.from_product([['one', 'two'],
...                                                 ['a', 'b']]))
>>> s
one  a    1
     b    2
two  a    3
     b    4
dtype: int64
>>> s.unstack(level=-1).sort_index()
     a  b
one  1  2
two  3  4
>>> s.unstack(level=0).sort_index()
   one  two
a    1    3
b    2    4