pyspark.pandas.Series.unstack

Series.unstack(level: int = -1) → pyspark.pandas.frame.DataFrame

Unstack (pivot) a Series with a MultiIndex to produce a DataFrame. The unstacked level is sorted automatically.

Parameters

level (int, str, or list of these, default last level): Level(s) to unstack; a level name can be passed.

Returns

DataFrame: Unstacked Series.

Notes

Unlike pandas, pandas-on-Spark does not check whether the index contains duplicates, because that check requires scanning the whole dataset, which can be quite expensive.
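For comparison, here is a minimal sketch of the duplicate check that plain pandas performs and pandas-on-Spark skips. It uses pandas only, not pandas-on-Spark, and the variable names are illustrative. A caller who needs the same guarantee on a pandas-on-Spark Series would have to run a comparable explicit check, which would itself scan the data.

```python
import pandas as pd

# Two rows share the index entry ('one', 'a'), so the reshape is ambiguous.
idx = pd.MultiIndex.from_tuples([("one", "a"), ("one", "a"), ("two", "b")])
s = pd.Series([1, 2, 3], index=idx)

# Explicit check a caller could run before unstacking.
has_duplicates = bool(s.index.duplicated().any())

# Plain pandas performs this check itself and refuses to reshape.
try:
    s.unstack()
    pandas_raised = False
except ValueError:
    pandas_raised = True
```

In this sketch both `has_duplicates` and `pandas_raised` end up true: pandas detects the duplicated entry and raises `ValueError` instead of returning a DataFrame.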

Examples

>>> import pandas as pd
>>> import pyspark.pandas as ps
>>> s = ps.Series([1, 2, 3, 4],
...               index=pd.MultiIndex.from_product([['one', 'two'],
...                                                 ['a', 'b']]))
>>> s
one  a    1
     b    2
two  a    3
     b    4
dtype: int64

>>> s.unstack(level=-1).sort_index()
     a  b
one  1  2
two  3  4

>>> s.unstack(level=0).sort_index()
   one  two
a    1    3
b    2    4
