pyspark.sql.functions.array_repeat

pyspark.sql.functions.array_repeat(col: ColumnOrName, count: Union[ColumnOrName, int]) → pyspark.sql.column.Column

Collection function: creates an array containing the value of a column repeated count times.

New in version 2.4.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters
col : Column or str

column name or column that contains the element to be repeated

count : Column or str or int

column name, column, or int containing the number of times to repeat the first argument

Returns
Column

an array of repeated elements.

Examples

>>> from pyspark.sql.functions import array_repeat
>>> df = spark.createDataFrame([('ab',)], ['data'])
>>> df.select(array_repeat(df.data, 3).alias('r')).collect()
[Row(r=['ab', 'ab', 'ab'])]