pyspark.sql.DataFrame.colRegex

DataFrame.colRegex(colName)[source]

Selects column based on the column name specified as a regex and returns it as Column.

New in version 2.3.0.

Parameters
colNamestr

string, column name specified as a regex.

Examples

>>> df = spark.createDataFrame([("a", 1), ("b", 2), ("c",  3)], ["Col1", "Col2"])
>>> df.select(df.colRegex("`(Col1)?+.+`")).show()
+----+
|Col2|
+----+
|   1|
|   2|
|   3|
+----+