Skip to contents

Return a new SparkDataFrame with the specified columns added or replaced.

Usage

mutate(.data, ...)

transform(`_data`, ...)

# S4 method for SparkDataFrame
mutate(.data, ...)

# S4 method for SparkDataFrame
transform(`_data`, ...)

Arguments

.data

a SparkDataFrame.

...

additional column argument(s) each in the form name = col.

_data

a SparkDataFrame.

Value

A new SparkDataFrame with the new columns added or replaced.

Note

mutate since 1.4.0

transform since 1.5.0

Examples

if (FALSE) {
sparkR.session()
path <- "path/to/file.json"
df <- read.json(path)
newDF <- mutate(df, newCol = df$col1 * 5, newCol2 = df$col1 * 2)
names(newDF) # Will contain newCol, newCol2
newDF2 <- transform(df, newCol = df$col1 / 5, newCol2 = df$col1 * 2)

df <- createDataFrame(list(list("Andy", 30L), list("Justin", 19L)), c("name", "age"))
# Replace the "age" column
df1 <- mutate(df, age = df$age + 1L)
}