(Scala-specific) Returns a new DataFrame that drops rows containing less than
minNonNulls
non-null values in the specified columns.
Returns a new DataFrame that drops rows containing less than minNonNulls
non-null
values in the specified columns.
Returns a new DataFrame that drops rows containing less than minNonNulls
non-null values.
(Scala-specific) Returns a new DataFrame that drops rows containing null values in the specified columns.
(Scala-specific) Returns a new DataFrame that drops rows containing null values in the specified columns.
If how
is "any", then drop rows containing any null values in the specified columns.
If how
is "all", then drop rows only if every specified column is null for that row.
Returns a new DataFrame that drops rows containing null values in the specified columns.
Returns a new DataFrame that drops rows containing null values in the specified columns.
If how
is "any", then drop rows containing any null values in the specified columns.
If how
is "all", then drop rows only if every specified column is null for that row.
(Scala-specific) Returns a new that drops rows containing any null values in the specified columns.
Returns a new DataFrame that drops rows containing any null values in the specified columns.
Returns a new DataFrame that drops rows containing null values.
Returns a new DataFrame that drops rows containing null values.
If how
is "any", then drop rows containing any null values.
If how
is "all", then drop rows only if every column is null for that row.
Returns a new DataFrame that drops rows containing any null values.
(Scala-specific) Returns a new DataFrame that replaces null values.
(Scala-specific) Returns a new DataFrame that replaces null values.
The key of the map is the column name, and the value of the map is the replacement value.
The value must be of the following type: Int
, Long
, Float
, Double
, String
.
For example, the following replaces null values in column "A" with string "unknown", and null values in column "B" with numeric value 1.0.
df.na.fill(Map( "A" -> "unknown", "B" -> 1.0 ))
Returns a new DataFrame that replaces null values.
Returns a new DataFrame that replaces null values.
The key of the map is the column name, and the value of the map is the replacement value.
The value must be of the following type: Integer
, Long
, Float
, Double
, String
.
For example, the following replaces null values in column "A" with string "unknown", and null values in column "B" with numeric value 1.0.
import com.google.common.collect.ImmutableMap; df.na.fill(ImmutableMap.of("A", "unknown", "B", 1.0));
(Scala-specific) Returns a new DataFrame that replaces null values in specified string columns.
(Scala-specific) Returns a new DataFrame that replaces null values in specified string columns. If a specified column is not a string column, it is ignored.
Returns a new DataFrame that replaces null values in specified string columns.
Returns a new DataFrame that replaces null values in specified string columns. If a specified column is not a string column, it is ignored.
(Scala-specific) Returns a new DataFrame that replaces null values in specified numeric columns.
(Scala-specific) Returns a new DataFrame that replaces null values in specified numeric columns. If a specified column is not a numeric column, it is ignored.
Returns a new DataFrame that replaces null values in specified numeric columns.
Returns a new DataFrame that replaces null values in specified numeric columns. If a specified column is not a numeric column, it is ignored.
Returns a new that replaces null values in string columns with value
.
Returns a new DataFrame that replaces null values in numeric columns with value
.
:: Experimental :: Functionality for working with missing data in DataFrames.