corr
corr.Rd
Computes the Pearson Correlation Coefficient for two Columns.
Usage
corr(x, ...)
# S4 method for Column
corr(x, col2)
# S4 method for SparkDataFrame
corr(x, colName1, colName2, method = "pearson")
Arguments
- x
a Column or a SparkDataFrame.
- ...
additional argument(s). If
x
is a Column, a Column should be provided. Ifx
is a SparkDataFrame, two column names should be provided.- col2
a (second) Column.
- colName1
the name of the first column
- colName2
the name of the second column
- method
Optional. A character specifying the method for calculating the correlation. only "pearson" is allowed now.
Examples
if (FALSE) {
df <- createDataFrame(cbind(model = rownames(mtcars), mtcars))
head(select(df, corr(df$mpg, df$hp)))}
if (FALSE) {
corr(df, "mpg", "hp")
corr(df, "mpg", "hp", method = "pearson")}