How to create covariance matrix in R?

Question

I'm trying to build covariance matrix from a scratch (cov() function). My task is not to use any package. Hence I created my functions:

meanf <- function(x){
sum(x) / length(x)
}

sampleCov <- function(x,y){
  stopifnot(identical(length(x), length(y)))
  sum((x - meanf(x)) * (y - meanf(y))) / (length(x) - 1)
}

> sampleCov(winequality_red$quality, winequality_red$alcohol)
[1] 0.409789

Unfortunately, I'm stuck here. All loops I tried to apply are missing any point. Of course it's possible to just copy the sampleCov function and make it for every possible combination but that's not my point.

It's not clear to me what your question/problem actually is. — Dason
Firstly I'd like to apply a function on one certain column and calculate it with the remaining ones: (x, y1), (x, y2), (x, y3), (x, yn+1). — Tom
function, length and sum are all in the package "Base" so what are you actually allowed to use? — rg255

PKumar PKumar · Accepted Answer · 2018-05-19T17:27:08

If I understand you correctly then I believe you want to recreate a covariate output like the one returned by cov function.

OPs given function:

meanf <- function(x){
    sum(x) / length(x)
}

sampleCov <- function(x,y){
    stopifnot(identical(length(x), length(y)))
    sum((x - meanf(x)) * (y - meanf(y))) / (length(x) - 1)
}

You can try this way, I have taken mtcars data here:

Covariate Function:

vars <- names(mtcars)
egrid <- expand.grid(vars, vars)
egrid <- data.frame(sapply(egrid, as.character),stringsAsFactors = F)
egrid <- egrid[order(egrid$Var1, egrid$Var2),]
mat <- vector("list", nrow(egrid))

for(i in 1:nrow(egrid)){
    mat[[i]] <- sampleCov(mtcars[,egrid[i,"Var1"]], mtcars[,egrid[i,"Var2"]])
}

finaldat <- cbind(egrid, cov = do.call('rbind', mat))
finaldat_list <- split(finaldat,  finaldat$Var1)
mat_form <- do.call('cbind', finaldat_list)

cov_values <- mat_form[,grepl("\\.cov",names(mat_form))]
col_values <- mat_form[,paste0(egrid$Var1[1],".Var2")]

final_matrix_cov <- cbind(col_values, cov_values)

Sample Output:

> final_matrix_cov
    col_values       am.cov    carb.cov     cyl.cov    disp.cov
9          mpg   1.80393145 -5.36310484  -9.1723790  -633.09721
20         cyl  -0.46572581  1.52016129   3.1895161   199.66028
31        disp -36.56401210 79.06875000 199.6602823 15360.79983
42          hp  -8.32056452 83.03629032 101.9314516  6721.15867

How to create covariance matrix in R?

3 Answers