3
votes

I am trying to create a new variable (var5) that takes the value from one of the other numeric variables in the dataframe (var1, var2 or var3), based on the level of a factor variable (var4)

variables=data.frame(var1=c(2,3,4,5), var2=c(20,30,40,50), var3=c(200,300,400,500), var4=c('var1','var2','var3','var1'))

variables

  var1 var2 var3 var4
1    2   20  200 var1
2    3   30  300 var2
3    4   40  400 var3
4    5   50  500 var1 

The expected result would be this:

  var1 var2 var3 var4 var5
1    2   20  200 var1    2
2    3   30  300 var2   30
3    4   40  400 var3  400
4    5   50  500 var1    5

I could not find a way to work it out.

4

4 Answers

1
votes

I'm sure someone can come up with something fancier, but dplyr::case_when() will get the job done:

library(dplyr)

data.frame(
  var1 = c(2, 3, 4, 5),
  var2 = c(20, 30, 40, 50),
  var3 = c(200, 300, 400, 500),
  var4 = c('var1', 'var2', 'var3', 'var1')) %>%
  mutate(var5 = case_when(var4 == "var1" ~ var1,
                          var4 == "var2" ~ var2,
                          var4 == "var3" ~ var3))
3
votes
for(i in 1:nrow(df)) {
  df$var5[i] <- df[which(colnames(df)==df$var4[i]), i]
}

with sapply (thank you andrew :-) )

df$var5 <- unlist(sapply(seq_along(df), function(x) df[x, match(df$var4[x], names(df))]))

This works dynamically. (case_when can be a pain if you have 100 different columns).

1
votes

Using data.table:

library(data.table)
setDT(variables)[, var5 := ifelse(var4 == "var1", var1, ifelse(var4 == "var2", var2, var3))]



    var1 var2 var3 var4 var5
1:    2   20  200 var1    2
2:    3   30  300 var2   30
3:    4   40  400 var3  400
4:    5   50  500 var1    5
1
votes
variables=data.frame(var1=c(2,3,4,5), var2=c(20,30,40,50), 
                     var3=c(200,300,400,500), var4=c('var1','var2','var3','var1'))

library(tidyverse)

variables %>%
  mutate(var5 = pmap_chr(
    .l = .,
    .f = function(...){
      row <-  c(...)
      cols <- row["var4"] %>% `[[`(1)
      vals <- row[cols] %>% str_c()
      return(vals)
    }
  ))

#>   var1 var2 var3 var4 var5
#> 1    2   20  200 var1    2
#> 2    3   30  300 var2   30
#> 3    4   40  400 var3  400
#> 4    5   50  500 var1    5

Created on 2019-06-20 by the reprex package (v0.3.0) in reference to mutate specific column by evaluating lookup cell value