Join two data frames based on one column of a frame and two columns of another

Question

So I have two data frames, info and towers, with examples in the following:

Info:

ID             Date
1132           01/09/2015
1156           02/09/2015
1132           04/09/2015
1101           04/09/2015

Towers:

Tower   ID1   ID2
    1   1132  1101
    2   1520  1156

The values in the ID column of Info will always match either ID1 or ID2 in Towers. I want to join the frames based on that information, so my joined frame should be:

ID             Date         Tower
1132           01/09/2015       1
1156           02/09/2015       2
1132           04/09/2015       1
1101           04/09/2015       2

I know dplyr's semi_join makes something like what I need, but I understand it requires a match in both value and column name. Given that these columns have different names, I don't know if it will work properly. Is there a method I could use here?

You should probably look at as.Date and learn to format those properly. Also, please make your example reproducible next time so it can be copy-pasted in by others. — Frank
@Frank Yes I already worked in the format. For learning purposes, what do you mean exactly as a reproducible example? — Rono
I'm referring to the extra stuff in Sumedh's answer below, that looks like structure(...) If you copy-paste that into your R session, it will return your example data.frame. This sort of thing should be included with a question. For info on how to go about this, check out stackoverflow.com/questions/5963269/… — Frank

Sumedh Sumedh · Accepted Answer · 2016-08-02T04:33:12

library(dplyr)

tidyr::gather(df2, Tower2, ID, -Tower) %>% select(-Tower2) %>% right_join(df, "ID")

df

structure(list(ID = c(1132, 1156, 1132, 1101), Date = structure(c(1L, 
2L, 3L, 3L), .Label = c("01/09/2015", "02/09/2015", "04/09/2015"
), class = "factor")), .Names = c("ID", "Date"), row.names = c(NA, 
-4L), class = "data.frame")

df2

structure(list(Tower = 1:2, ID1 = c(1132L, 1520L), ID2 = c(1101L, 
1156L)), .Names = c("Tower", "ID1", "ID2"), class = "data.frame", row.names = c(NA, 
-2L))

Join two data frames based on one column of a frame and two columns of another

4 Answers