Selecting Rows in a Column Contingent on Two Variables in R

Question

I am working with a data set that contains multiple observations for each prescription a patient is taking, with many different patients. Patients typically take one of several drugs, which are indicated as their own binary variables, Drug1, Drug2 and so on.

I am attempting to pull out only the individuals that have switched from one drug to the other, i.e, have a 1 in Drug1 column and Drug2, but these occur in different rows.

I have attempted to use newdata <- mydata[which(Drug1 == 1 & Drug2 == 1),] however, this assumes that the 1's are in the same row, which they are not.

Is there a way to select the patients that have received both drugs, but the indicator variables are in different rows?

Thank you

Brandon LeBeau Brandon LeBeau · Accepted Answer · 2017-07-25T17:07:28

I believe this is a solution to what you are asking using dplyr.

data <- data.frame(id = rep(c(1, 2, 3, 4), each = 2),
               drug1 = c(1, 0, 0, 0, 0, 1, 1, 1),
               drug2 = c(0, 1, 1, 1, 1, 0, 0, 0)
               )
library(dplyr)
data %>%
  group_by(id) %>%
  mutate(both_drugs = ifelse(any(drug1 == 1)  & any(drug2 == 1), 1, 0)) %>%
  filter(both_drugs == 1)

Selecting Rows in a Column Contingent on Two Variables in R

2 Answers