how can I calculate a conditional sum with True/False information?

Question

I'm having trouble figuring out how code for the divisions of numerical values: (col1)/(col2), based on the True/False values in columns 3 & 4

I have 500 rows of data and I'm trying to calculate the mean yield of a crop (kg crop/hectares) based on different conditions. I'm trying to answer a question like "what would the mean yield be if the condition in column 3 was True and column 4 was False?"

EDIT: here is example data.

col 1   col2   col 3   col4
1.5     2.0     T       T
1.5     2.0     F       T
2.5     5.0     F       F
2.5     5.0     F       T

so I'm trying to find the mean of col1/col2 if, for example, col3 = F and col4 = T

thank you!

It's easier to help you if you include a simple reproducible example with sample input and desired output that can be used to test and verify possible solutions. For your example, just include a few rows rather than all 500. — MrFlick
As a general note: If one of the answers has helped you with your problem, you should accept it as correct. This helps the community know that questions have been answered. — Dylan_Gomes

iod iod · Accepted Answer · 2019-11-20T18:33:38

You need to subset your data based on the two conditions. You can do that using [col3 & !col4], like so:

mean(with(data,col1[col3 & !col4]/col2[col3 & !col4]))

(with is just an easier way to not have to keep writing data$ every time).

For example, here's some fake data:

data<-data.frame(col1=1:5,col2=10:6,col3=c(TRUE,TRUE,TRUE,FALSE,FALSE),col4=c(FALSE,TRUE,FALSE,FALSE,TRUE))

and here's what you get from my solution:

mean(with(data,col1[col3 & !col4]/col2[col3 & !col4]))
[1] 0.2375

how can I calculate a conditional sum with True/False information?

3 Answers