Filtering and summing rows in dplyr

Question

I have a data frame from which I want to first filter out some rows and then sum the remaining rows.

The filtering conditions are as follows:

For gr==1, find the last occurrence of y_value==10 and keep all rows before it (including the row of that last occurrence).
For gr==2, find the first occurrence of y_value==10 and keep all rows after it (including the row of that first occurrence).

The data looks like this:

df <- data.frame(gr=rep(c(1,2),c(8,7)), 
                 y_value=c(c(2,10,10,8,10,6,0,0),c(0,0,10,10,6,8,10)))



    gr y_value
1   1       2
2   1      10
3   1      10
4   1       8
5   1      10
6   1       6
7   1       0
8   1       0
9   2       0
10  2       0
11  2      10
12  2      10
13  2       6
14  2       8
15  2      10

I tried the following, based on summing-rows-based-on-conditional-in-groups:

df_temp <- df %>% 
  group_by(gr) %>% 
  mutate(rows_to_aggregate=cumsum(y_value==10)) %>% 
  filter(ifelse(gr==1, rows_to_aggregate !=0, ifelse(gr==2, rows_to_aggregate ==0 | y_value==10, rows_to_aggregate ==0))) %>% 
  filter(ifelse(gr==1, row_number(gr) != 1, ifelse(gr==2, row_number(gr) != n(), rows_to_aggregate ==0)))

But if I use rows_to_aggregate != 0 for gr==1, the rows of interest are dropped. Any guidance at this point would be appreciated!

Paul Paul · Accepted Answer · 2017-11-07T23:47:20

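library(dplyr)

df_to_aggregate <- df %>% 
    group_by(gr) %>% 
    # running count of 10s seen so far within each group
    mutate(rows_to_aggregate = cumsum(y_value == 10)) %>% 
    # gr 1: drop the rows after the last 10 (count is at its max, value is not 10)
    filter(!(gr == 1 & rows_to_aggregate == max(rows_to_aggregate) & y_value != 10)) %>%
    # gr 2: drop the rows before the first 10 (count is still 0)
    filter(!(gr == 2 & rows_to_aggregate == 0)) %>%
    select(-rows_to_aggregate)
df_to_aggregate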

# A tibble: 10 x 2
# Groups:   gr [2]
     gr y_value
  <dbl>   <dbl>
1     1       2
2     1      10
3     1      10
4     1       8
5     1      10
6     2      10
7     2      10
8     2       6
9     2       8
10    2      10
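
The question also asks for the sum of the remaining rows, which the code above stops short of. A minimal sketch of that last step, assuming one total per gr is what is wanted (the names df_sums and total are made up for illustration, not part of the original answer):

```r
library(dplyr)

# Same example data as in the question
df <- data.frame(gr = rep(c(1, 2), c(8, 7)),
                 y_value = c(c(2, 10, 10, 8, 10, 6, 0, 0),
                             c(0, 0, 10, 10, 6, 8, 10)))

# Apply the same two filters, then collapse each group to a single total
df_sums <- df %>%
  group_by(gr) %>%
  mutate(rows_to_aggregate = cumsum(y_value == 10)) %>%
  filter(!(gr == 1 & rows_to_aggregate == max(rows_to_aggregate) & y_value != 10)) %>%
  filter(!(gr == 2 & rows_to_aggregate == 0)) %>%
  summarise(total = sum(y_value))  # `total` is an illustrative column name

df_sums
# gr 1: 2 + 10 + 10 + 8 + 10 = 40
# gr 2: 10 + 10 + 6 + 8 + 10 = 44
```

Because summarise() keeps only the grouping column and the new total, the helper column rows_to_aggregate does not need to be removed with select() first.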
