How can I split my data frame into rows of 10 using tidyverse?

Question

I have a data frame with daily values. A sample of the data looks something like this:

data<-data.frame(day=c(1:20), score=c(8,15,8,20,40,1,6,42,81,18,55,35,37,85,66,12,32,42,22,64), value=c(1,0,0,0,0,0,0,1,0,0,0,0,1,0,0,0,0,0,0,0))

The real data set comprises ~2000 rows.

I would like to be able to split the data frame into tibbles. Each tibble will consist of 10 rows. The first row of each tibble will be whenever value = 1.

Some rows will therefore be represented in more than one tibble.

Is it possible to do this using tidyverse packages?

Thanks in advance.

r2evans r2evans · Accepted Answer · 2020-07-13T16:43:39

Programmatically, "split into rows of 10" and "first row of each tibble ... value = 1" are two different things. I'll go with the second:

split(data, cumsum(data$value == 1))
# $`1`
#   day score value
# 1   1     8     1
# 2   2    15     0
# 3   3     8     0
# 4   4    20     0
# 5   5    40     0
# 6   6     1     0
# 7   7     6     0
# $`2`
#    day score value
# 8    8    42     1
# 9    9    81     0
# 10  10    18     0
# 11  11    55     0
# 12  12    35     0
# $`3`
#    day score value
# 13  13    37     1
# 14  14    85     0
# 15  15    66     0
# 16  16    12     0
# 17  17    32     0
# 18  18    42     0
# 19  19    22     0
# 20  20    64     0

Cuing off of Allan's alternative interpretation, similarly:

lapply(which(data$value == 1), function(i) data[i:min(nrow(data), i+9),])
# [[1]]
#    day score value
# 1    1     8     1
# 2    2    15     0
# 3    3     8     0
# 4    4    20     0
# 5    5    40     0
# 6    6     1     0
# 7    7     6     0
# 8    8    42     1
# 9    9    81     0
# 10  10    18     0
# [[2]]
#    day score value
# 8    8    42     1
# 9    9    81     0
# 10  10    18     0
# 11  11    55     0
# 12  12    35     0
# 13  13    37     1
# 14  14    85     0
# 15  15    66     0
# 16  16    12     0
# 17  17    32     0
# [[3]]
#    day score value
# 13  13    37     1
# 14  14    85     0
# 15  15    66     0
# 16  16    12     0
# 17  17    32     0
# 18  18    42     0
# 19  19    22     0
# 20  20    64     0

How can I split my data frame into rows of 10 using tidyverse?

4 Answers