Add fixed number of rows for each group with values based on another column

Question

I have a large dataframe containing IDs and a start date of intervention for each ID:

And I would like to add 2 rows to each ID with subsequent dates as the values in those rows:

Is there any way using dplyr if possible? Other ways are also fine!

Just do df1[rep(seq_len(nrow(df1)), each = 3),] or using tidyverse df1 %>% uncount(3) — akrun
The dates for the rows need to be increasing, not duplicates! — TYL
ok, in that case df1 %>% uncount(3) %>% group_by(ID) %>% mutate(Date = seq(Date[1], length.out = n(), by = 1)) — akrun
It is a bit perplexing that when a dupe is tagged by me, it got reopened. It was only a simple thing to do — akrun

akrun akrun · Accepted Answer · 2019-07-08T03:50:07

We expand the data by uncounting, then grouped by 'ID', get the sequence from the first 'Date' to the number of rows (n()) while incrementing by 1

library(tidyverse)
df1 %>%
  uncount(3) %>% 
  group_by(ID) %>% 
  mutate(Date = seq(Date[1], length.out = n(), by = 1))
# A tibble: 9 x 2
# Groups:   ID [3]
#     ID  Date
#  <int> <dbl>
#1     1 17228
#2     1 17229
#3     1 17230
#4     2 17226
#5     2 17227
#6     2 17228
#7     3 17230
#8     3 17231
#9     3 17232

Or another option is unnest a list column

df1 %>%
   group_by(ID) %>% 
   mutate(Date = list(Date[1] + 0:2)) %>% 
   unnest

Or with complete

df1 %>%
   group_by(ID) %>%
   complete(Date = first(Date) + 0:2)

Or using base R (pasteing from the comments)

within(df1[rep(seq_len(nrow(df1)), each = 3),], Date <- Date + 0:2)

Or more compactly in data.table

library(data.table)
setDT(df1)[, .(Date = Date  + 0:2), ID]

Add fixed number of rows for each group with values based on another column

3 Answers