I have a very simple question about referencing data columns within a nested dataframe.
For a reproducible example, I'll nest mtcars
by the two values of variable am
:
library(tidyverse)
mtcars_nested <- mtcars %>%
group_by(am) %>%
nest()
mtcars_nested
which gives data that looks like this.
#> # A tibble: 2 x 2
#> # Groups: am [2]
#> am data
#> <dbl> <list>
#> 1 1 <tibble [13 × 10]>
#> 2 0 <tibble [19 × 10]>
If I now wanted to use purrr::map
to take the mean of mpg
for each level of am
I wonder why this doesn't work:
take_mean_mpg <- function(df){
mean(df[["data"]]$mpg)
}
map(mtcars_nested, take_mean_mpg)
Error in df[["data"]] : subscript out of bounds
Or maybe a simpler question is: How should I properly reference the mpg
column, once it's nested. I know that this doesn't work:
mtcars_nested[["data"]]$mpg