Taking a simple dataframe from the R built-in dataset airquality and checking their missing values:
airquality %>% summary
While this works:
airquality %>% map_df(is.na) %>% map_df(sum)
Ozone Solar.R Wind Temp Month Day
<int> <int> <int> <int> <int> <int>
1 37 7 0 0 0 0
, and this - in purrr syntax - works too:
airquality %>% map_df(~sum(is.na(.)))
Ozone Solar.R Wind Temp Month Day
<int> <int> <int> <int> <int> <int>
1 37 7 0 0 0 0
, this doesn't work:
airquality %>% map_df(sum(is.na(.)))
Ozone Solar.R Wind Temp Month Day
<int> <int> <dbl> <int> <int> <int>
1 23 148 8 82 6 13
My question is: How can you explain the last result?
Where exactly does the calculation happen - in dplyr or purrr?