With a data frame, I'm using dplyr to aggregate some column like below.
> data <- data.frame(a=rep(1:2,3), b=c(6:11))
> data
a b
1 1 6
2 2 7
3 1 8
4 2 9
5 1 10
6 2 11
> data %>% group_by(a) %>% summarize(tot=sum(b))
# A tibble: 2 x 2
a tot
<int> <int>
1 1 24
2 2 27
This is perfect. However I want to create a re-usable function for this such that a column name can be passed as argument.
Looking at answers to related questions like here, I tried the following.
sumByColumn <- function(df, colName) {
df %>%
group_by(a) %>%
summarize(tot=sum(colName))
df
}
However I'm not able to get it working.
> sumByColumn(data, "b")
Error in summarise_impl(.data, dots) :
Evaluation error: invalid 'type' (character) of argument.
> sumByColumn(data, b)
Error in summarise_impl(.data, dots) :
Evaluation error: object 'b' not found.
>
dplyr
– bouncyball