dataframe operations among two columns and adding result as a third column

Question

Given a data frame with columns:

"length1": integers stored as character strings
"length2": each element is a string of comma-separated numbers

I would like to express each value in the length2 column as a percentage of the corresponding length1 value, so something like df$length2 / df$length1 * 100. Please see the following minimal example:

> df=data.frame(length1=c("10","12","14"))
> df$length2=list("2,3,4","4,5,3","3,2,6")
> df

  length1 length2
1      10   2,3,4
2      12   4,5,3
3      14   3,2,6

> dfresult=df
> dfresult$resultInPercent=list("20,30,40","33,41,25","21,14,42")
> dfresult

  length1 length2 resultInPercent
1      10   2,3,4        20,30,40
2      12   4,5,3        33,41,25
3      14   3,2,6        21,14,42

I can't get it to work. My approach was:

dfresult=apply(df, 1, function(x) 
{

  lapply(lapply(lapply(x$length2,strsplit,split=","),as.numeric),function(y)
     {
        round(as.numeric(y)/as.numeric(x$length1)*100)
     }

  )
 } 
)

Error in lapply(lapply(x$length2, strsplit, split = ","), as.numeric) : (list) object cannot be coerced to type 'double'

I gave up here, and I have the feeling that what I am doing is way too complicated.

Steven Beaupré · Accepted Answer · 2016-07-29T12:03:37

Another idea:

library(dplyr)
library(tidyr)

df %>%
  separate_rows(length2) %>% 
  mutate_all(funs(as.numeric(as.character(.)))) %>%
  group_by(length1) %>%
  summarise(l2 = list(length2), 
            l3 = list(round(100 * length2 / length1)))

Which gives:

## A tibble: 3 x 3
#  length1        l2        l3
#    <dbl>    <list>    <list>
#1      10 <dbl [3]> <dbl [3]>
#2      12 <dbl [3]> <dbl [3]>
#3      14 <dbl [3]> <dbl [3]>

This stores the results in list columns, which keeps them easily accessible for further operations (the summary below is in the format printed by glimpse()):

#Observations: 3
#Variables: 3
#$ length1 <dbl> 10, 12, 14
#$ l2      <list> [<2, 3, 4>, <4, 5, 3>, <3, 2, 6>]
#$ l3      <list> [<20, 30, 40>, <33, 42, 25>, <21, 14, 43>]
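
If you would rather stay in base R and get the comma-separated strings asked for in the question, something along these lines might work. This is only a sketch: it rebuilds the example data from the question, the intermediate names parts and pct are made up for illustration, and it uses round(), so 5/12 and 6/14 come out as 42 and 43 (as in l3 above) rather than the 41 and 42 written in the question.

```r
# Rebuild the example data from the question (length2 is a list column of strings).
df <- data.frame(length1 = c("10", "12", "14"))
df$length2 <- list("2,3,4", "4,5,3", "3,2,6")

# Split each "a,b,c" string into its own numeric vector.
parts <- lapply(strsplit(unlist(df$length2), ",", fixed = TRUE), as.numeric)

# as.character() first, in case length1 was read in as a factor.
totals <- as.numeric(as.character(df$length1))

# Divide each vector by its row's length1 and round to whole percentages.
pct <- Map(function(p, total) round(100 * p / total), parts, totals)

# Collapse each result back into one comma-separated string per row.
df$resultInPercent <- vapply(pct, paste, character(1), collapse = ",")

df$resultInPercent
# [1] "20,30,40" "33,42,25" "21,14,43"
```

If you want to keep working with the numbers, you could assign pct itself (df$resultInPercent <- pct) to get a list column instead of strings.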
