Table generation R

Question

I have a similar dataset:

val<-c("Y","N")
test<-data.frame(age=rnorm(n=100,mean=50,sd=10),var1=sample(val,100,T),var2=sample(val,100,T),var3=sample(val,100,T),sex=sample(c("F","M"),100,T))

I´d like to create a summary reporting the mean age for each category using Hmisc.

library(Hmisc)
summary.formula(age~sex+var1+var2+var3,data=test)

However, var1-var3 actually belong under the same categorical variable with levels var1,var and var3 instead of Y/N. Furthermore, these are not mutually exclusive. So, is it possible somehow to create a variable var4 with these different levels that are not mutually exclusive and type

summary.formula(age~sex+var4,data=test)

and have an output like:

+-------+-+---+----+
|       | |N  |age |
+-------+-+---+----+
|sex    |F| 44|48.0|
|       |M| 56|50.8|
+-------+-+---+----+
|var4   |var1| xx|yy|
|       |var2| xx|yy|
        |var3| xx|yy|
+-------+-+---+----+
|Overall| |100|49.6|
+-------+-+---+----+

Any help would be much appreciated...

I don't understand what you want. It's not clear to me what Var4 would be or how the subsetting in the is supposed to work for Var4. — Dason

Justin Justin · Accepted Answer · 2012-05-04T21:13:53

How bout paste0? (or paste(..., sep='') if you're not on R2.15)

> test$var4 <- paste0(test$var1, test$var2, test$var3)
> summary.formula(age~sex+var4, data=test)
age    N=100

+-------+---+---+--------+
|       |   |  N|     age|
+-------+---+---+--------+
|    sex|  F| 50|50.25440|
|       |  M| 50|51.32134|
+-------+---+---+--------+
|   var4|NNN| 13|46.64417|
|       |NNY| 17|51.34456|
|       |NYN| 15|52.92185|
|       |NYY| 17|47.35685|
|       |YNN|  9|50.91647|
|       |YNY|  7|48.04489|
|       |YYN| 10|53.23713|
|       |YYY| 12|56.14394|
+-------+---+---+--------+
|Overall|   |100|50.78787|
+-------+---+---+--------+
>

Table generation R

2 Answers