Just find how to create a split violin plot from here and then how to extend this for more than two groups from here. But I do not understand how to tranfer this to my own data.
Now, I am stuck in creating my own split violin plot.
I am aiming for something like this; each group should represent one cancer entity (left the reference incidence, right the cohort's incidence).
Following the first link (see above), I used this code to create the function geom_split_violin
GeomSplitViolin <- ggproto("GeomSplitViolin", GeomViolin, draw_group = function(self, data, ..., draw_quantiles = NULL){
data <- transform(data, xminv = x - violinwidth * (x - xmin), xmaxv = x + violinwidth * (xmax - x))
grp <- data[1,'group']
newdata <- plyr::arrange(transform(data, x = if(grp%%2==1) xminv else xmaxv), if(grp%%2==1) y else -y)
newdata <- rbind(newdata[1, ], newdata, newdata[nrow(newdata), ], newdata[1, ])
newdata[c(1,nrow(newdata)-1,nrow(newdata)), 'x'] <- round(newdata[1, 'x'])
if (length(draw_quantiles) > 0 & !scales::zero_range(range(data$y))) {
stopifnot(all(draw_quantiles >= 0), all(draw_quantiles <=
1))
quantiles <- create_quantile_segment_frame(data, draw_quantiles)
aesthetics <- data[rep(1, nrow(quantiles)), setdiff(names(data), c("x", "y")), drop = FALSE]
aesthetics$alpha <- rep(1, nrow(quantiles))
both <- cbind(quantiles, aesthetics)
quantile_grob <- GeomPath$draw_panel(both, ...)
ggplot2:::ggname("geom_split_violin", grobTree(GeomPolygon$draw_panel(newdata, ...), quantile_grob))
}
else {
ggplot2:::ggname("geom_split_violin", GeomPolygon$draw_panel(newdata, ...))
}
})
geom_split_violin <- function (mapping = NULL, data = NULL, stat = "ydensity", position = "identity", ..., draw_quantiles = NULL, trim = TRUE, scale = "area", na.rm = FALSE, show.legend = NA, inherit.aes = TRUE) {
layer(data = data, mapping = mapping, stat = stat, geom = GeomSplitViolin, position = position, show.legend = show.legend, inherit.aes = inherit.aes, params = list(trim = trim, scale = scale, draw_quantiles = draw_quantiles, na.rm = na.rm, ...))
}
I do not understand the full code above and guess, that this code will create a split violin plot with only two groups.
Hopefully, my request is not to general and you guys can show my a way how to reach my plot.
All I reached is
my_data <- read.csv("Test2.csv", sep = ";")
y <- data_vp$Age.groups
x <- cbind(data_vp[,3:8])
m <- cbind(data_vp[,3:5], data_vp[,6:8])
ggplot(my_data,
aes(x = x,
y = y,
fill = m)) +
geom_split_violin()
After getting your answer @missuse I tried again with this code:
raw_df <- read.csv("Test2.csv", sep = ";")
View(raw_df)
inc_all <- raw_df[,2:7]
inc_oM <- raw_df[,2:4]
inc_mM <- raw_df[,5:7]
dff <- data.frame(y = raw_df$Age.groups,
groups = as.factor(inc_all),
split = as.factor(inc_oM + inc_mM))
ggplot(dff,
aes(x = groups,
y = y,
fill = split)) +
geom_split_violin()
Does not work :(