I've created a logistic model using glm with ~10 predictors and a binary response variable. The model was created using a subset of my full dataset (~8000 observation) by randomly selecting 3000 observations, putting these in a new dataset (newdata) and fitting the glm to newdata.
In order to assess the model, I would like to see how well the model describes the data in a different dataset (testdata) which has a random selection of e.g. ~1000 observations from the full dataset. How would I go about doing this in R?
I have created both confidence intervals for coefficients and looked at Wald-statistics and LRT for assessing statistical significance of my model, but would like to be able to see how well it describes a randomly chosen selection of the full dataset.
Thanks a bunch!