I have a data file ( 1 million rows) that has one outcome variable as Status ( Yes / no ) with three continuous variables and 5 nominal variables ( 5 categories in each variable ) I want to predict the outcome i.e status. I wanted to know which type of analysis is good for building up the model. I have seen logit, probit, logistic regression. I am confused on what to start and analyse the variables that are more likely useful for analysis.
data file: gender,region,age,company,speciality,jobrole,diag,labs,orders,status
M,west,41,PA,FPC, Assistant,code18,27,3,yes
M,Southwest,65,CV,FPC,Worker,code18,69,11,no
M,South,27,DV,IMC,Assistant,invalid,62,13,no
M,Southwest,18,CV,IMC,Worker,code8,6,1,yes
PS: Using R language. Any help would be greatly appreciated Thanks !