I am very new to machine learning. Sorry if there are any mistakes in my English.
I am using the weka J48 Classification for prediction in true or false. I have almost 999K training set which i used to train the model. I used the cross validation method with 3 folds to train the Model which gives me accuracy of ~84%.
Now after storing the model. i tried to test it on 50k dataset. which is giving very bad results and 50% of them are mismatch. I have 11 attributes with nominal and numeric fields.
I dont know why its happening.
I have two questions.
- How can i train to perform better on test set.
- what could be possible issues.
I am using weka api in java.