I am using a Random Forest Classifier and I want to perform k-fold cross validation. My dataset is already split in 10 different subsets, so I'd like to use them to do k-fold cross validation, without using automatic functions that randomly split the dataset. Is it possible in Python?
Random Forest doesn't have the partial_fit() method, so I can't do an incremental fit.
partial_fit()or other splitting functions have to do here? Do you have any specific difficulty in running aforloop, and in each iteration fitting to the (concatenated) 9 subsets while testing in the remaining one? If yes, please post what you have tried so far and the specific issues encountered. Otherwise, as @BartoszKP has already noticed, the answer to your question is simply "yes, it is possible" (and it has nothing to do with Random Forest in particular, or any other specific algorithm whatsoever). - desertnaut