I am currently trying to preprocess a very large dataset with a lot of categorical features for Scikit-Learns' RandomForest Model (Regression). The nature of the categorical data requires to not have any ordinality added through encoding schemes.
The H2o ML-Framework (Link) offers of enum-encoding which would suite perfectly for my data. However I rely on Scikit-Learns RandomForest.
Is anyone aware of some enum-encoding for Scikit-Learn Models? (One-Hot-Encoding is not an option)
Thanks in Advance!