I am currently using H2O's AutoML for a data science project. However, nowhere in the documentation or on the internet or in the code I can find how AutoML treats factor variables - does it do one-hot encoding? Label encoding? Something more advanced? Does it consider how many levels there are? Does it depend on the algorithm?
Currently, AutoML performs really badly (marginally above the baseline), and I suspect it's because it doesn't treat categoricals right, which make up about 90% of my predictors.