My data has a lot of categorical features. I encode them using Dict_vectorizer.
For example df['color']=['green','blue','white']
df['size']=['small','big','medium'] .
I use RandomForest algorithm. When I check the values of feature_importances I get different values for each category. green = 2.45*10^-2 blue =6.2 *10^-3 and so on.
Shouldn't all encoded category values have same value of feature_importances. Like all categories of color have the same importance and all values of size have the same importance? Is there a way by which I can explicitly define feature_importances? Note: I understand