Useful links - Machine Learning Encoding
-
one-hot encoding of features
- https://www.mygreatlearning.com/blog/label-encoding-in-python/
- https://www.analyticsvidhya.com/blog/2020/03/one-hot-encoding-vs-label-encoding-using-scikit-learn/
- https://kiwidamien.github.io/are-you-getting-burned-by-one-hot-encoding.html
- https://machinelearningmastery.com/one-hot-encoding-for-categorical-data/
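A minimal, dependency-free sketch of the difference between label encoding and one-hot encoding that the links above discuss. The `colors` values are made up for illustration. In practice scikit-learn's `OneHotEncoder` or `pandas.get_dummies` would normally do this work; plain Python is used here only to show what the transformation produces.

```python
# Hypothetical categorical feature; the values are illustrative only.
colors = ["red", "green", "blue", "green"]

# Sorted unique categories give a stable column order.
categories = sorted(set(colors))  # ['blue', 'green', 'red']

# Label encoding: one integer per category.
# Note that this implies an ordering (blue < green < red) that may not exist.
label_codes = {cat: i for i, cat in enumerate(categories)}
label_encoded = [label_codes[c] for c in colors]

# One-hot encoding: one 0/1 column per category, exactly one 1 per row.
one_hot = [[1 if c == cat else 0 for cat in categories] for c in colors]

print(label_encoded)  # [2, 1, 0, 1]
print(one_hot)        # [[0, 0, 1], [0, 1, 0], [1, 0, 0], [0, 1, 0]]
```

Label encoding keeps a single column but invents an order; one-hot encoding avoids the invented order at the cost of one column per category.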
-
encoding of target/label
- https://scikit-learn.org/stable/modules/preprocessing_targets.html#preprocessing-targets
- can also take a column of strings - see multilabel classification
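A rough sketch of what encoding a string target looks like, written in plain Python so it runs without dependencies. The linked scikit-learn page documents `LabelEncoder` and `MultiLabelBinarizer` for this purpose; the code below only imitates the idea and is not their implementation. The label values are hypothetical.

```python
# Hypothetical string targets.
y = ["spam", "ham", "spam", "eggs"]

# Single-label case: map each class name to an integer index.
classes = sorted(set(y))  # ['eggs', 'ham', 'spam']
class_to_index = {c: i for i, c in enumerate(classes)}
y_encoded = [class_to_index[label] for label in y]

# The mapping is invertible, so predictions can be turned back into strings.
y_decoded = [classes[i] for i in y_encoded]

# Multilabel case: each sample carries a set of labels (possibly empty),
# encoded as a row in a 0/1 indicator matrix.
y_multi = [{"sci-fi", "thriller"}, {"comedy"}, set()]
multi_classes = sorted(set().union(*y_multi))  # ['comedy', 'sci-fi', 'thriller']
y_indicator = [
    [1 if c in labels else 0 for c in multi_classes] for labels in y_multi
]

print(y_encoded)    # [2, 1, 2, 0]
print(y_indicator)  # [[0, 1, 1], [1, 0, 0], [0, 0, 0]]
```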
-
Why one-hot encoding can decrease the performance of decision trees and random forests - curse of dimensionality
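A small sketch of the dimensionality growth behind this point, assuming a hypothetical feature with 50 categories spread evenly over 1000 rows. It shows only how wide and sparse the encoded data becomes, not the effect on any particular model.

```python
# Hypothetical sizes, chosen only for illustration.
n_rows, n_categories = 1000, 50
categories = [f"cat_{i}" for i in range(n_categories)]

# One original column, with the categories cycled evenly across the rows.
column = [categories[i % n_categories] for i in range(n_rows)]

# One-hot encoding turns the single column into one column per category.
one_hot = [[1 if value == cat else 0 for cat in categories] for value in column]

n_columns = len(one_hot[0])
# Fraction of cells that are 1: each row has a single 1, so this is 1 / n_columns.
density = sum(map(sum, one_hot)) / (n_rows * n_columns)
# Number of rows a split on the first dummy column would separate from the rest.
rows_in_first_column = sum(row[0] for row in one_hot)

print(n_columns)             # 50
print(density)               # 0.02
print(rows_in_first_column)  # 20
```

Each dummy column is almost entirely zeros, so a split on any one of them separates only a small slice of the data (20 of 1000 rows here).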
-
Why decision trees don't require one-hot encoding