Skip to content

Latest commit

 

History

History
38 lines (25 loc) · 1.34 KB

12-categorical-variables.md

File metadata and controls

38 lines (25 loc) · 1.34 KB

2.12 Categorical variables

Slides

Notes

Categorical variables are typically strings, and pandas identify them as object types. These variables need to be converted to a numerical form because the ML models can interpret only numerical features. It is possible to incorporate certain categories from a feature, not necessarily all of them. This transformation from categorical to numerical variables is known as One-Hot encoding.

The entire code of this project is available in this jupyter notebook.

⚠️ The notes are written by the community.
If you see an error here, please create a PR with a fix.

Comments

This way of encoding categorical features is called "one-hot encoding". We'll learn more about it in Session 3.

Navigation