Automated machine learning is the process of automating the process of applying machine learning to real-world problems.
It mainly focuses on two major aspects -
- Data acquisition/collection:- Data acquisition is the process for bringing data that has been created by a source outside the organization, into the organization, for production use.
- Prediction:- It is a result we get on an algorithm after it training it on a particular dataset and applied it on a new data
Firstly Data Preprocessing is very broad in its context because it is one of the most time-consuming tasks and it includes many subtasks such as Data Cleaning, Data Transformation, Feature Selection, etc. Secondly deciding which model is the best for your problem often requires more experience than knowledge and it is also a very time-consuming task.
Auto ML tends to automate the maximum number of steps in an ML pipeline - with a minimum amount of human effort and without compromising the model's performance.
- Applying machine learning models to our problems usually requires computer science skills, domain expertise, and mathematical expertise. Getting an expert with all these skills is not always a walk in the park
- AutoML also reduces the amount of time it would take to develop and test a machine learning model.
- AutoML also reduces bias and errors that occur when a human being is designing the machine learning models.
- AutoML is a step towards making ML accessible to everyone.
There are two major concepts to grasp as far as AutoML is concerned: -
Neural architecture search is the process of automating the design of neural networks. Usually, reinforcement learning or evolutionary algorithms are used in the design of these networks. In reinforcement learning, models are punished for low accuracies and rewarded for high accuracies. Using this technique the model will always obtain higher accuracies.
Transfer learning is a machine learning technique where a model is trained on one problem and applying it to a different but related problem. This enables us to obtain high accuracies while using less computation time and power. Transfer learning works best for problems where the datasets are similar to the ones used in pre-trained models.
Some common automl framework
- AUTO SKLEARN
- AUTO KERAS
- H20
- TPOT(Tree-Based Pipeline Optimization Tool)
- AUTO GLUON
- ML Box
- FEATURE TOOLS