Waffer_Fault_Detection

Project OverView

The Waffer Fault Detection project in Python involves using machine learning algorithms to detect and classify defects on silicon waffers in semiconductor manufacturing.

Dataset:

For each Waffer we get values from 500+ sensors and based on that we tell whether the waffer is functioning(+1) or faulty(-1)
Click Here to get Data

Architecture:

Data Validation:

In this step, we perform different steps of validation like,
Is the filename valid?
Are all columns present?
Name of each column
Data type of each columns

Data Insertion in Database

In this step we perform the following things,
Database Creation and connection
Table creation in the database
Insertion of files in the table

Model Training

Data Export from Db:

The data in a stored database is exported as a CSV file to be used for model training.

Data Preprocessing:

In this step we check for null values in each column. If null values are present we use KNN Imputer to fill in those values with the mean of k neighbours of it.
Also we will remove the columns which have a standard deviation of 0, it means all the values in that column are same and hence that column won't add any meaning to the model training.

Clustering:

The idea behind clustering is to find enteries(rows) that are relatively similar to each other, create cluster and train separte model for each cluster. This Technique lets us get better accuracy by grouping similar data together.
We use Kmeans to cluster of preprocessed data and save the model for later use.

Model Selection:

After clusters are created, we find the best model for each cluster. Two algorithms are used RandomForest and XGBoost. We perform Grid Search CV to get both models for best paramenters and then compare their accuracy to get the better model.

Model Prediction

Here also all the above steps like Data Validation, Data Insertion in Database, Data Preprocessing and Clustering is performed. Based on the cluster group, the model is loaded and prediction is made.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
.idea		.idea
Clustering		Clustering
DataBase		DataBase
Data_Validation		Data_Validation
File_Ops		File_Ops
Logger		Logger
Model		Model
Predict_Result		Predict_Result
Prediction_Batch_Files		Prediction_Batch_Files
Prediction_Logs		Prediction_Logs
Prediction_Validation		Prediction_Validation
Schema		Schema
Training_Batch_Files		Training_Batch_Files
Training_Logs		Training_Logs
Training_Validation		Training_Validation
Transform_Data		Transform_Data
templates		templates
.gitignore		.gitignore
Prediction_Data.csv		Prediction_Data.csv
README.md		README.md
Training_Data.csv		Training_Data.csv
X.csv		X.csv
flask_monitoringdashboard.db		flask_monitoringdashboard.db
main.py		main.py
temp.csv		temp.csv
temp.txt		temp.txt
test.py		test.py
y.csv		y.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Waffer_Fault_Detection

Project OverView

Dataset:

Architecture:

Data Validation:

Data Insertion in Database

Model Training

Data Export from Db:

Data Preprocessing:

Clustering:

Model Selection:

Model Prediction

Project Screen

About

Releases

Packages

Contributors 2

Languages

AzeemWaqarRao/Waffer_Fault_Detection

Folders and files

Latest commit

History

Repository files navigation

Waffer_Fault_Detection

Project OverView

Dataset:

Architecture:

Data Validation:

Data Insertion in Database

Model Training

Data Export from Db:

Data Preprocessing:

Clustering:

Model Selection:

Model Prediction

Project Screen

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages