(From a Hackathon hosted by Deka)
This repository contains a Jupyter Notebook that demonstrates the fine-tuning and evaluation of a BERT-based model for sequence classification tasks using PyTorch and Hugging Face's Transformers library. The notebook walks through the process of training a BERT model on a custom dataset, monitoring the training process, and evaluating model performance using accuracy metrics.
This notebook is from my participation in a workshop event at Deka and is intended for educational purposes only.
## Table of Contents

- Setup and Configuration
- Data Preparation
- Model Training
- Model Evaluation
- Results and Analysis
## Setup and Configuration

Ensure you have the following Python packages installed:

- `numpy`
- `pandas`
- `torch`
- `transformers`
- `keras`
- `matplotlib`
- `scikit-learn`
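They can be installed with pip, for example:

```bash
pip install numpy pandas torch transformers keras matplotlib scikit-learn
```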
- **Reproducibility**: The notebook sets seeds for `random`, `numpy`, and `torch` to ensure reproducibility across different runs. The seed value used is `42` (see the sketch after this list).
- **Data Preparation**: The notebook assumes you have a dataset ready for training and validation. The data should be preprocessed and loaded into PyTorch DataLoaders.
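A minimal sketch of the seeding step; the notebook's exact cell may differ:

```python
import random

import numpy as np
import torch

seed_val = 42  # the seed value used in the notebook

random.seed(seed_val)
np.random.seed(seed_val)
torch.manual_seed(seed_val)
torch.cuda.manual_seed_all(seed_val)  # also seed every GPU, if any are available
```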
To run the notebook:

1. **Load the Notebook**: Open the Jupyter Notebook file (`.ipynb`) in your Jupyter environment.
2. **Adjust Hyperparameters**: Modify parameters such as the number of epochs, learning rate, and batch size according to your needs (see the sketch below).
3. **Execute Cells**: Run all the cells in the notebook to execute the training and evaluation pipeline.
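For illustration, such parameters might be collected in a single cell; the values below are common defaults for BERT fine-tuning, not necessarily the ones used in the notebook (later sketches reuse these names):

```python
# Illustrative hyperparameters; adjust to your dataset and hardware.
epochs = 4            # number of passes over the training set
learning_rate = 2e-5  # AdamW learning rate
batch_size = 32       # samples per DataLoader batch
```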
## Data Preparation

- **Tokenization**: Convert text data into input IDs and attention masks suitable for BERT (see the sketch below).
- **DataLoaders**: Create PyTorch DataLoaders for batching and shuffling training and validation data.
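A sketch of both steps, assuming `sentences` is a list of raw strings, `labels` is a list of integer class IDs, and the `bert-base-uncased` checkpoint is used; the notebook's exact settings may differ:

```python
import torch
from torch.utils.data import DataLoader, RandomSampler, TensorDataset
from transformers import BertTokenizer

tokenizer = BertTokenizer.from_pretrained('bert-base-uncased')

# Tokenize every sentence into input IDs and attention masks.
encoded = tokenizer(
    sentences,               # assumed: a list of raw text strings
    padding='max_length',
    truncation=True,
    max_length=64,
    return_tensors='pt',
)

train_dataset = TensorDataset(
    encoded['input_ids'],
    encoded['attention_mask'],
    torch.tensor(labels),    # assumed: a list of integer class labels
)

# Batch and shuffle the training data; a validation loader is built the same way.
train_dataloader = DataLoader(
    train_dataset,
    sampler=RandomSampler(train_dataset),
    batch_size=batch_size,
)
```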
## Model Training

- **Initialization**: Load a pre-trained BERT model and configure the optimizer and learning rate scheduler.
- **Training Loop**: Perform fine-tuning by iterating over the training data, calculating the loss, and updating model parameters through backpropagation (see the sketch below).
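A condensed sketch of both steps, reusing `train_dataloader`, `epochs`, and `learning_rate` from the sketches above; the details are illustrative rather than copied from the notebook:

```python
import torch
from torch.optim import AdamW
from transformers import BertForSequenceClassification, get_linear_schedule_with_warmup

device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

# Load a pre-trained BERT model with a sequence-classification head.
model = BertForSequenceClassification.from_pretrained('bert-base-uncased', num_labels=2)
model.to(device)

optimizer = AdamW(model.parameters(), lr=learning_rate)
total_steps = len(train_dataloader) * epochs
scheduler = get_linear_schedule_with_warmup(
    optimizer, num_warmup_steps=0, num_training_steps=total_steps
)

for epoch in range(epochs):
    model.train()
    total_loss = 0.0
    for batch in train_dataloader:
        input_ids, attention_mask, batch_labels = (t.to(device) for t in batch)
        model.zero_grad()
        outputs = model(input_ids, attention_mask=attention_mask, labels=batch_labels)
        total_loss += outputs.loss.item()
        outputs.loss.backward()
        torch.nn.utils.clip_grad_norm_(model.parameters(), 1.0)  # guard against exploding gradients
        optimizer.step()
        scheduler.step()
    print(f'Epoch {epoch + 1}: average training loss = {total_loss / len(train_dataloader):.4f}')
```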
## Model Evaluation

- **Validation**: Evaluate the model on the validation dataset to measure performance.
- **Accuracy Calculation**: Use a custom accuracy function to compute the proportion of correct predictions (see the sketch below).
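A minimal version of such an accuracy function and validation pass, following the common argmax-over-logits pattern; `validation_dataloader` is assumed to be built like the training loader above:

```python
import numpy as np
import torch

def flat_accuracy(logits, label_ids):
    """Proportion of correct predictions, given raw logits and integer labels."""
    preds = np.argmax(logits, axis=1).flatten()
    return np.sum(preds == label_ids.flatten()) / len(label_ids.flatten())

model.eval()
total_accuracy = 0.0
with torch.no_grad():
    for batch in validation_dataloader:
        input_ids, attention_mask, batch_labels = (t.to(device) for t in batch)
        outputs = model(input_ids, attention_mask=attention_mask)
        logits = outputs.logits.detach().cpu().numpy()
        total_accuracy += flat_accuracy(logits, batch_labels.cpu().numpy())

print(f'Validation accuracy: {total_accuracy / len(validation_dataloader):.4f}')
```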
## Results and Analysis

- **Training Loss**: Monitored during each epoch to observe the convergence of the model.
- **Validation Accuracy**: Calculated at the end of each epoch to evaluate the model's performance on unseen data.
- **Matthews Correlation Coefficient (MCC)**: In the end we achieved an MCC of ~0.375, which is decent considering we did not do any hyperparameter tuning (see the sketch after this list).
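For reference, the MCC can be computed with scikit-learn; `all_labels` and `all_preds` below are assumed to be flat arrays of true and predicted class indices collected over the validation set:

```python
from sklearn.metrics import matthews_corrcoef

# assumed: flat arrays of true and predicted class indices
mcc = matthews_corrcoef(all_labels, all_preds)
print(f'MCC: {mcc:.3f}')
```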
## Acknowledgments

- **Hugging Face**: For providing the BERT model and the Transformers library.
- **PyTorch**: For its deep learning framework that enables efficient training and evaluation.
- **Deka**: For hosting the workshop.
- **Chris McCormick's Blog**