A character-level LSTM reads a word as a sequence of characters, outputting a prediction and a “hidden state” at each step and feeding its previous hidden state into the next step. We take the final prediction as the output, i.e. which class the word belongs to. In this project, we will train on a few thousand names from 18 languages of origin and predict which language a name comes from based on its spelling.
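As a rough illustration of this idea, here is a minimal PyTorch sketch of a character-level LSTM classifier. The class name, layer choices, and shapes are assumptions for illustration and may differ from the actual model.py.

```python
import torch
import torch.nn as nn

class CharLSTMSketch(nn.Module):
    """Illustrative character-level LSTM classifier (not the repository's exact model)."""

    def __init__(self, n_chars, embed_dim, hidden_size, n_classes):
        super().__init__()
        self.embed = nn.Embedding(n_chars, embed_dim)            # one vector per character
        self.lstm = nn.LSTM(embed_dim, hidden_size, batch_first=True)
        self.fc = nn.Linear(hidden_size, n_classes)               # final hidden state -> language scores

    def forward(self, char_ids):
        # char_ids: (batch, name_length) of character indices
        # (variable-length names would need padding/packing in practice)
        x = self.embed(char_ids)                                   # (batch, name_length, embed_dim)
        outputs, (h_n, c_n) = self.lstm(x)                         # h_n: (1, batch, hidden_size)
        return self.fc(h_n[-1])                                    # prediction taken from the final step
```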
The dataset consists of names from 18 languages of origin and can be found in the data folder. There are 18 files, one per language, each containing one name per line.
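A minimal sketch of how such a folder could be read into (name, language) pairs, assuming one `.txt` file per language as described above (the path and helper name are illustrative):

```python
import glob
import os

def load_names(datapath="data/names"):
    """Read every language file and return a list of (name, language) pairs."""
    data = []
    for filepath in glob.glob(os.path.join(datapath, "*.txt")):
        language = os.path.splitext(os.path.basename(filepath))[0]  # file name = language label
        with open(filepath, encoding="utf-8") as f:
            for line in f:
                name = line.strip()
                if name:
                    data.append((name, language))
    return data
```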
The project is broken down into 3 files:
dataset.py : Loading, pre-processing and splitting the dataset using a DataLoader
model.py : Defining the layers of the LSTM model
main.py : Training the model on the training dataset and predicting languages for the test data; visualizing training and test loss and accuracy on the test set (a rough sketch of such a loop follows this list)
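For orientation, here is a hedged sketch of what one training epoch and a test-set evaluation along those lines might look like. The function names and batch format are assumptions, not the repository's actual API.

```python
import torch

def train_one_epoch(model, train_loader, optimizer, criterion):
    """Run one pass over the training DataLoader and return the average loss."""
    model.train()
    total_loss = 0.0
    for char_ids, labels in train_loader:      # assumed batch format: (character indices, language labels)
        optimizer.zero_grad()
        logits = model(char_ids)               # (batch, n_classes)
        loss = criterion(logits, labels)
        loss.backward()
        optimizer.step()
        total_loss += loss.item()
    return total_loss / len(train_loader)

@torch.no_grad()
def evaluate(model, test_loader):
    """Return classification accuracy on the test DataLoader."""
    model.eval()
    correct, total = 0, 0
    for char_ids, labels in test_loader:
        preds = model(char_ids).argmax(dim=1)  # predicted language index per name
        correct += (preds == labels).sum().item()
        total += labels.size(0)
    return correct / total
```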
- Python 3.6.10
- NumPy 1.18.4
- TensorBoard 2.0.0
- PyTorch 1.5.0
- Torchvision 0.6.0
- Matplotlib 3.2.1
- Scikit-learn 0.23.1
python main.py \
--datapath data/names \
--outdir output/ \
--epochlen 13 \
--modelname modelv \
--lr 0.05 \
--embed_dim 50 \
--hidden_size 100
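For reference, one plausible argparse setup in main.py that would accept the flags shown above; the defaults and help texts are assumptions:

```python
import argparse

def parse_args():
    parser = argparse.ArgumentParser(description="Character-level LSTM name classifier")
    parser.add_argument("--datapath", default="data/names", help="folder with one .txt file per language")
    parser.add_argument("--outdir", default="output/", help="where to write checkpoints and logs")
    parser.add_argument("--epochlen", type=int, default=13, help="number of training epochs")
    parser.add_argument("--modelname", default="modelv", help="name used for saved model files")
    parser.add_argument("--lr", type=float, default=0.05, help="learning rate")
    parser.add_argument("--embed_dim", type=int, default=50, help="character embedding dimension")
    parser.add_argument("--hidden_size", type=int, default=100, help="LSTM hidden state size")
    return parser.parse_args()
```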