Udacity_DeepLearning_CharacterLevelRNNExercise

This is my implementation of the Udacity character-level RNN exercise. The model generates text character by character, feeding each predicted character back in as the input for the next step.
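
For illustration, a generation loop along these lines could look like the sketch below. The names `model`, `char2int`, and `int2char` are assumptions for the trained network and the character/integer lookup tables built in the notebook, not its exact code.

```python
import numpy as np
import torch
import torch.nn.functional as F

def sample(model, char2int, int2char, prime='Anna', length=200, top_k=5):
    """Generate text one character at a time, feeding each pick back in."""
    model.eval()
    n_chars = len(char2int)
    chars = list(prime)
    hidden = None
    with torch.no_grad():
        # Run the priming text through the network to build up the hidden state
        for ch in prime:
            x = F.one_hot(torch.tensor([[char2int[ch]]]), n_chars).float()
            logits, hidden = model(x, hidden)
        # Each sampled character becomes the input for the next step
        for _ in range(length):
            probs = F.softmax(logits.squeeze(), dim=0)
            top_p, top_i = probs.topk(top_k)   # keep only the k most likely characters
            idx = int(np.random.choice(top_i.numpy(), p=(top_p / top_p.sum()).numpy()))
            chars.append(int2char[idx])
            x = F.one_hot(torch.tensor([[idx]]), n_chars).float()
            logits, hidden = model(x, hidden)
    return ''.join(chars)
```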

Dataset

The model was trained on the English version of the book Anna Karenina, stored at the path ./data/anna.txt.
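
Loading and encoding the text is straightforward; here is a minimal sketch (the lookup names are illustrative, not necessarily the ones used in the notebook):

```python
import numpy as np

# Read the book and build character <-> integer lookup tables
with open('./data/anna.txt', 'r') as f:
    text = f.read()

chars = tuple(set(text))
int2char = dict(enumerate(chars))
char2int = {ch: i for i, ch in int2char.items()}

# Encode the whole text as an array of integers
encoded = np.array([char2int[ch] for ch in text])
```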

Architecture

The model has the following architecture (a rough PyTorch sketch follows the list):

  • An LSTM with 2 hidden layers of 256 nodes
  • A Batch Normalization layer to avoid overfitting
  • A Fully Connected layer for the output
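
The sketch below shows how such a model might be wired up in PyTorch. The class and layer names, and the exact placement of the batch normalization layer, are my assumptions rather than the notebook's code.

```python
import torch
from torch import nn

class CharRNN(nn.Module):
    def __init__(self, n_chars, n_hidden=256, n_layers=2, drop_prob=0.5):
        super().__init__()
        # 2-layer LSTM with 256 hidden units and dropout between layers
        self.lstm = nn.LSTM(n_chars, n_hidden, n_layers,
                            dropout=drop_prob, batch_first=True)
        # Batch normalization over the LSTM output features
        self.bn = nn.BatchNorm1d(n_hidden)
        # Fully connected layer mapping hidden features to character scores
        self.fc = nn.Linear(n_hidden, n_chars)

    def forward(self, x, hidden=None):
        out, hidden = self.lstm(x, hidden)    # (batch, seq, n_hidden)
        out = out.reshape(-1, out.size(2))    # flatten to (batch*seq, n_hidden)
        out = self.bn(out)
        return self.fc(out), hidden           # one set of logits per character
```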

Training

The model was trained for 10 epochs with a dropout probability of 0.5 on the LSTM layers. The final training loss was 1.2885.
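
A training loop consistent with those settings might look roughly like the sketch below, building on the data and model sketches above. `get_batches` is a hypothetical helper that yields integer-encoded (input, target) arrays of shape (batch, seq_length); the optimizer, learning rate, and batch sizes are assumptions, only the epoch count and dropout match the numbers stated above.

```python
import torch
import torch.nn.functional as F
from torch import nn

model = CharRNN(n_chars=len(chars), n_hidden=256, n_layers=2, drop_prob=0.5)
criterion = nn.CrossEntropyLoss()                        # per-character classification loss
optimizer = torch.optim.Adam(model.parameters(), lr=0.001)

for epoch in range(10):                                  # 10 epochs, as noted above
    hidden = None
    for x, y in get_batches(encoded, batch_size=128, seq_length=100):  # hypothetical helper
        inputs = F.one_hot(torch.from_numpy(x).long(), len(chars)).float()
        targets = torch.from_numpy(y).long().view(-1)
        # Detach the hidden state so gradients don't flow across batches
        if hidden is not None:
            hidden = tuple(h.detach() for h in hidden)
        optimizer.zero_grad()
        logits, hidden = model(inputs, hidden)
        loss = criterion(logits, targets)
        loss.backward()
        nn.utils.clip_grad_norm_(model.parameters(), 5)  # guard against exploding gradients
        optimizer.step()
```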

Try it yourself

You can try the model yourself by cloning this repo and running the Character_Level_RNN_Exercise.ipynb notebook. Make sure the following packages are installed:

  • NumPy
  • PyTorch (torch)
