This repository contains code for training a rudimentary character-level Recurrent Neural Network (RNN) that generates text from a given input sequence. The model predicts the next character in a sequence, trained on a text corpus such as Shakespeare or other literature. The codebase exists for learning purposes; PyTorch provides a better-optimized solution.

This project uses the RNN to learn character-level text generation: the model learns to predict the next character based on the characters that precede it. After training, it can generate text that resembles the style of the training data, such as a Shakespearean play or classic literature.
The code is structured to:
- Preprocess the text data (a minimal encoding sketch follows this list).
- Train an RNN model on one-hot encoded character input.
- Generate new text by predicting the next character in a sequence.
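As an illustration of the preprocessing step, here is a minimal sketch of loading the corpus and one-hot encoding characters. The names `char_to_ix`, `ix_to_char`, and `one_hot` are hypothetical, chosen for these examples rather than taken from this codebase; `data/raw.txt` matches the setup step below.

```python
import numpy as np

# Read the corpus and build character <-> index mappings.
with open("data/raw.txt", "r", encoding="utf-8") as f:
    text = f.read()

chars = sorted(set(text))          # vocabulary of unique characters
vocab_size = len(chars)
char_to_ix = {ch: i for i, ch in enumerate(chars)}
ix_to_char = {i: ch for i, ch in enumerate(chars)}

def one_hot(ix, vocab_size):
    """Return a one-hot column vector for character index ix."""
    x = np.zeros((vocab_size, 1))
    x[ix] = 1.0
    return x
```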
The model is a simple RNN with the following structure:
- Input Layer: One-hot encoded characters.
- Hidden Layer: A recurrent layer (vanilla RNN).
- Output Layer: A softmax over the vocabulary that gives the probability distribution of the next character (one time step is sketched below).
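For reference, one time step of such a vanilla RNN can be sketched as follows. The weight names `Wxh`, `Whh`, `Why` and biases `bh`, `by` follow the common min-char-rnn convention and are assumptions, not names confirmed from this repository.

```python
import numpy as np

def rnn_step(x, h_prev, Wxh, Whh, Why, bh, by):
    """One time step of a vanilla RNN.

    x      : one-hot input vector, shape (vocab_size, 1)
    h_prev : previous hidden state, shape (hidden_size, 1)
    """
    h = np.tanh(Wxh @ x + Whh @ h_prev + bh)  # new hidden state
    y = Why @ h + by                          # unnormalized logits
    p = np.exp(y - np.max(y))                 # numerically stable softmax
    p = p / np.sum(p)                         # next-character distribution
    return h, p
```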
Key Components:
- RNN Forward Pass: Computes hidden states and outputs at each time step.
- Loss Function: Cross-entropy loss between the predicted character distribution and the true next character (see the sketch after this list).
- Backward Pass: Backpropagation through time (BPTT) to compute gradients and update the weights.
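To make the loss concrete, here is a hedged sketch of how cross-entropy accumulates over one training sequence, reusing the `one_hot` and `rnn_step` helpers sketched above. The function `sequence_loss` is illustrative, not a name from this repo; gradients would then be obtained by backpropagating through these unrolled steps (BPTT).

```python
def sequence_loss(inputs, targets, h, params):
    """Cross-entropy loss summed over a sequence of character indices."""
    Wxh, Whh, Why, bh, by = params
    loss = 0.0
    for t in range(len(inputs)):
        x = one_hot(inputs[t], Wxh.shape[1])       # Wxh.shape[1] == vocab_size
        h, p = rnn_step(x, h, Wxh, Whh, Why, bh, by)
        loss += -np.log(p[targets[t], 0])          # -log prob of the true char
    return loss, h
```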
- Clone the repository:
```bash
git clone https://github.com/nasirabd/NextChar-RNN.git
```
- Download a text corpus, such as Shakespeare or Alice's Adventures in Wonderland, from Project Gutenberg.
- Place the raw text file in the data/ folder. The file should be named raw.txt.
Run the training script from the terminal:
```bash
python main.py
```

After training finishes, the model is evaluated automatically and a new text sample is generated from the learned weights.
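Generation works by repeatedly sampling from the predicted next-character distribution and feeding each sample back in as the next input. A minimal sketch, reusing the `rnn_step`, `one_hot`, and `ix_to_char` sketches above (the function `sample` and its arguments are illustrative):

```python
def sample(h, seed_ix, n, params):
    """Generate n characters, starting from seed character index seed_ix."""
    Wxh, Whh, Why, bh, by = params
    x = one_hot(seed_ix, Wxh.shape[1])
    out = []
    for _ in range(n):
        h, p = rnn_step(x, h, Wxh, Whh, Why, bh, by)
        ix = np.random.choice(len(p), p=p.ravel())  # sample the next character
        x = one_hot(ix, Wxh.shape[1])               # feed the sample back in
        out.append(ix_to_char[ix])
    return "".join(out)
```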
You can modify the hyperparameters in main.py to control the training (example values are sketched after this list):
- hidden_size: Number of hidden units in the RNN.
- seq_length: The length of the input character sequence.
- learning_rate: Learning rate for optimization.
- num_epochs: Number of training epochs.
- save_interval: Number of epochs between model saves.
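For example, a plausible configuration inside main.py might look like the following. The specific values are illustrative defaults, not the repository's actual settings.

```python
hidden_size = 128     # hidden units in the RNN
seq_length = 25       # characters per training sequence
learning_rate = 1e-2  # step size for optimization
num_epochs = 50       # full passes over the corpus
save_interval = 10    # epochs between model saves
```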