Skip to content

daypu/simple-gpt

Repository files navigation

simple-gpt

Intro

This project implements a simple chat program based on the Transformer architecture, trained using the openwebtext dataset.

Run

  1. Use the following code to install the required dependencies.
pip install -r requirements.txt
  1. Run data-process.py, the program will automatically download the dataset and divide it into a training set and a validation set.

  2. Run train.py to train your own model

  3. Run chat.py to start the chat using the model you trained.

References

https://github.com/Infatoshi/fcc-intro-to-llms/tree/main

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published