From a49e01e07e23b6208612509d21403a41b34c9011 Mon Sep 17 00:00:00 2001
From: Dzmitry Hramyka
Date: Fri, 26 Jan 2024 16:51:06 +0100
Subject: [PATCH] Cleanup of readme (#12)

* add generation mode and write valid Readme

* cleanup of readme
---
 README.md                         | 15 +++++++++++----
 assets/{logs.png => training.png} | Bin
 2 files changed, 11 insertions(+), 4 deletions(-)
 rename assets/{logs.png => training.png} (100%)

diff --git a/README.md b/README.md
index cb5e156..978ca6c 100644
--- a/README.md
+++ b/README.md
@@ -22,9 +22,9 @@ were collected from different sources. For more information about the dataset, p
 
 ## Project Overview
 
 The beLLM is a character-level language model trained on a collection of belarusian poems and prose.
-First inspired by the [nanoGPT](https://github.com/karpathy/nanoGPT) by `Andrej Karpathy`.
-The model architecture is based on the [GPT-2](https://github.com/openai/gpt-2) by `OpenAI`.
-The data was manually collected and preprocessed. The model was trained on a single GPU GeForce GTX 1080 Ti for 1000 epochs.
+First inspired by the [nanoGPT](https://github.com/karpathy/nanoGPT) by `Andrej Karpathy`, the model architecture is based on the [GPT-2](https://github.com/openai/gpt-2) by `OpenAI`.
+The data for training was manually collected and preprocessed. The model was trained on a single GeForce GTX 1080 Ti GPU for 1000 epochs.
+
 This repository contains the following core folders&files:
 
 - `model.py`: The main file with the model architecture and training loop.
 - `helpers.py`: The file with the helper functions.
 - `requirements.txt`: The file with the list of the required packages.
 - `data/`: The folder with the training data.
 - `generations/`: The folder with the generated text.
 - `models/`: The folder with the results of the training.
 
+The results of the training are stored in the `models/` folder, and the model weights are available on the [HuggingFace](https://huggingface.co/gromdimon/beLLM) model hub. Here is a screenshot of the training process:
+
+![training](assets/training.png)
+
+
 
 ## Technologies Used
@@ -113,7 +118,8 @@ make lint
 
 ## Dataset
 
-The dataset was collected from different sources and manually preprocessed. The dataset contains over 9.5 million characters. The dataset is available in the `data/` folder. The dataset includes the following sources:
+The dataset was collected from different sources and manually preprocessed. It contains over 9.5 million characters and is available in the `data/` folder. The dataset includes the following sources:
+
 - [Belaruskaja Palichka](https://knihi.com/)
 - [Ejka](https://ejka.ru/)
@@ -142,3 +148,4 @@ Big thanks to the following people for their work and inspiration:
 
 - [Andrej Karpathy](https://github.com/karpathy) for the [nanoGPT](https://github.com/karpathy/nanoGPT)
 - Anastasija Yashina for creating the dataset
+- [ChatGPT](https://chat.openai.com/) for generating the header image
\ No newline at end of file
diff --git a/assets/logs.png b/assets/training.png
similarity index 100%
rename from assets/logs.png
rename to assets/training.png
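
The README text added above points readers to the published weights on the HuggingFace model hub. As an illustration of what that enables, here is a minimal sketch of fetching and inspecting the checkpoint; the filename `model.pt` and the checkpoint layout are assumptions for illustration, not something this patch specifies:

```python
# Minimal sketch: pull the beLLM weights from the HuggingFace hub.
# Assumption: the repo stores a single PyTorch checkpoint named "model.pt";
# check https://huggingface.co/gromdimon/beLLM for the actual filename.
import torch
from huggingface_hub import hf_hub_download

# Download the checkpoint (cached locally after the first call).
ckpt_path = hf_hub_download(repo_id="gromdimon/beLLM", filename="model.pt")

# Load on CPU so the sketch runs without a GPU.
checkpoint = torch.load(ckpt_path, map_location="cpu")
print(type(checkpoint))  # typically a state_dict or a dict with training metadata
```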