LLMToolkit

Introduction

llmtoolkit is a toolkit for NLP(Natural Language Processing) and LLM(Large Language Models) using Pytorch. llmtoolkit has implemented many language models and data preprocessing methods. More importantly, it provides a lot of examples that can run end-to-end.

Tokenizer

Support Models

Supported Language Models:

Supported Transformer Models:

Dependencies

Python 3.7+
Pytorch 1.5.0+

Reference:

https://zh.d2l.ai/
- Dive into Deep Learning，D2L.ai
https://github.com/dmlc/gluon-nlp/
- GluonNLP: NLP made easy
https://github.com/huggingface/tokenizers
- Provides an implementation of today's most used tokenizers, with a focus on performance and versatility.
https://github.com/The-AI-Summer/self-attention-cv
- Self-attention building blocks for computer vision applications in PyTorch
自然语言处理：基于预训练模型的方法（作者：车万翔、郭江、崔一鸣）

License

llmtoolkit is released under the Apache 2.0 license.

Citation

Please cite the repo if you use the data or code in this repo.

@misc{llmtoolkit,
  author = {jianzhnie},
  title = {llmtoolkit: llmtoolkit is a toolkit for NLP and LLMs using Pytorch},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/jianzhnie/LLMToolkit}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 395 Commits
docs		docs
examples		examples
llmtoolkit		llmtoolkit
.flake8		.flake8
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLMToolkit

Introduction

Tokenizer

Support Models

Dependencies

Reference:

License

Citation

About

Releases

Packages

Languages

License

jianzhnie/LLMToolkit

Folders and files

Latest commit

History

Repository files navigation

LLMToolkit

Introduction

Tokenizer

Support Models

Dependencies

Reference:

License

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages