From fe8d6f9c64feb8ded5bbda3349b67d086b614103 Mon Sep 17 00:00:00 2001 From: Lysandre Date: Tue, 18 Apr 2023 17:50:53 -0400 Subject: [PATCH 1/9] Awesome Transformers --- awesome-transformers.md | 405 ++++++++++++++++++++++++++++++++++++++++ 1 file changed, 405 insertions(+) create mode 100644 awesome-transformers.md diff --git a/awesome-transformers.md b/awesome-transformers.md new file mode 100644 index 00000000000000..7260c91cc3f7ff --- /dev/null +++ b/awesome-transformers.md @@ -0,0 +1,405 @@ +# Awesome projects built with Transformers + +This page lists awesome projects built on top of Transformers. Transformers is more than a toolkit to use pretrained +models: it's a community of projects built around it and the Hugging Face Hub. We want Transformers to enable +developers, researchers, students, professors, engineers, and anyone else to build their dream projects. + +In this list, we showcase incredibly impactful and novel projects that have pushed the field forward. We celebrate +100 of these projects as we reach the milestone of 100k stars as a community; but we're very open to pull requests +adding other projects to the list. If you believe a project should be here and it's not, then please, open a PR +to add it. + +## [gpt4all](https://github.com/nomic-ai/gpt4all) + +[gpt4all](https://github.com/nomic-ai/gpt4all) is an ecosystem of open-source chatbots trained on massive collections of clean assistant data including code, stories and dialogue. It offers open-source, large language models such as LLaMA and GPT-J trained in an assistant-style. + +Keywords: Open-source, LLaMa, GPT-J, instruction, assistant + +## [recommenders](https://github.com/microsoft/recommenders) + +This repository contains examples and best practices for building recommendation systems, provided as Jupyter notebooks. It goes over several aspects required to build efficient recommendation systems: data preparation, modeling, evaluation, model selection & optimization, as well as operationalization + +Keywords: Recommender systems, AzureML + +## [lama-cleaner](https://github.com/Sanster/lama-cleaner) + +Image inpainting tool powered by Stable Diffusion. Remove any unwanted object, defect, people from your pictures or erase and replace anything on your pictures. + +Keywords: inpainting, SD, Stable Diffusion + +## [flair](https://github.com/flairNLP/flair) + +FLAIR is a powerful PyTorch NLP framework, convering several important tasks: NER, sentiment-analysis, part-of-speech tagging, text and ducoment embeddings, among other things. + +Keywords: NLP, text embedding, document embedding, biomedical, NER, PoS, sentiment-analysis + +## [mindsdb](https://github.com/mindsdb/mindsdb) + +MindsDB is a low-code ML platform, which automates and integrates several ML frameworks into the data stack as "AI Tables" to streamline the integration of AI into applications, making it accessible to developers of all skill levels. + +## [langchain](https://github.com/hwchase17/langchain) + +[langchain](https://github.com/hwchase17/langchain) is aimed at assisting in the development of apps merging both LLMs and other sources of knowledge. The library allows chaining calls to applications, creating a sequence across many tools. + +Keywords: LLMs, Large Language Models, Agents, Chains + +## [ParlAI](https://github.com/facebookresearch/ParlAI) + +[ParlAI](https://github.com/facebookresearch/ParlAI) is a python framework for sharing, training and testing dialogue models, from open-domain chitchat, to task-oriented dialogue, to visual question answering. It provides more than 100 datasets under the same API, a large zoo of pretrained models, a set of agents, and has several integrations. + +## [sentence-transformers](https://github.com/UKPLab/sentence-transformers) + +This framework provides an easy method to compute dense vector representations for sentences, paragraphs, and images. The models are based on transformer networks like BERT / RoBERTa / XLM-RoBERTa etc. and achieve state-of-the-art performance in various task. Text is embedding in vector space such that similar text is close and can efficiently be found using cosine similarity. + +## [ludwig](https://github.com/ludwig-ai/ludwig) + +Ludwig is a declarative machine learning framework that makes it easy to define machine learning pipelines using a simple and flexible data-driven configuration system. Ludwig is targeted at a wide variety of AI tasks. It provides a data-driven configuration system, training, prediction, and evaluation scripts, as well as a programmatic API. + +## [InvokeAI](https://github.com/invoke-ai/InvokeAI) + +[InvokeAI](https://github.com/invoke-ai/InvokeAI) is an engine for Stable Diffusion models, aimed at professionals, artists, and enthusiasts. It leverages the latest AI-driven technologies through CLI as well as a WebUI. + +## [PaddleNLP](https://github.com/PaddlePaddle/PaddleNLP) + +[PaddleNLP](https://github.com/PaddlePaddle/PaddleNLP) is an easy-to-use and powerful NLP library particularly targeted at the Chinese languages. It has support for multiple pre-trained model zoos, and supports a wide-range of NLP tasks from research to industrial applications. + +## [stanza](https://github.com/stanfordnlp/stanza) + +The Stanford NLP Group's official Python NLP library. It contains support for running various accurate natural language processing tools on 60+ languages and for accessing the Java Stanford CoreNLP software from Python. + +## [DeepPavlov](https://github.com/deeppavlov/DeepPavlov) + +[DeepPavlov](https://github.com/deeppavlov/DeepPavlov) is an open-source conversational AI library. It is designed for the development of production ready chat-bots and complex conversational systems, as well as research in the area of NLP and, particularly, of dialog systems. + +## [alpaca-lora](https://github.com/tloen/alpaca-lora) + +Alpaca-lora contains code for reproducing the Stanford Alpaca results using low-rank adaptation (LoRA). The repository provides training (fine-tuning) as well as generation scripts. + +## [imagen-pytorch](https://github.com/lucidrains/imagen-pytorch) + +An open-source Implementation of Imagen, Google's closed-source Text-to-Image Neural Network that beats DALL-E2. As of release, it is the new SOTA for text-to-image synthesis. + +## [adapter-transformers](https://github.com/adapter-hub/adapter-transformers) + +[adapter-transformers](https://github.com/adapter-hub/adapter-transformers) is an extension of HuggingFace's Transformers library, integrating adapters into state-of-the-art language models by incorporating AdapterHub, a central repository for pre-trained adapter modules. It is a drop-in replacement for transformers, which is regularly updated to stay up-to-date with the developments of transformers. + +## [NeMo](https://github.com/NVIDIA/NeMo) + +NVIDIA [NeMo](https://github.com/NVIDIA/NeMo) is a conversational AI toolkit built for researchers working on automatic speech recognition (ASR), text-to-speech synthesis (TTS), large language models (LLMs), and natural language processing (NLP). The primary objective of [NeMo](https://github.com/NVIDIA/NeMo) is to help researchers from industry and academia to reuse prior work (code and pretrained models) and make it easier to create new https://developer.nvidia.com/conversational-ai#started. + +## [MONAI](https://github.com/Project-MONAI/MONAI) + +[MONAI](https://github.com/Project-MONAI/MONAI) is a PyTorch-based, open-source framework for deep learning in healthcare imaging, part of PyTorch Ecosystem. Its ambitions are: +• developing a community of academic, industrial and clinical researchers collaborating on a common foundation; +• creating state-of-the-art, end-to-end training workflows for healthcare imaging; +• providing researchers with the optimized and standardized way to create and evaluate deep learning models. + +## [simpletransformers](https://github.com/ThilinaRajapakse/simpletransformers) + +Simple Transformers lets you quickly train and evaluate Transformer models. Only 3 lines of code are needed to initialize, train, and evaluate a model. It supports a wide variety of NLP tasks. + +## [JARVIS](https://github.com/microsoft/JARVIS) + +[JARVIS](https://github.com/microsoft/JARVIS) is a system attempting to merge LLMs such as GPT-4 with the rest of the open-source ML community: leveraging up to 60 downstream models in order to perform tasks identified by the LLM. + +## [](https://xenova.github.io/transformers.js/) + +[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)f[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/).[](https://xenova.github.io/transformers.js/)j[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)J[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)v[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)S[](https://xenova.github.io/transformers.js/)c[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)p[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)l[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)b[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)y[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)g[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)d[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)u[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)g[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)d[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)l[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)f[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)f[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)d[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)c[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)l[](https://xenova.github.io/transformers.js/)y[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)w[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)h[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)h[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)b[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)w[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/).[](https://xenova.github.io/transformers.js/) + +## [bumblebee](https://github.com/elixir-nx/bumblebee) + +Bumblebee provides pre-trained Neural Network models on top of Axon, a neural networks library for the Elixir language. It includes integration with 🤗 Models, allowing anyone to download and perform Machine Learning tasks with few lines of code. + +## [argilla](https://github.com/argilla-io/argilla) + +Argilla is an open-source platform providing advanced NLP labeling, monitoring, and workspaces. It is compatible with many open source ecosystems such as Hugging Face, Stanza, FLAIR, and others. + +## [haystack](https://github.com/deepset-ai/haystack) + +Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs. It offers production-ready tools to quickly build complex decision making, question answering, semantic search, text generation applications, and more. + +## [spaCy](https://github.com/explosion/spaCy) + +[spaCy](https://github.com/explosion/spaCy) is a library for advanced Natural Language Processing + in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. It offers support for transformers models through its third party package, spacy-transformers. + +## [speechbrain](https://github.com/speechbrain/speechbrain) + +SpeechBrain is an open-source and all-in-one conversational AI toolkit based on PyTorch. +The goal is to create a single, flexible, and user-friendly toolkit that can be used to easily develop state-of-the-art speech technologies, including systems for speech recognition, speaker recognition, speech enhancement, speech separation, language identification, multi-microphone signal processing, and many others. + +## [skorch](https://github.com/skorch-dev/skorch) + +Skorch is a scikit-learn compatible neural network library that wraps PyTorch. It has support for models within transformers, and tokenizers from tokenizers. + +## [bertviz](https://github.com/jessevig/bertviz) + +BertViz is an interactive tool for visualizing attention in Transformer language models such as BERT, GPT2, or T5. It can be run inside a Jupyter or Colab notebook through a simple Python API that supports most Huggingface models. + +## [mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) + +[mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) is a haiku library using the xmap/pjit operators in JAX for model parallelism of transformers. This library is designed for scalability up to approximately 40B parameters on TPUv3s. + +## [deepchem](https://github.com/deepchem/deepchem) + +DeepChem aims to provide a high quality open-source toolchain that democratizes the use of deep-learning in drug discovery, materials science, quantum chemistry, and biology. + +## [OpenNRE](https://github.com/thunlp/OpenNRE) + +An Open-Source Package for Neural Relation Extraction (NRE). It is targeted at a wide range of users, from newcomers to relation extraction, to developers, researchers, or students. + +## [pycorrector](https://github.com/shibing624/pycorrector) + +PyCorrector is a Chinese Text Error Correction Tool. It uses a language model to detect errors, pinyin feature and shape feature to correct Chinese text errors. it can be used for Chinese Pinyin and stroke input method. + +## [nlpaug](https://github.com/makcedward/nlpaug) + +This python library helps you with augmenting nlp for machine learning projects. It is a lightweight library featuring synthetic data generation for improving model performance, support for audio and text, and compatibility with several ecosystems (scikit-learn, pytorch, tensorflow). + +## [dream-textures](https://github.com/carson-katri/dream-textures) + +[dream-textures](https://github.com/carson-katri/dream-textures) is a library targeted at bringing stable-diffusion support within Blender. It supports several use-cases, such as image generation, texture projection, inpainting/outpainting, rendering passes, and upscaling. + +## [seldon-core](https://github.com/SeldonIO/seldon-core) + +Seldon core converts your ML models (Tensorflow, Pytorch, H2o, etc.) or language wrappers (Python, Java, etc.) into production REST/GRPC microservices. +Seldon handles scaling to thousands of production machine learning models and provides advanced machine learning capabilities out of the box including Advanced Metrics, Request Logging, Explainers, Outlier Detectors, A/B Tests, Canaries and more. + +## [open_model_zoo](https://github.com/openvinotoolkit/open_model_zoo) + +This repository includes optimized deep learning models and a set of demos to expedite development of high-performance deep learning inference applications. Use these free pre-trained models instead of training your own models to speed-up the development and production deployment process. + +## [ml-stable-diffusion](https://github.com/apple/ml-stable-diffusion) + +ML-Stable-Diffusion is a repository by Appel bringing Stable Diffusion support to Core ML, on Apple Silicon devices. It supports stable diffusion checkpoints hosted on the Hugging Face Hub. + +## [stable-dreamfusion](https://github.com/ashawkey/stable-dreamfusion) + +Stable-Dreamfusion is a pytorch implementation of the text-to-3D model Dreamfusion, powered by the Stable Diffusion text-to-2D model. + +## [txtai](https://github.com/neuml/txtai) + +[txtai](https://github.com/neuml/txtai) is an open-source platform for semantic search and workflows powered by language models. + +## [djl](https://github.com/deepjavalibrary/djl) + +Deep Java Library (DJL) is an open-source, high-level, engine-agnostic Java framework for deep learning. DJL is designed to be easy to get started with and simple to use for Java developers. DJL provides a native Java development experience and functions like any other regular Java library. + +## [gpt-neox](https://github.com/EleutherAI/gpt-neox) + +This repository records EleutherAI's library for training large-scale language models on GPUs. The framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. It is focused on training multi-billion-parameter models. + +## [muzic](https://github.com/microsoft/muzic) + +Muzic is a research project on AI music that empowers music understanding and generation with deep learning and artificial intelligence. Muzic was created by researchers from Microsoft Research Asia. + +## [dalle-flow](https://github.com/jina-ai/dalle-flow) + +DALL·E Flow is an interactive workflow for generating high-definition images from a text prompt. Itt leverages DALL·E-Mega, GLID-3 XL, and Stable Diffusion to generate image candidates, and then calls CLIP-as-service to rank the candidates w.r.t. the prompt. +The preferred candidate is fed to GLID-3 XL for diffusion, which often enriches the texture and background. Finally, the candidate is upscaled to 1024x1024 via SwinIR. + +## [lightseq](https://github.com/bytedance/lightseq) + +LightSeq is a high performance training and inference library for sequence processing and generation implemented in CUDA. It enables highly efficient computation of modern NLP and CV models such as BERT, GPT, Transformer, etc. It is therefore best useful for machine translation, text generation, image classification, and other sequence related tasks. + +## [LaTeX-OCR](https://github.com/lukas-blecher/LaTeX-OCR) + +The goal of this project is to create a learning based system that takes an image of a math formula and returns corresponding LaTeX code. + +## [open_clip](https://github.com/mlfoundations/open_clip) + +OpenCLIP is an open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training). +The goal of this repository is to enable training models with contrastive image-text supervision, and to investigate their properties such as robustness to distribution shift. The starting point is an implementation of CLIP that matches the accuracy of the original CLIP models when trained on the same dataset. Specifically, a ResNet-50 model trained with our codebase on OpenAI's 15 million image subset of YFCC achieves 32.7% top-1 accuracy on ImageNet. + +## [dalle-playground](https://github.com/saharmor/dalle-playground) + +A playground to generate images from any text prompt using Stable Diffusion and Dall-E mini. + +## [FedML](https://github.com/FedML-AI/FedML) + +[FedML](https://github.com/FedML-AI/FedML) is a federated learning and analytics library enabling secure and collaborative machine learning on decentralized data anywhere at any scale. + +It supports large-scale cross-silo federated learning, and cross-device federated learning on smartphones/IoTs, and research simulation. + +## [gpt-code-clippy](https://github.com/CodedotAl/gpt-code-clippy) + +GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model -- based on GPT-3, called GPT-Codex -- that is fine-tuned on publicly available code from GitHub. + +## [TextAttack](https://github.com/QData/TextAttack) + +[TextAttack](https://github.com/QData/TextAttack) 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP. + +## [OpenPrompt](https://github.com/thunlp/OpenPrompt) + +Prompt-learning is a paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks, which modify the input text with a textual template and directly uses PLMs to conduct pre-trained tasks. This library provides a standard, flexible and extensible framework to deploy the prompt-learning pipeline. [OpenPrompt](https://github.com/thunlp/OpenPrompt) supports loading PLMs directly from https://github.com/huggingface/transformers. + +## [](https://github.com/oobabooga/text-generation-webui/) + +[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)d[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)o[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)w[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/)b[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)U[](https://github.com/oobabooga/text-generation-webui/)I[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)f[](https://github.com/oobabooga/text-generation-webui/)o[](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/)u[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)u[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)M[](https://github.com/oobabooga/text-generation-webui/)o[](https://github.com/oobabooga/text-generation-webui/)d[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)s[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)k[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)M[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)m[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/).[](https://github.com/oobabooga/text-generation-webui/)c[](https://github.com/oobabooga/text-generation-webui/)p[](https://github.com/oobabooga/text-generation-webui/)p[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)G[](https://github.com/oobabooga/text-generation-webui/)P[](https://github.com/oobabooga/text-generation-webui/)T[](https://github.com/oobabooga/text-generation-webui/)-[](https://github.com/oobabooga/text-generation-webui/)J[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)P[](https://github.com/oobabooga/text-generation-webui/)y[](https://github.com/oobabooga/text-generation-webui/)t[](https://github.com/oobabooga/text-generation-webui/)h[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)O[](https://github.com/oobabooga/text-generation-webui/)P[](https://github.com/oobabooga/text-generation-webui/)T[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)d[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)G[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/)C[](https://github.com/oobabooga/text-generation-webui/)T[](https://github.com/oobabooga/text-generation-webui/)I[](https://github.com/oobabooga/text-generation-webui/)C[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/).[](https://github.com/oobabooga/text-generation-webui/) + +## [libra](https://github.com/Palashio/libra) + +An ergonomic machine learning [libra](https://github.com/Palashio/libra)ry for non-technical users. It focuses on ergonomics and on ensuring that training a model is as simple as it can be. + +## [alibi](https://github.com/SeldonIO/alibi) + +Alibi is an open source Python library aimed at machine learning model inspection and interpretation. The focus of the library is to provide high-quality implementations of black-box, white-box, local and global explanation methods for classification and regression models. + +## [tortoise-tts](https://github.com/neonbjb/tortoise-tts) + +Tortoise is a text-to-speech program built with the following priorities: strong multi-voice capabilities., and highly realistic prosody and intonation. + +## [flower](https://github.com/adap/flower) + +Flower (flwr) is a framework for building federated learning systems. The design of Flower is based on a few guiding principles: customizability, extendability, framework agnosticity, and ease-of-use. + +## [fast-bert](https://github.com/utterworks/fast-bert) + +Fast-Bert is a deep learning library that allows developers and data scientists to train and deploy BERT and XLNet based models for natural language processing tasks beginning with Text Classification. It is aimed at simplicity. + +## [towhee](https://github.com/towhee-io/towhee) + +Towhee makes it easy to build neural data processing pipelines for AI applications. We provide hundreds of models, algorithms, and transformations that can be used as standard pipeline building blocks. Users can use Towhee's Pythonic API to build a prototype of their pipeline and automatically optimize it for production-ready environments. + +## [alibi-detect](https://github.com/SeldonIO/alibi-detect) + +Alibi Detect is an open source Python library focused on outlier, adversarial and drift detection. The package aims to cover both online and offline detectors for tabular data, text, images and time series. Both TensorFlow and PyTorch backends are supported for drift detection. + +## [FARM](https://github.com/deepset-ai/FARM) + +[FARM](https://github.com/deepset-ai/FARM) makes Transfer Learning with BERT & Co simple, fast and enterprise-ready. It's built upon transformers and provides additional features to simplify the life of developers: Parallelized preprocessing, highly modular design, multi-task learning, experiment tracking, easy debugging and close integration with AWS SageMaker. + +## [aitextgen](https://github.com/minimaxir/aitextgen) + +A robust Python tool for text-based AI training and generation using OpenAI's GPT-2 and EleutherAI's GPT Neo/GPT-3 architecture. +[aitextgen](https://github.com/minimaxir/aitextgen) is a Python package that leverages PyTorch, Hugging Face Transformers and pytorch-lightning with specific optimizations for text generation using GPT-2, plus many added features. + +## [diffgram](https://github.com/diffgram/diffgram) + +Diffgram aims to integrate human supervision into platforms. We support your team programmatically changing the UI (Schema, layout, etc.) like in Streamlit. This means that you can collect and annotate timely data from users. In other words, we are the platform behind your platform, an integrated part of your application, to ship new & better AI products faster. + +## [ecco](https://github.com/jalammar/ecco) + +Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0). + +## [s3prl](https://github.com/s3prl/s3prl) + +[s3prl](https://github.com/s3prl/s3prl) stands for Self-Supervised Speech Pre-training and Representation Learning. Self-supervised speech pre-trained models are called upstream in this toolkit, and are utilized in various downstream tasks. + +## [ru-dalle](https://github.com/ai-forever/ru-dalle) + +RuDALL-E aims to be similar to DALL-E, targeted to Russian. + +## [DeepKE](https://github.com/zjunlp/DeepKE) + +[DeepKE](https://github.com/zjunlp/DeepKE) is a knowledge extraction toolkit for knowledge graph construction supporting cnSchema,low-resource, document-level and multimodal scenarios for entity, relation and attribute extraction. + +## [nebullvm](https://github.com/nebuly-ai/nebullvm) + +Nebullvm is an ecosystem of plug and play modules to optimize the performances of your AI systems. The optimization modules are stack-agnostic and work with any library. They are designed to be easily integrated into your system, providing a quick and seamless boost to its performance. Simply plug and play to start realizing the benefits of optimized performance right away. + +## [imaginAIry](https://github.com/brycedrennan/imaginAIry) + +Offers a CLI and a Python API to generate images with Stable Diffusion. It has support for many tools, like image structure control (controlnet), instruction-based image edits (InstructPix2Pix), prompt-based masking (clipseg), among others. + +## [sparseml](https://github.com/neuralmagic/sparseml) + +SparseML is an open-source model optimization toolkit that enables you to create inference-optimized sparse models using pruning, quantization, and distillation algorithms. Models optimized with SparseML can then be exported to the ONNX and deployed with DeepSparse for GPU-class performance on CPU hardware. + +## [opacus](https://github.com/pytorch/opacus) + +Opacus is a library that enables training PyTorch models with differential privacy. It supports training with minimal code changes required on the client, has little impact on training performance, and allows the client to online track the privacy budget expended at any given moment. + +## [LAVIS](https://github.com/salesforce/LAVIS) + +[LAVIS](https://github.com/salesforce/LAVIS) is a Python deep learning library for LAnguage-and-VISion intelligence research and applications. This library aims to provide engineers and researchers with a one-stop solution to rapidly develop models for their specific multimodal scenarios, and benchmark them across standard and customized datasets. It features a unified interface design to access + +## [buzz](https://github.com/chidiwilliams/buzz) + +Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. + +## [rust-bert](https://github.com/guillaume-be/rust-bert) + +Rust-native state-of-the-art Natural Language Processing models and pipelines. Port of Hugging Face's Transformers library, using the tch-rs crate and pre-processing from rust-tokenizers. Supports multi-threaded tokenization and GPU inference. This repository exposes the model base architecture, task-specific heads (see below) and ready-to-use pipelines. + +## [EasyNLP](https://github.com/alibaba/EasyNLP) + +[EasyNLP](https://github.com/alibaba/EasyNLP) is an easy-to-use NLP development and application toolkit in PyTorch, first released inside Alibaba in 2021. It is built with scalable distributed training strategies and supports a comprehensive suite of NLP algorithms for various NLP applications. [EasyNLP](https://github.com/alibaba/EasyNLP) integrates knowledge distillation and few-shot learning for landing large pre-trained models, together with various popular multi-modality pre-trained models. It provides a unified framework of model training, inference, and deployment for real-world applications. + +## [TurboTransformers](https://github.com/Tencent/TurboTransformers) + +A fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU. + +## [hivemind](https://github.com/learning-at-home/hivemind) + +Hivemind is a PyTorch library for decentralized deep learning across the Internet. Its intended usage is training one large model on hundreds of computers from different universities, companies, and volunteers. + +## [docquery](https://github.com/impira/docquery) + +DocQuery is a library and command-line tool that makes it easy to analyze semi-structured and unstructured documents (PDFs, scanned images, etc.) using large language models (LLMs). You simply point DocQuery at one or more documents and specify a question you want to ask. DocQuery is created by the team at Impira. + +## [CodeGeeX](https://github.com/THUDM/CodeGeeX) + +[CodeGeeX](https://github.com/THUDM/CodeGeeX) is a large-scale multilingual code generation model with 13 billion parameters, pre-trained on a large code corpus of more than 20 programming languages. It has several unique features: +- Multilingual code generation +- Crosslingual code translation +- Is a customizable programming assistant + +## [ktrain](https://github.com/amaiya/ktrain) + +[ktrain](https://github.com/amaiya/ktrain) is a lightweight wrapper for the deep learning library TensorFlow Keras (and other libraries) to help build, train, and deploy neural networks and other machine learning models. Inspired by ML framework extensions like fastai and ludwig, [ktrain](https://github.com/amaiya/ktrain) is designed to make deep learning and AI more accessible and easier to apply for both newcomers and experienced practitioners. + +## [FastDeploy](https://github.com/PaddlePaddle/FastDeploy) + +[FastDeploy](https://github.com/PaddlePaddle/FastDeploy) is an Easy-to-use and High Performance AI model deployment toolkit for Cloud, Mobile and Edge with packageout-of-the-box and unified experience, endend-to-end optimization for over fire160+ Text, Vision, Speech and Cross-modal AI models. Including image classification, object detection, OCR, face detection, matting, pp-tracking, NLP, stable diffusion, TTS and other tasks to meet developers' industrial deployment needs for multi-scenario, multi-hardware and multi-platform. + +## [underthesea](https://github.com/undertheseanlp/underthesea) + +[underthesea](https://github.com/undertheseanlp/underthesea) is a Vietnamese NLP toolkit. Underthesea is a suite of open source Python modules data sets and tutorials supporting research and development in Vietnamese Natural Language Processing. We provides extremely easy API to quickly apply pretrained NLP models to your Vietnamese text, such as word segmentation, part-of-speech tagging (PoS), named entity recognition (NER), text classification and dependency parsing. + +## [hasktorch](https://github.com/hasktorch/hasktorch) + +Hasktorch is a library for tensors and neural networks in Haskell. It is an independent open source community project which leverages the core C++ libraries shared by PyTorch. + +## [donut](https://github.com/clovaai/donut) + +Donut, or Document understanding transformer, is a new method of document understanding that utilizes an OCR-free end-to-end Transformer model. + +Donut does not require off-the-shelf OCR engines/APIs, yet it shows state-of-the-art performances on various visual document understanding tasks, such as visual document classification or information extraction (a.k.a. document parsing). + +## [transformers-interpret](https://github.com/cdpierse/transformers-interpret) + +Transformers Interpret is a model explainability tool designed to work exclusively with the hugs transformers package. + +In line with the philosophy of the Transformers package Transformers Interpret allows any transformers model to be explained in just two lines. Explainers are available for both text and computer vision models. Visualizations are also available in notebooks and as savable png and html files + +## [mlrun](https://github.com/mlrun/mlrun) + +MLRun is an open MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications, significantly reducing engineering efforts, time to production, and computation resources. With MLRun, you can choose any IDE on your local machine or on the cloud. MLRun breaks the silos between data, ML, software, and DevOps/MLOps teams, enabling collaboration and fast continuous improvements. + +## [FederatedScope](https://github.com/alibaba/FederatedScope) + +[FederatedScope](https://github.com/alibaba/FederatedScope) is a comprehensive federated learning platform that provides convenient usage and flexible customization for various federated learning tasks in both academia and industry. Based on an event-driven architecture, [FederatedScope](https://github.com/alibaba/FederatedScope) integrates rich collections of functionalities to satisfy the burgeoning demands from federated learning, and aims to build up an easy-to-use platform for promoting learning safely and effectively. + +## [pythainlp](https://github.com/PyThaiNLP/pythainlp) + +PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK with focus on Thai language. + +Keywords: Thai, NLP, NLTK + +## [FlagAI](https://github.com/FlagAI-Open/FlagAI) + +[FlagAI](https://github.com/FlagAI-Open/FlagAI) (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model. Our goal is to support training, fine-tuning, and deployment of large-scale models on various downstream tasks with multi-modality. + +## [pyserini](https://github.com/castorini/pyserini) + +[pyserini](https://github.com/castorini/pyserini) is a Python toolkit for reproducible information retrieval research with sparse and dense representations. Retrieval using sparse representations is provided via integration with the group's Anserini IR toolkit. Retrieval using dense representations is provided via integration with Facebook's Faiss library. + +Keywords: IR, Information Retrieval, Dense, Sparse + +## [baal](https://github.com/baal-org/baal) + +[baal](https://github.com/baal-org/baal) is an active learning library that supports both industrial applications and research usecases. [baal](https://github.com/baal-org/baal) currently supports Monte-Carlo Dropout, MCDropConnect, deep ensembles, and semi-supervised learning. + +Keywords: Active Learning, Research, Labeling + From c89e7777a4574ae49ddfb000a3b680a33a7aa7ca Mon Sep 17 00:00:00 2001 From: Lysandre Date: Tue, 18 Apr 2023 18:02:30 -0400 Subject: [PATCH 2/9] Update --- awesome-transformers.md | 10 ++++++++-- 1 file changed, 8 insertions(+), 2 deletions(-) diff --git a/awesome-transformers.md b/awesome-transformers.md index 7260c91cc3f7ff..7e1d5690ea335e 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -208,8 +208,14 @@ The goal of this project is to create a learning based system that takes an imag ## [open_clip](https://github.com/mlfoundations/open_clip) -OpenCLIP is an open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training). -The goal of this repository is to enable training models with contrastive image-text supervision, and to investigate their properties such as robustness to distribution shift. The starting point is an implementation of CLIP that matches the accuracy of the original CLIP models when trained on the same dataset. Specifically, a ResNet-50 model trained with our codebase on OpenAI's 15 million image subset of YFCC achieves 32.7% top-1 accuracy on ImageNet. +OpenCLIP is an open source implementation of OpenAI's CLIP. + +The goal of this repository is to enable training models with contrastive image-text supervision, and to investigate their properties such as robustness to distribution shift. +The starting point is an implementation of CLIP that matches the accuracy of the original CLIP models when trained on the same dataset. + +Specifically, a ResNet-50 model trained with this codebase on OpenAI's 15 million image subset of YFCC achieves 32.7% top-1 accuracy on ImageNet. + +Keywords: CLIP, Open-source, Contrastice, Image-text ## [dalle-playground](https://github.com/saharmor/dalle-playground) From b2a6eb7100dd66dc23960dc352f2e194b070bdaa Mon Sep 17 00:00:00 2001 From: Lysandre Date: Tue, 18 Apr 2023 18:02:38 -0400 Subject: [PATCH 3/9] Update --- awesome-transformers.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/awesome-transformers.md b/awesome-transformers.md index 7e1d5690ea335e..f21d525f9fd674 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -215,7 +215,7 @@ The starting point is an implementation of CLIP that matches the accuracy of the Specifically, a ResNet-50 model trained with this codebase on OpenAI's 15 million image subset of YFCC achieves 32.7% top-1 accuracy on ImageNet. -Keywords: CLIP, Open-source, Contrastice, Image-text +Keywords: CLIP, Open-source, Contrastive, Image-text ## [dalle-playground](https://github.com/saharmor/dalle-playground) From 01f51a5a13e3c45cc418d5da894bee99e66c3b34 Mon Sep 17 00:00:00 2001 From: Lysandre Date: Thu, 20 Apr 2023 17:12:42 +0200 Subject: [PATCH 4/9] Keywords --- awesome-transformers.md | 215 +++++++++++++++++++++++++++++++++------- 1 file changed, 177 insertions(+), 38 deletions(-) diff --git a/awesome-transformers.md b/awesome-transformers.md index f21d525f9fd674..6e3c7fb44d0e4c 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -37,6 +37,8 @@ Keywords: NLP, text embedding, document embedding, biomedical, NER, PoS, sentime MindsDB is a low-code ML platform, which automates and integrates several ML frameworks into the data stack as "AI Tables" to streamline the integration of AI into applications, making it accessible to developers of all skill levels. +Keywords: Database, low-code, AI table + ## [langchain](https://github.com/hwchase17/langchain) [langchain](https://github.com/hwchase17/langchain) is aimed at assisting in the development of apps merging both LLMs and other sources of knowledge. The library allows chaining calls to applications, creating a sequence across many tools. @@ -47,165 +49,242 @@ Keywords: LLMs, Large Language Models, Agents, Chains [ParlAI](https://github.com/facebookresearch/ParlAI) is a python framework for sharing, training and testing dialogue models, from open-domain chitchat, to task-oriented dialogue, to visual question answering. It provides more than 100 datasets under the same API, a large zoo of pretrained models, a set of agents, and has several integrations. +Keywords: Dialogue, Chatbots, VQA, Datasets, Agents + ## [sentence-transformers](https://github.com/UKPLab/sentence-transformers) -This framework provides an easy method to compute dense vector representations for sentences, paragraphs, and images. The models are based on transformer networks like BERT / RoBERTa / XLM-RoBERTa etc. and achieve state-of-the-art performance in various task. Text is embedding in vector space such that similar text is close and can efficiently be found using cosine similarity. +This framework provides an easy method to compute dense vector representations for sentences, paragraphs, and images. The models are based on transformer networks like BERT / RoBERTa / XLM-RoBERTa etc. and achieve state-of-the-art performance in various task. Text is embedding in vector space such that similar text is close and can efficiently be found using cosine similarity. + +Keywords: Dense vector representations, Text embeddings, Sentence embeddings ## [ludwig](https://github.com/ludwig-ai/ludwig) -Ludwig is a declarative machine learning framework that makes it easy to define machine learning pipelines using a simple and flexible data-driven configuration system. Ludwig is targeted at a wide variety of AI tasks. It provides a data-driven configuration system, training, prediction, and evaluation scripts, as well as a programmatic API. +Ludwig is a declarative machine learning framework that makes it easy to define machine learning pipelines using a simple and flexible data-driven configuration system. Ludwig is targeted at a wide variety of AI tasks. It provides a data-driven configuration system, training, prediction, and evaluation scripts, as well as a programmatic API. + +Keywords: Declarative, Data-driven, ML Framework ## [InvokeAI](https://github.com/invoke-ai/InvokeAI) [InvokeAI](https://github.com/invoke-ai/InvokeAI) is an engine for Stable Diffusion models, aimed at professionals, artists, and enthusiasts. It leverages the latest AI-driven technologies through CLI as well as a WebUI. +Keywords: Stable-Diffusion, WebUI, CLI + ## [PaddleNLP](https://github.com/PaddlePaddle/PaddleNLP) -[PaddleNLP](https://github.com/PaddlePaddle/PaddleNLP) is an easy-to-use and powerful NLP library particularly targeted at the Chinese languages. It has support for multiple pre-trained model zoos, and supports a wide-range of NLP tasks from research to industrial applications. +[PaddleNLP](https://github.com/PaddlePaddle/PaddleNLP) is an easy-to-use and powerful NLP library particularly targeted at the Chinese languages. It has support for multiple pre-trained model zoos, and supports a wide-range of NLP tasks from research to industrial applications. + +Keywords: NLP, Chinese, Research, Industry ## [stanza](https://github.com/stanfordnlp/stanza) The Stanford NLP Group's official Python NLP library. It contains support for running various accurate natural language processing tools on 60+ languages and for accessing the Java Stanford CoreNLP software from Python. +Keywords: NLP, Multilingual, CoreNLP + ## [DeepPavlov](https://github.com/deeppavlov/DeepPavlov) [DeepPavlov](https://github.com/deeppavlov/DeepPavlov) is an open-source conversational AI library. It is designed for the development of production ready chat-bots and complex conversational systems, as well as research in the area of NLP and, particularly, of dialog systems. +Keywords: Conversational, Chatbot, Dialog + ## [alpaca-lora](https://github.com/tloen/alpaca-lora) -Alpaca-lora contains code for reproducing the Stanford Alpaca results using low-rank adaptation (LoRA). The repository provides training (fine-tuning) as well as generation scripts. +Alpaca-lora contains code for reproducing the Stanford Alpaca results using low-rank adaptation (LoRA). The repository provides training (fine-tuning) as well as generation scripts. + +Keywords: LoRA, Parameter-efficient fine-tuning ## [imagen-pytorch](https://github.com/lucidrains/imagen-pytorch) -An open-source Implementation of Imagen, Google's closed-source Text-to-Image Neural Network that beats DALL-E2. As of release, it is the new SOTA for text-to-image synthesis. +An open-source Implementation of Imagen, Google's closed-source Text-to-Image Neural Network that beats DALL-E2. As of release, it is the new SOTA for text-to-image synthesis. + +Keywords: Imagen, Text-to-image ## [adapter-transformers](https://github.com/adapter-hub/adapter-transformers) -[adapter-transformers](https://github.com/adapter-hub/adapter-transformers) is an extension of HuggingFace's Transformers library, integrating adapters into state-of-the-art language models by incorporating AdapterHub, a central repository for pre-trained adapter modules. It is a drop-in replacement for transformers, which is regularly updated to stay up-to-date with the developments of transformers. +[adapter-transformers](https://github.com/adapter-hub/adapter-transformers) is an extension of HuggingFace's Transformers library, integrating adapters into state-of-the-art language models by incorporating AdapterHub, a central repository for pre-trained adapter modules. It is a drop-in replacement for transformers, which is regularly updated to stay up-to-date with the developments of transformers. + +Keywords: Adapters, LoRA, Parameter-efficient fine-tuning, Hub ## [NeMo](https://github.com/NVIDIA/NeMo) -NVIDIA [NeMo](https://github.com/NVIDIA/NeMo) is a conversational AI toolkit built for researchers working on automatic speech recognition (ASR), text-to-speech synthesis (TTS), large language models (LLMs), and natural language processing (NLP). The primary objective of [NeMo](https://github.com/NVIDIA/NeMo) is to help researchers from industry and academia to reuse prior work (code and pretrained models) and make it easier to create new https://developer.nvidia.com/conversational-ai#started. +NVIDIA [NeMo](https://github.com/NVIDIA/NeMo) is a conversational AI toolkit built for researchers working on automatic speech recognition (ASR), text-to-speech synthesis (TTS), large language models (LLMs), and natural language processing (NLP). The primary objective of [NeMo](https://github.com/NVIDIA/NeMo) is to help researchers from industry and academia to reuse prior work (code and pretrained models) and make it easier to create new https://developer.nvidia.com/conversational-ai#started. + +Keywords: Conversational, ASR, TTS, LLMs, NLP ## [MONAI](https://github.com/Project-MONAI/MONAI) -[MONAI](https://github.com/Project-MONAI/MONAI) is a PyTorch-based, open-source framework for deep learning in healthcare imaging, part of PyTorch Ecosystem. Its ambitions are: -• developing a community of academic, industrial and clinical researchers collaborating on a common foundation; -• creating state-of-the-art, end-to-end training workflows for healthcare imaging; -• providing researchers with the optimized and standardized way to create and evaluate deep learning models. +[MONAI](https://github.com/Project-MONAI/MONAI) is a PyTorch-based, open-source framework for deep learning in healthcare imaging, part of PyTorch Ecosystem. Its ambitions are: +- developing a community of academic, industrial and clinical researchers collaborating on a common foundation; +- creating state-of-the-art, end-to-end training workflows for healthcare imaging; +- providing researchers with the optimized and standardized way to create and evaluate deep learning models. + +Keywords: Healthcare imaging, Training, Evaluation ## [simpletransformers](https://github.com/ThilinaRajapakse/simpletransformers) -Simple Transformers lets you quickly train and evaluate Transformer models. Only 3 lines of code are needed to initialize, train, and evaluate a model. It supports a wide variety of NLP tasks. +Simple Transformers lets you quickly train and evaluate Transformer models. Only 3 lines of code are needed to initialize, train, and evaluate a model. It supports a wide variety of NLP tasks. + +Keywords: Framework, simplicity, NLP ## [JARVIS](https://github.com/microsoft/JARVIS) [JARVIS](https://github.com/microsoft/JARVIS) is a system attempting to merge LLMs such as GPT-4 with the rest of the open-source ML community: leveraging up to 60 downstream models in order to perform tasks identified by the LLM. -## [](https://xenova.github.io/transformers.js/) +Keywords: LLM, Agents, HF Hub -[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)f[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/).[](https://xenova.github.io/transformers.js/)j[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)J[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)v[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)S[](https://xenova.github.io/transformers.js/)c[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)p[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)l[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)b[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)y[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)g[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)d[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)u[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)g[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)d[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)l[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)f[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)a[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)f[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)m[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)d[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)c[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)l[](https://xenova.github.io/transformers.js/)y[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)w[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)h[](https://xenova.github.io/transformers.js/)i[](https://xenova.github.io/transformers.js/)n[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)t[](https://xenova.github.io/transformers.js/)h[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/) [](https://xenova.github.io/transformers.js/)b[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/)o[](https://xenova.github.io/transformers.js/)w[](https://xenova.github.io/transformers.js/)s[](https://xenova.github.io/transformers.js/)e[](https://xenova.github.io/transformers.js/)r[](https://xenova.github.io/transformers.js/).[](https://xenova.github.io/transformers.js/) +## [transformers.js](https://xenova.github.io/transformers.js/) + +[transformers.js](https://xenova.github.io/transformers.js/) is a JavaScript library targeted at running models from transformers directly within the browser. + +Keywords: Transformers, JavaScript, browser ## [bumblebee](https://github.com/elixir-nx/bumblebee) -Bumblebee provides pre-trained Neural Network models on top of Axon, a neural networks library for the Elixir language. It includes integration with 🤗 Models, allowing anyone to download and perform Machine Learning tasks with few lines of code. +Bumblebee provides pre-trained Neural Network models on top of Axon, a neural networks library for the Elixir language. It includes integration with 🤗 Models, allowing anyone to download and perform Machine Learning tasks with few lines of code. + +Keywords: Elixir, Axon ## [argilla](https://github.com/argilla-io/argilla) Argilla is an open-source platform providing advanced NLP labeling, monitoring, and workspaces. It is compatible with many open source ecosystems such as Hugging Face, Stanza, FLAIR, and others. +Keywords: NLP, Labeling, Monitoring, Workspaces + ## [haystack](https://github.com/deepset-ai/haystack) Haystack is an open source NLP framework to interact with your data using Transformer models and LLMs. It offers production-ready tools to quickly build complex decision making, question answering, semantic search, text generation applications, and more. +Keywords: NLP, Framework, LLM + ## [spaCy](https://github.com/explosion/spaCy) -[spaCy](https://github.com/explosion/spaCy) is a library for advanced Natural Language Processing - in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. It offers support for transformers models through its third party package, spacy-transformers. +[spaCy](https://github.com/explosion/spaCy) is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. It offers support for transformers models through its third party package, spacy-transformers. + +Keywords: NLP, Framework ## [speechbrain](https://github.com/speechbrain/speechbrain) -SpeechBrain is an open-source and all-in-one conversational AI toolkit based on PyTorch. -The goal is to create a single, flexible, and user-friendly toolkit that can be used to easily develop state-of-the-art speech technologies, including systems for speech recognition, speaker recognition, speech enhancement, speech separation, language identification, multi-microphone signal processing, and many others. +SpeechBrain is an open-source and all-in-one conversational AI toolkit based on PyTorch. +The goal is to create a single, flexible, and user-friendly toolkit that can be used to easily develop state-of-the-art speech technologies, including systems for speech recognition, speaker recognition, speech enhancement, speech separation, language identification, multi-microphone signal processing, and many others. + +Keywords: Conversational, Speech ## [skorch](https://github.com/skorch-dev/skorch) Skorch is a scikit-learn compatible neural network library that wraps PyTorch. It has support for models within transformers, and tokenizers from tokenizers. +Keywords: Scikit-Learn, PyTorch + ## [bertviz](https://github.com/jessevig/bertviz) -BertViz is an interactive tool for visualizing attention in Transformer language models such as BERT, GPT2, or T5. It can be run inside a Jupyter or Colab notebook through a simple Python API that supports most Huggingface models. +BertViz is an interactive tool for visualizing attention in Transformer language models such as BERT, GPT2, or T5. It can be run inside a Jupyter or Colab notebook through a simple Python API that supports most Huggingface models. + +Keywords: Visualization, Transformers ## [mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) -[mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) is a haiku library using the xmap/pjit operators in JAX for model parallelism of transformers. This library is designed for scalability up to approximately 40B parameters on TPUv3s. +[mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) is a haiku library using the xmap/pjit operators in JAX for model parallelism of transformers. This library is designed for scalability up to approximately 40B parameters on TPUv3s. + +Keywords: Haiku, Model parallelism, 40B parameters, TPU, TPUv3 ## [deepchem](https://github.com/deepchem/deepchem) DeepChem aims to provide a high quality open-source toolchain that democratizes the use of deep-learning in drug discovery, materials science, quantum chemistry, and biology. +Keywords: Drug discovery, Materials Science, Quantum Chemistry, Biology + ## [OpenNRE](https://github.com/thunlp/OpenNRE) An Open-Source Package for Neural Relation Extraction (NRE). It is targeted at a wide range of users, from newcomers to relation extraction, to developers, researchers, or students. +Keywords: Neural Relation Extraction, Framework + ## [pycorrector](https://github.com/shibing624/pycorrector) PyCorrector is a Chinese Text Error Correction Tool. It uses a language model to detect errors, pinyin feature and shape feature to correct Chinese text errors. it can be used for Chinese Pinyin and stroke input method. +Keywords: Chinese, Error correction tool, Language model, Pinyin + ## [nlpaug](https://github.com/makcedward/nlpaug) This python library helps you with augmenting nlp for machine learning projects. It is a lightweight library featuring synthetic data generation for improving model performance, support for audio and text, and compatibility with several ecosystems (scikit-learn, pytorch, tensorflow). +Keywords: Data augmentation, Synthetic data generation, Audio, NLP + ## [dream-textures](https://github.com/carson-katri/dream-textures) [dream-textures](https://github.com/carson-katri/dream-textures) is a library targeted at bringing stable-diffusion support within Blender. It supports several use-cases, such as image generation, texture projection, inpainting/outpainting, rendering passes, and upscaling. +Keywords: Stable-Diffusion, Blender + ## [seldon-core](https://github.com/SeldonIO/seldon-core) Seldon core converts your ML models (Tensorflow, Pytorch, H2o, etc.) or language wrappers (Python, Java, etc.) into production REST/GRPC microservices. Seldon handles scaling to thousands of production machine learning models and provides advanced machine learning capabilities out of the box including Advanced Metrics, Request Logging, Explainers, Outlier Detectors, A/B Tests, Canaries and more. +Keywords: Microservices, Modeling, Language wrappers + ## [open_model_zoo](https://github.com/openvinotoolkit/open_model_zoo) This repository includes optimized deep learning models and a set of demos to expedite development of high-performance deep learning inference applications. Use these free pre-trained models instead of training your own models to speed-up the development and production deployment process. +Keywords: Optimized models, Demos + ## [ml-stable-diffusion](https://github.com/apple/ml-stable-diffusion) -ML-Stable-Diffusion is a repository by Appel bringing Stable Diffusion support to Core ML, on Apple Silicon devices. It supports stable diffusion checkpoints hosted on the Hugging Face Hub. +ML-Stable-Diffusion is a repository by Apple bringing Stable Diffusion support to Core ML, on Apple Silicon devices. It supports stable diffusion checkpoints hosted on the Hugging Face Hub. + +Keywords: Stable Diffusion, Apple Silicon, Core ML ## [stable-dreamfusion](https://github.com/ashawkey/stable-dreamfusion) -Stable-Dreamfusion is a pytorch implementation of the text-to-3D model Dreamfusion, powered by the Stable Diffusion text-to-2D model. +Stable-Dreamfusion is a pytorch implementation of the text-to-3D model Dreamfusion, powered by the Stable Diffusion text-to-2D model. + +Keywords: Text-to-3D, Stable Diffusion ## [txtai](https://github.com/neuml/txtai) [txtai](https://github.com/neuml/txtai) is an open-source platform for semantic search and workflows powered by language models. +Keywords: Semantic search, LLM + ## [djl](https://github.com/deepjavalibrary/djl) Deep Java Library (DJL) is an open-source, high-level, engine-agnostic Java framework for deep learning. DJL is designed to be easy to get started with and simple to use for Java developers. DJL provides a native Java development experience and functions like any other regular Java library. +Keywords: Java, Framework + ## [gpt-neox](https://github.com/EleutherAI/gpt-neox) -This repository records EleutherAI's library for training large-scale language models on GPUs. The framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. It is focused on training multi-billion-parameter models. +This repository records EleutherAI's library for training large-scale language models on GPUs. The framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. It is focused on training multi-billion-parameter models. + +Keywords: Training, LLM, Megatron, DeepSpeed ## [muzic](https://github.com/microsoft/muzic) -Muzic is a research project on AI music that empowers music understanding and generation with deep learning and artificial intelligence. Muzic was created by researchers from Microsoft Research Asia. +Muzic is a research project on AI music that empowers music understanding and generation with deep learning and artificial intelligence. Muzic was created by researchers from Microsoft Research Asia. + +Keywords: Music understanding, Music generation ## [dalle-flow](https://github.com/jina-ai/dalle-flow) -DALL·E Flow is an interactive workflow for generating high-definition images from a text prompt. Itt leverages DALL·E-Mega, GLID-3 XL, and Stable Diffusion to generate image candidates, and then calls CLIP-as-service to rank the candidates w.r.t. the prompt. -The preferred candidate is fed to GLID-3 XL for diffusion, which often enriches the texture and background. Finally, the candidate is upscaled to 1024x1024 via SwinIR. +DALL·E Flow is an interactive workflow for generating high-definition images from a text prompt. Itt leverages DALL·E-Mega, GLID-3 XL, and Stable Diffusion to generate image candidates, and then calls CLIP-as-service to rank the candidates w.r.t. the prompt. +The preferred candidate is fed to GLID-3 XL for diffusion, which often enriches the texture and background. Finally, the candidate is upscaled to 1024x1024 via SwinIR. + +Keywords: High-definition image generation, Stable Diffusion, DALL-E Mega, GLID-3 XL, CLIP, SwinIR ## [lightseq](https://github.com/bytedance/lightseq) LightSeq is a high performance training and inference library for sequence processing and generation implemented in CUDA. It enables highly efficient computation of modern NLP and CV models such as BERT, GPT, Transformer, etc. It is therefore best useful for machine translation, text generation, image classification, and other sequence related tasks. +Keywords: Training, Inference, Sequence Processing, Sequence Generation + ## [LaTeX-OCR](https://github.com/lukas-blecher/LaTeX-OCR) The goal of this project is to create a learning based system that takes an image of a math formula and returns corresponding LaTeX code. +Keywords: OCR, LaTeX, Math formula + ## [open_clip](https://github.com/mlfoundations/open_clip) OpenCLIP is an open source implementation of OpenAI's CLIP. @@ -221,129 +300,189 @@ Keywords: CLIP, Open-source, Contrastive, Image-text A playground to generate images from any text prompt using Stable Diffusion and Dall-E mini. +Keywords: WebUI, Stable Diffusion, Dall-E mini + ## [FedML](https://github.com/FedML-AI/FedML) [FedML](https://github.com/FedML-AI/FedML) is a federated learning and analytics library enabling secure and collaborative machine learning on decentralized data anywhere at any scale. It supports large-scale cross-silo federated learning, and cross-device federated learning on smartphones/IoTs, and research simulation. +Keywords: Federated Learning, Analytics, Collaborative ML, Decentralized + ## [gpt-code-clippy](https://github.com/CodedotAl/gpt-code-clippy) -GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model -- based on GPT-3, called GPT-Codex -- that is fine-tuned on publicly available code from GitHub. +GPT-Code-Clippy (GPT-CC) is an open source version of GitHub Copilot, a language model -- based on GPT-3, called GPT-Codex -- that is fine-tuned on publicly available code from GitHub. + +Keywords: LLM, Code ## [TextAttack](https://github.com/QData/TextAttack) -[TextAttack](https://github.com/QData/TextAttack) 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP. +[TextAttack](https://github.com/QData/TextAttack) 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP. + +Keywords: Adversarial attacks, Data augmentation, NLP ## [OpenPrompt](https://github.com/thunlp/OpenPrompt) -Prompt-learning is a paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks, which modify the input text with a textual template and directly uses PLMs to conduct pre-trained tasks. This library provides a standard, flexible and extensible framework to deploy the prompt-learning pipeline. [OpenPrompt](https://github.com/thunlp/OpenPrompt) supports loading PLMs directly from https://github.com/huggingface/transformers. +Prompt-learning is a paradigm to adapt pre-trained language models (PLMs) to downstream NLP tasks, which modify the input text with a textual template and directly uses PLMs to conduct pre-trained tasks. This library provides a standard, flexible and extensible framework to deploy the prompt-learning pipeline. [OpenPrompt](https://github.com/thunlp/OpenPrompt) supports loading PLMs directly from https://github.com/huggingface/transformers. -## [](https://github.com/oobabooga/text-generation-webui/) +## [text-generation-webui](https://github.com/oobabooga/text-generation-webui/) -[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)d[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)o[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)w[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/)b[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)U[](https://github.com/oobabooga/text-generation-webui/)I[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)f[](https://github.com/oobabooga/text-generation-webui/)o[](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/)u[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)r[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)u[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)g[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)M[](https://github.com/oobabooga/text-generation-webui/)o[](https://github.com/oobabooga/text-generation-webui/)d[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)s[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)k[](https://github.com/oobabooga/text-generation-webui/)e[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)M[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)l[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)m[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/).[](https://github.com/oobabooga/text-generation-webui/)c[](https://github.com/oobabooga/text-generation-webui/)p[](https://github.com/oobabooga/text-generation-webui/)p[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)G[](https://github.com/oobabooga/text-generation-webui/)P[](https://github.com/oobabooga/text-generation-webui/)T[](https://github.com/oobabooga/text-generation-webui/)-[](https://github.com/oobabooga/text-generation-webui/)J[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)P[](https://github.com/oobabooga/text-generation-webui/)y[](https://github.com/oobabooga/text-generation-webui/)t[](https://github.com/oobabooga/text-generation-webui/)h[](https://github.com/oobabooga/text-generation-webui/)i[](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)O[](https://github.com/oobabooga/text-generation-webui/)P[](https://github.com/oobabooga/text-generation-webui/)T[](https://github.com/oobabooga/text-generation-webui/),[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)a[](https://github.com/oobabooga/text-generation-webui/)n[](https://github.com/oobabooga/text-generation-webui/)d[](https://github.com/oobabooga/text-generation-webui/) [](https://github.com/oobabooga/text-generation-webui/)G[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/)L[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/)C[](https://github.com/oobabooga/text-generation-webui/)T[](https://github.com/oobabooga/text-generation-webui/)I[](https://github.com/oobabooga/text-generation-webui/)C[](https://github.com/oobabooga/text-generation-webui/)A[](https://github.com/oobabooga/text-generation-webui/).[](https://github.com/oobabooga/text-generation-webui/) +[text-generation-webui](https://github.com/oobabooga/text-generation-webui/) is a Gradio Web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA. + +Keywords: LLM, WebUI ## [libra](https://github.com/Palashio/libra) An ergonomic machine learning [libra](https://github.com/Palashio/libra)ry for non-technical users. It focuses on ergonomics and on ensuring that training a model is as simple as it can be. +Keywords: Ergonomic, Non-technical + ## [alibi](https://github.com/SeldonIO/alibi) -Alibi is an open source Python library aimed at machine learning model inspection and interpretation. The focus of the library is to provide high-quality implementations of black-box, white-box, local and global explanation methods for classification and regression models. +Alibi is an open source Python library aimed at machine learning model inspection and interpretation. The focus of the library is to provide high-quality implementations of black-box, white-box, local and global explanation methods for classification and regression models. + +Keywords: Model inspection, Model interpretation, Black-box, White-box ## [tortoise-tts](https://github.com/neonbjb/tortoise-tts) Tortoise is a text-to-speech program built with the following priorities: strong multi-voice capabilities., and highly realistic prosody and intonation. +Keywords: Text-to-speech + ## [flower](https://github.com/adap/flower) Flower (flwr) is a framework for building federated learning systems. The design of Flower is based on a few guiding principles: customizability, extendability, framework agnosticity, and ease-of-use. +Keywords: Federated learning systems, Customizable, Extendable, Framework-agnostic, Simplicity + ## [fast-bert](https://github.com/utterworks/fast-bert) Fast-Bert is a deep learning library that allows developers and data scientists to train and deploy BERT and XLNet based models for natural language processing tasks beginning with Text Classification. It is aimed at simplicity. +Keywords: Deployment, BERT, XLNet + ## [towhee](https://github.com/towhee-io/towhee) -Towhee makes it easy to build neural data processing pipelines for AI applications. We provide hundreds of models, algorithms, and transformations that can be used as standard pipeline building blocks. Users can use Towhee's Pythonic API to build a prototype of their pipeline and automatically optimize it for production-ready environments. +Towhee makes it easy to build neural data processing pipelines for AI applications. We provide hundreds of models, algorithms, and transformations that can be used as standard pipeline building blocks. Users can use Towhee's Pythonic API to build a prototype of their pipeline and automatically optimize it for production-ready environments. + +Keywords: Data processing pipeline, Optimization ## [alibi-detect](https://github.com/SeldonIO/alibi-detect) Alibi Detect is an open source Python library focused on outlier, adversarial and drift detection. The package aims to cover both online and offline detectors for tabular data, text, images and time series. Both TensorFlow and PyTorch backends are supported for drift detection. +Keywords: Adversarial, Outlier, Drift detection + ## [FARM](https://github.com/deepset-ai/FARM) [FARM](https://github.com/deepset-ai/FARM) makes Transfer Learning with BERT & Co simple, fast and enterprise-ready. It's built upon transformers and provides additional features to simplify the life of developers: Parallelized preprocessing, highly modular design, multi-task learning, experiment tracking, easy debugging and close integration with AWS SageMaker. +Keywords: Transfer Learning, Modular design, Multi-task learning, Experiment tracking + ## [aitextgen](https://github.com/minimaxir/aitextgen) -A robust Python tool for text-based AI training and generation using OpenAI's GPT-2 and EleutherAI's GPT Neo/GPT-3 architecture. -[aitextgen](https://github.com/minimaxir/aitextgen) is a Python package that leverages PyTorch, Hugging Face Transformers and pytorch-lightning with specific optimizations for text generation using GPT-2, plus many added features. +A robust Python tool for text-based AI training and generation using OpenAI's GPT-2 and EleutherAI's GPT Neo/GPT-3 architecture. +[aitextgen](https://github.com/minimaxir/aitextgen) is a Python package that leverages PyTorch, Hugging Face Transformers and pytorch-lightning with specific optimizations for text generation using GPT-2, plus many added features. + +Keywords: Training, Generation ## [diffgram](https://github.com/diffgram/diffgram) Diffgram aims to integrate human supervision into platforms. We support your team programmatically changing the UI (Schema, layout, etc.) like in Streamlit. This means that you can collect and annotate timely data from users. In other words, we are the platform behind your platform, an integrated part of your application, to ship new & better AI products faster. +Keywords: Human supervision, Platfor, + ## [ecco](https://github.com/jalammar/ecco) Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0). +Keywords: Model explainability + ## [s3prl](https://github.com/s3prl/s3prl) [s3prl](https://github.com/s3prl/s3prl) stands for Self-Supervised Speech Pre-training and Representation Learning. Self-supervised speech pre-trained models are called upstream in this toolkit, and are utilized in various downstream tasks. +Keywords: Speech, Training + ## [ru-dalle](https://github.com/ai-forever/ru-dalle) RuDALL-E aims to be similar to DALL-E, targeted to Russian. +Keywords: DALL-E, Russian + ## [DeepKE](https://github.com/zjunlp/DeepKE) [DeepKE](https://github.com/zjunlp/DeepKE) is a knowledge extraction toolkit for knowledge graph construction supporting cnSchema,low-resource, document-level and multimodal scenarios for entity, relation and attribute extraction. +Keywords: Knowledge Extraction, Knowledge Graphs + ## [nebullvm](https://github.com/nebuly-ai/nebullvm) Nebullvm is an ecosystem of plug and play modules to optimize the performances of your AI systems. The optimization modules are stack-agnostic and work with any library. They are designed to be easily integrated into your system, providing a quick and seamless boost to its performance. Simply plug and play to start realizing the benefits of optimized performance right away. +Keywords: Optimization, Performance + ## [imaginAIry](https://github.com/brycedrennan/imaginAIry) Offers a CLI and a Python API to generate images with Stable Diffusion. It has support for many tools, like image structure control (controlnet), instruction-based image edits (InstructPix2Pix), prompt-based masking (clipseg), among others. +Keywords: Stable Diffusion, CLI, Python API + ## [sparseml](https://github.com/neuralmagic/sparseml) -SparseML is an open-source model optimization toolkit that enables you to create inference-optimized sparse models using pruning, quantization, and distillation algorithms. Models optimized with SparseML can then be exported to the ONNX and deployed with DeepSparse for GPU-class performance on CPU hardware. +SparseML is an open-source model optimization toolkit that enables you to create inference-optimized sparse models using pruning, quantization, and distillation algorithms. Models optimized with SparseML can then be exported to the ONNX and deployed with DeepSparse for GPU-class performance on CPU hardware. + +Keywords: Model optimization, Pruning, Quantization, Distillation ## [opacus](https://github.com/pytorch/opacus) Opacus is a library that enables training PyTorch models with differential privacy. It supports training with minimal code changes required on the client, has little impact on training performance, and allows the client to online track the privacy budget expended at any given moment. +Keywords: Differential privacy + ## [LAVIS](https://github.com/salesforce/LAVIS) [LAVIS](https://github.com/salesforce/LAVIS) is a Python deep learning library for LAnguage-and-VISion intelligence research and applications. This library aims to provide engineers and researchers with a one-stop solution to rapidly develop models for their specific multimodal scenarios, and benchmark them across standard and customized datasets. It features a unified interface design to access +Keywords: Multimodal, NLP, Vision + ## [buzz](https://github.com/chidiwilliams/buzz) Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper. +Keywords: Audio transcription, Translation + ## [rust-bert](https://github.com/guillaume-be/rust-bert) -Rust-native state-of-the-art Natural Language Processing models and pipelines. Port of Hugging Face's Transformers library, using the tch-rs crate and pre-processing from rust-tokenizers. Supports multi-threaded tokenization and GPU inference. This repository exposes the model base architecture, task-specific heads (see below) and ready-to-use pipelines. +Rust-native state-of-the-art Natural Language Processing models and pipelines. Port of Hugging Face's Transformers library, using the tch-rs crate and pre-processing from rust-tokenizers. Supports multi-threaded tokenization and GPU inference. This repository exposes the model base architecture, task-specific heads and ready-to-use pipelines. + +Keywords: Rust, BERT, Inference ## [EasyNLP](https://github.com/alibaba/EasyNLP) [EasyNLP](https://github.com/alibaba/EasyNLP) is an easy-to-use NLP development and application toolkit in PyTorch, first released inside Alibaba in 2021. It is built with scalable distributed training strategies and supports a comprehensive suite of NLP algorithms for various NLP applications. [EasyNLP](https://github.com/alibaba/EasyNLP) integrates knowledge distillation and few-shot learning for landing large pre-trained models, together with various popular multi-modality pre-trained models. It provides a unified framework of model training, inference, and deployment for real-world applications. +Keywords: NLP, Knowledge distillation, Few-shot learning, Multi-modality, Training, Inference, Deployment + ## [TurboTransformers](https://github.com/Tencent/TurboTransformers) A fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU. +Keywords: Optimization, Performance + ## [hivemind](https://github.com/learning-at-home/hivemind) Hivemind is a PyTorch library for decentralized deep learning across the Internet. Its intended usage is training one large model on hundreds of computers from different universities, companies, and volunteers. +Keywords: Decentralized training + ## [docquery](https://github.com/impira/docquery) DocQuery is a library and command-line tool that makes it easy to analyze semi-structured and unstructured documents (PDFs, scanned images, etc.) using large language models (LLMs). You simply point DocQuery at one or more documents and specify a question you want to ask. DocQuery is created by the team at Impira. +Keywords: Semi-structured documents, Unstructured documents, LLM, Document Question Answering + ## [CodeGeeX](https://github.com/THUDM/CodeGeeX) [CodeGeeX](https://github.com/THUDM/CodeGeeX) is a large-scale multilingual code generation model with 13 billion parameters, pre-trained on a large code corpus of more than 20 programming languages. It has several unique features: From 8216e786bc7d15c8809282d71cc2a89d771450bf Mon Sep 17 00:00:00 2001 From: Lysandre Date: Fri, 21 Apr 2023 10:04:27 +0200 Subject: [PATCH 5/9] Keywords --- awesome-transformers.md | 22 +++++++++++++++++++++- 1 file changed, 21 insertions(+), 1 deletion(-) diff --git a/awesome-transformers.md b/awesome-transformers.md index 6e3c7fb44d0e4c..c20450be610f3b 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -490,42 +490,60 @@ Keywords: Semi-structured documents, Unstructured documents, LLM, Document Quest - Crosslingual code translation - Is a customizable programming assistant +Keywords: Code Generation Model + ## [ktrain](https://github.com/amaiya/ktrain) [ktrain](https://github.com/amaiya/ktrain) is a lightweight wrapper for the deep learning library TensorFlow Keras (and other libraries) to help build, train, and deploy neural networks and other machine learning models. Inspired by ML framework extensions like fastai and ludwig, [ktrain](https://github.com/amaiya/ktrain) is designed to make deep learning and AI more accessible and easier to apply for both newcomers and experienced practitioners. +Keywords: Keras wrapper, Model building, Training, Deployment + ## [FastDeploy](https://github.com/PaddlePaddle/FastDeploy) [FastDeploy](https://github.com/PaddlePaddle/FastDeploy) is an Easy-to-use and High Performance AI model deployment toolkit for Cloud, Mobile and Edge with packageout-of-the-box and unified experience, endend-to-end optimization for over fire160+ Text, Vision, Speech and Cross-modal AI models. Including image classification, object detection, OCR, face detection, matting, pp-tracking, NLP, stable diffusion, TTS and other tasks to meet developers' industrial deployment needs for multi-scenario, multi-hardware and multi-platform. +Keywords: Model deployment, CLoud, Mobile, Edge + ## [underthesea](https://github.com/undertheseanlp/underthesea) [underthesea](https://github.com/undertheseanlp/underthesea) is a Vietnamese NLP toolkit. Underthesea is a suite of open source Python modules data sets and tutorials supporting research and development in Vietnamese Natural Language Processing. We provides extremely easy API to quickly apply pretrained NLP models to your Vietnamese text, such as word segmentation, part-of-speech tagging (PoS), named entity recognition (NER), text classification and dependency parsing. +Keywords: Vietnamese, NLP + ## [hasktorch](https://github.com/hasktorch/hasktorch) Hasktorch is a library for tensors and neural networks in Haskell. It is an independent open source community project which leverages the core C++ libraries shared by PyTorch. +Keywords: Haskell, Neural Networks + ## [donut](https://github.com/clovaai/donut) Donut, or Document understanding transformer, is a new method of document understanding that utilizes an OCR-free end-to-end Transformer model. Donut does not require off-the-shelf OCR engines/APIs, yet it shows state-of-the-art performances on various visual document understanding tasks, such as visual document classification or information extraction (a.k.a. document parsing). +Keywords: Document Understanding + ## [transformers-interpret](https://github.com/cdpierse/transformers-interpret) -Transformers Interpret is a model explainability tool designed to work exclusively with the hugs transformers package. +Transformers Interpret is a model explainability tool designed to work exclusively with the transformers package. In line with the philosophy of the Transformers package Transformers Interpret allows any transformers model to be explained in just two lines. Explainers are available for both text and computer vision models. Visualizations are also available in notebooks and as savable png and html files +Keywords: Model interpretation, Visualization + ## [mlrun](https://github.com/mlrun/mlrun) MLRun is an open MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications, significantly reducing engineering efforts, time to production, and computation resources. With MLRun, you can choose any IDE on your local machine or on the cloud. MLRun breaks the silos between data, ML, software, and DevOps/MLOps teams, enabling collaboration and fast continuous improvements. +Keywords: MLOps + ## [FederatedScope](https://github.com/alibaba/FederatedScope) [FederatedScope](https://github.com/alibaba/FederatedScope) is a comprehensive federated learning platform that provides convenient usage and flexible customization for various federated learning tasks in both academia and industry. Based on an event-driven architecture, [FederatedScope](https://github.com/alibaba/FederatedScope) integrates rich collections of functionalities to satisfy the burgeoning demands from federated learning, and aims to build up an easy-to-use platform for promoting learning safely and effectively. +Keywords: Federated learning, Event-driven + ## [pythainlp](https://github.com/PyThaiNLP/pythainlp) PyThaiNLP is a Python package for text processing and linguistic analysis, similar to NLTK with focus on Thai language. @@ -536,6 +554,8 @@ Keywords: Thai, NLP, NLTK [FlagAI](https://github.com/FlagAI-Open/FlagAI) (Fast LArge-scale General AI models) is a fast, easy-to-use and extensible toolkit for large-scale model. Our goal is to support training, fine-tuning, and deployment of large-scale models on various downstream tasks with multi-modality. +Keywords: Large models, Training, Fine-tuning, Deployment, Multi-modal + ## [pyserini](https://github.com/castorini/pyserini) [pyserini](https://github.com/castorini/pyserini) is a Python toolkit for reproducible information retrieval research with sparse and dense representations. Retrieval using sparse representations is provided via integration with the group's Anserini IR toolkit. Retrieval using dense representations is provided via integration with Facebook's Faiss library. From 5c0f3371a55e4b07d1c2275be8949182c475f954 Mon Sep 17 00:00:00 2001 From: Lysandre Date: Fri, 21 Apr 2023 17:28:45 +0200 Subject: [PATCH 6/9] Complete document --- awesome-transformers.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/awesome-transformers.md b/awesome-transformers.md index c20450be610f3b..9c73e594554a65 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -111,6 +111,14 @@ NVIDIA [NeMo](https://github.com/NVIDIA/NeMo) is a conversational AI toolkit bui Keywords: Conversational, ASR, TTS, LLMs, NLP +## [Runhouse](https://github.com/run-house/runhouse) + +[Runhouse](https://github.com/run-house/runhouse) allows to send code and data to any of your compute or data infra, all in Python, and continue to interact with them normally from your existing code and environment. Runhouse developers mention: + +> Think of it as an expansion pack to your Python interpreter that lets it take detours to remote machines or manipulate remote data. + +Keywords: MLOps, Infrastructure, Data storage, Modeling + ## [MONAI](https://github.com/Project-MONAI/MONAI) [MONAI](https://github.com/Project-MONAI/MONAI) is a PyTorch-based, open-source framework for deep learning in healthcare imaging, part of PyTorch Ecosystem. Its ambitions are: From 5ba24f89311d81ee4bac862721b181d4c3cc088a Mon Sep 17 00:00:00 2001 From: Lysandre Date: Wed, 3 May 2023 08:21:23 -0400 Subject: [PATCH 7/9] Add lm-evaluation-harness --- awesome-transformers.md | 7 +++++++ 1 file changed, 7 insertions(+) diff --git a/awesome-transformers.md b/awesome-transformers.md index 9c73e594554a65..6a654acf5237db 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -262,6 +262,13 @@ Deep Java Library (DJL) is an open-source, high-level, engine-agnostic Java fram Keywords: Java, Framework + +## [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness/) + +This project provides a unified framework to test generative language models on a large number of different evaluation tasks. It has support for more than 200 tasks, and supports different ecosystems: HF Transformers, GPT-NeoX, DeepSpeed, as well as the OpenAI API. + +Keywords: LLM, Evaluation, Few-shot + ## [gpt-neox](https://github.com/EleutherAI/gpt-neox) This repository records EleutherAI's library for training large-scale language models on GPUs. The framework is based on NVIDIA's Megatron Language Model and has been augmented with techniques from DeepSpeed as well as some novel optimizations. It is focused on training multi-billion-parameter models. From 32cc50e329066ecf9c633ae71cab0bec2df5f872 Mon Sep 17 00:00:00 2001 From: Lysandre Date: Wed, 3 May 2023 09:27:14 -0400 Subject: [PATCH 8/9] Edit txtai according to David's comments --- awesome-transformers.md | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/awesome-transformers.md b/awesome-transformers.md index 6a654acf5237db..df4d94c1bd74a5 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -251,8 +251,8 @@ Stable-Dreamfusion is a pytorch implementation of the text-to-3D model Dreamfusi Keywords: Text-to-3D, Stable Diffusion ## [txtai](https://github.com/neuml/txtai) - -[txtai](https://github.com/neuml/txtai) is an open-source platform for semantic search and workflows powered by language models. + +[txtai](https://github.com/neuml/txtai) is an open-source platform for semantic search and workflows powered by language models. txtai builds embeddings databases, which are a union of vector indexes and relational databases enabling similarity search with SQL. Semantic workflows connect language models together into unified applications. Keywords: Semantic search, LLM @@ -262,7 +262,6 @@ Deep Java Library (DJL) is an open-source, high-level, engine-agnostic Java fram Keywords: Java, Framework - ## [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness/) This project provides a unified framework to test generative language models on a large number of different evaluation tasks. It has support for more than 200 tasks, and supports different ecosystems: HF Transformers, GPT-NeoX, DeepSpeed, as well as the OpenAI API. From 31757ac78ab8c065031d313e8cd24e07225ffcd6 Mon Sep 17 00:00:00 2001 From: Lysandre Debut Date: Wed, 17 May 2023 08:22:45 -0400 Subject: [PATCH 9/9] Update awesome-transformers.md --- awesome-transformers.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/awesome-transformers.md b/awesome-transformers.md index df4d94c1bd74a5..6dab782a1d46ae 100644 --- a/awesome-transformers.md +++ b/awesome-transformers.md @@ -191,9 +191,9 @@ Keywords: Visualization, Transformers ## [mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) -[mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) is a haiku library using the xmap/pjit operators in JAX for model parallelism of transformers. This library is designed for scalability up to approximately 40B parameters on TPUv3s. +[mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) is a haiku library using the xmap/pjit operators in JAX for model parallelism of transformers. This library is designed for scalability up to approximately 40B parameters on TPUv3s. It was the library used to train the GPT-J model. -Keywords: Haiku, Model parallelism, 40B parameters, TPU, TPUv3 +Keywords: Haiku, Model parallelism, LLM, TPU ## [deepchem](https://github.com/deepchem/deepchem)