ProtBench: Protein Language Modeling Benchmarking Library

Welcome to ProtBench! This library is designed to make benchmarking protein language models easy and modular. Whether you're adding new models, datasets, or using downstream models, this library has you covered. With support for embedding extraction and saving embeddings to disk, you can streamline your workflow and focus on what matters most: advancing your research.

Features

Ease of Use: Simple and intuitive API for benchmarking protein language models.
Modular Design: Easily add new models and datasets for benchmarking.
Downstream Models: Support for integrating and benchmarking downstream models (Currently, supports ConvBERT only).
LoRA Integration: Use LoRA (Low-Rank Adaptation) for efficient benchmarking.
Embedding Extraction: Extract embeddings and save them to disk for later use.

Results

Installation

To install the library, simply use pip:

git clone git@github.com:Proteinea/protbench.git
pip install -e .

Quick Start

Here are some simple examples to get you started: Example directories:

ESM2: protbench/examples/train_with_convbert_esm2.py
ANKH: protbench/examples/train_with_convbert.py
ESM2 with LoRA: protbench/examples/train_with_lora_esm2.py
Ankh with LoRA: protbench/examples/train_with_lora.py

Documentation

Will be added soon.

Contributing

We welcome contributions from the community. If you'd like to contribute, please fork the repository and submit a pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 242 Commits
examples		examples
imgs		imgs
protbench		protbench
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg
tests.py		tests.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ProtBench: Protein Language Modeling Benchmarking Library

Features

Results

Installation

Quick Start

Documentation

Contributing

About

Releases

Packages

Contributors 2

Languages

Proteinea/protbench

Folders and files

Latest commit

History

Repository files navigation

ProtBench: Protein Language Modeling Benchmarking Library

Features

Results

Installation

Quick Start

Documentation

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages