You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Jun 24, 2024. It is now read-only.
Even if we don't support it directly, it may be worth publishing a safe-ish version of ggml-rs to crates.io so that a library like llama-rs could be built for BLOOM.
The text was updated successfully, but these errors were encountered:
Current state of affairs: this was implemented in #85, but that went out of sync after recent changes. It was then updated and trait-ified in #141, which is on track to be merged soon 🚀
Not sure if we should consider this out of scope, but
bloomz.cpp
is a fork ofllama.cpp
that's capable of inference with the BLOOM family of models. The changes don't look very large, so there's room for code sharing here: https://github.com/NouamaneTazi/bloomz.cpp/commits/main?before=ade8a9d82fa1dc440c26f09a9e02cc94d7294251+35&branch=main&qualified_name=refs%2Fheads%2FmainEven if we don't support it directly, it may be worth publishing a safe-ish version of
ggml-rs
tocrates.io
so that a library likellama-rs
could be built for BLOOM.The text was updated successfully, but these errors were encountered: