You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I reached out on Gitter but I'm not sure if that's still actively used:
With all the LLM and VLM models announced and released recently, are there thoughts or plans around supporting those types of models in DeepDetect? I'm specifically most interested in multimodal models like the new HuggingFace SmolVLM Instruct models and Microsoft Florence-2 vision models, both based on the HuggingFace Transformers library.
SmolVLM is available as a variety of ONNX models, while Florence-2 is available as pytorch .bin, both with their own set of various configs for the model, tokenizers, preprocessors, etc, so I'm not sure how much (or which) is viable for use in DeepDetect.
The text was updated successfully, but these errors were encountered:
Hi @cchadowitz we have another software stack for LLMs and VLLMs (including smolvlm), it's not been open sourced yet. If you'd like to have them inside DD, please PM us via email to see what can be done.
Sorry about the gitter, we've been busy elsewhere.
I reached out on Gitter but I'm not sure if that's still actively used:
The text was updated successfully, but these errors were encountered: