It would be great to support all the models CTranslate2 now supports (like Whisper and encoder-only models).
I have been thinking about the best way to abstract the inference, since the different model classes expose different calling functions (like generate, translate, ...). We could create a metaclass that takes over initialization and inference handling, so we have a unified interface to talk to. To be honest, I still need to find out how easy this is.
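For what it's worth, a plain dispatch table might already be enough instead of a metaclass. Below is a minimal sketch, assuming the model type is passed in from the backend configuration; the wrapper class and its arguments are hypothetical, while Translator, Generator, Encoder and Whisper are CTranslate2's own classes:

```python
# Minimal sketch of a unified inference wrapper (hypothetical class, not part of
# the backend). Assumes the model type is known up front, e.g. from the config.
import ctranslate2


class CT2Model:
    """Dispatch to the matching CTranslate2 class and inference call per model type."""

    def __init__(self, model_path: str, model_type: str, device: str = "cpu"):
        self.model_type = model_type
        if model_type == "encoder-decoder":
            self.model = ctranslate2.Translator(model_path, device=device)
        elif model_type == "decoder-only":
            self.model = ctranslate2.Generator(model_path, device=device)
        elif model_type == "encoder-only":
            self.model = ctranslate2.Encoder(model_path, device=device)
        elif model_type == "whisper":
            self.model = ctranslate2.models.Whisper(model_path, device=device)
        else:
            raise ValueError(f"Unsupported model type: {model_type}")

    def __call__(self, inputs, **kwargs):
        # Route to the class-specific inference method.
        if self.model_type == "encoder-decoder":
            return self.model.translate_batch(inputs, **kwargs)
        if self.model_type == "decoder-only":
            return self.model.generate_batch(inputs, **kwargs)
        if self.model_type == "encoder-only":
            return self.model.forward_batch(inputs, **kwargs)
        # Whisper expects audio features and prompt tokens rather than text tokens.
        return self.model.generate(inputs, **kwargs)
```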
Previously reported in #2 (comment) by @aamir-s18
The backend currently only supports encoder-decoder models,
whereas the underlying library also supports decoder-only models: https://github.com/OpenNMT/CTranslate2/blob/master/src/models/language_model.cc
This should be fairly straightforward to add. Ideally we would auto-detect the model type, or alternatively specify it in the configuration (see the sketch below).
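For reference, a minimal sketch of what the decoder-only path could look like, using CTranslate2's Generator class. The model path is a placeholder, and the start tokens shown are illustrative; in practice they would come from the model's tokenizer:

```python
import ctranslate2

# Load a converted decoder-only language model (path is a placeholder).
generator = ctranslate2.Generator("/path/to/ct2_model", device="cpu")

# Placeholder start tokens; normally produced by the model's tokenizer.
start_tokens = [["<s>", "▁Hello", "▁world"]]

results = generator.generate_batch(
    start_tokens,
    max_length=64,
    sampling_topk=10,
    sampling_temperature=0.8,
)

# Each result holds the generated token sequences for one input.
print(results[0].sequences[0])
```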