Marian RNN conversion support #36651

FricoRico · 2025-03-11T22:48:08Z

Feature request

Add support for converting Marian Transformer RNN models.

Motivation

Firefox is doing great work at training new models. Their teacher models are able to be converted to PyTorch models using existing conversion tooling.

However their student models which are way smaller, and more efficient, are structured as Transformer-RNN. This is currently not supported by the conversion tools. Would it be even possible to add support for this?

Your contribution

At this point I'm just a parrot trying to seek help and information on this topic. It is far outside my current knowledge on what this exactly means. Perhaps if someone could point me in the right direction I could figure this out.

Rocketknight1 · 2025-03-12T12:21:02Z

Hi @FricoRico, I don't know if the Marian modeling code in Transformers supports Transformer-RNN architectures either! This means that you'd need to either:

Convert the models as "custom code" models: https://huggingface.co/docs/transformers/en/custom_models
Write a full PR to add the Transformer-RNN architecture to Transformers

FricoRico · 2025-03-12T21:28:08Z

@Rocketknight1 Yeah you are right, ideally the Marian tools would also need to be expanded to support Transformer-RNN for inference. But I guess step one would be to even allow to export the models in the first place. But perhaps I'm over simplifying things.

Rocketknight1 · 2025-03-13T14:46:15Z

In general, we need modeling code first in order to support conversion of model checkpoints, rather than the other way around! The model code provides the "architecture" that runs a particular set of weights.

FricoRico added the Feature request Request for a new feature label Mar 11, 2025

FricoRico mentioned this issue Mar 11, 2025

Explore uploading models to Hugging Face mozilla/translations#804

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Marian RNN conversion support #36651

Marian RNN conversion support #36651

FricoRico commented Mar 11, 2025

Rocketknight1 commented Mar 12, 2025

FricoRico commented Mar 12, 2025

Rocketknight1 commented Mar 13, 2025

Marian RNN conversion support #36651

Marian RNN conversion support #36651

Comments

FricoRico commented Mar 11, 2025

Feature request

Motivation

Your contribution

Rocketknight1 commented Mar 12, 2025

FricoRico commented Mar 12, 2025

Rocketknight1 commented Mar 13, 2025