Conversation

@Khansa435

@Khansa435 Khansa435 commented Oct 14, 2025

What does this PR do?

Adds a clearer ImportError message when users try to load a Voxtral tokenizer without having mistral-common installed.
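
For illustration, here is a minimal sketch of the kind of guard this change describes. The helper name `is_mistral_common_available` and where the check lives are assumptions for the example, not the literal diff:

```python
# Hypothetical sketch, not the literal diff from this PR.
# Assumes a helper such as transformers.utils.is_mistral_common_available exists.
from transformers.utils import is_mistral_common_available


def require_mistral_common(pretrained_model_name_or_path: str) -> None:
    """Raise a clear ImportError instead of the misleading 'TypeError: not a string'."""
    if not is_mistral_common_available():
        raise ImportError(
            f"Loading the tokenizer for '{pretrained_model_name_or_path}' requires the "
            "`mistral-common` package. Install it with `pip install mistral-common` and retry."
        )
```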

Issue

Fixes #41553
Fixes the misleading `TypeError: not a string` raised when loading Voxtral models.

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline, Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@ArthurZucker @Rocketknight1
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@Khansa435
Author

Khansa435 commented Oct 14, 2025

Hi @ArthurZucker @Rocketknight1, my PR passes 21 out of 22 checks, with 1 pending.
Please review it and share your suggestions :)

@Khansa435
Author

Khansa435 commented Oct 15, 2025

@ArthurZucker @Rocketknight1 It passed all checks after my changes, but when I merged the branches to stay up to date it hit a test error; all checks are now passing again.

Please review!

@Khansa435
Author

@vasqu Can you please review this PR? It passes 21 out of 22 checks and none have failed; one is pending.

@vasqu
Contributor

vasqu commented Oct 15, 2025

Hey @Khansa435, I responded directly in issue #41553 (comment). Thanks for the PR, but I'll leave the decision to the respective maintainers, who know more about this than I do :D

@vasqu
Contributor

vasqu commented Oct 15, 2025

Also, don't worry about the CI; we had an issue that should be resolved now.

@AvinashDwivedi

@vasqu could you please help me find an issue to contribute to? I'm new to open source and just want to learn.

@vasqu
Contributor

vasqu commented Oct 15, 2025

Hey @AvinashDwivedi, sorry about this one. It got a bit messy; usually we should open only one PR and coordinate with people who have already provided something. No worries, we will guide you as best we can if the PR makes sense! We have a contributing guide here, for example: https://huggingface.co/docs/transformers/contributing. Issues labeled "good first issue" are generally a good place to start; just make sure nobody else is already working on the issue and that your solution isn't the same.

@Khansa435
Author

Hi, can you please review this and let me know if it satisfies the requirements? :)
@ArthurZucker @ethanknights @itazap @jackzhxng

@Khansa435
Author

Khansa435 commented Oct 26, 2025

Hi, just checking in to see if there's any update on this PR. Hacktoberfest is wrapping up soon, and I'd love to have this issue resolved before the event ends.
Please let me know if there’s anything else I should update or address.
Thanks for your time and review!
@ArthurZucker @ethanknights @itazap @jackzhxng @Rocketknight1

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

@Khansa435
Author

Reverted to the original error message and adjusted indentation as suggested. Thanks for clarifying!
@vasqu Please take a look

@vasqu
Contributor

vasqu commented Oct 30, 2025

Thanks for bearing with me and iterating!

I started looking into this myself because the fix didn't work. It turns out the problem goes a bit deeper, so please wait a bit.
The core issue is that the loader falls back to a Llama tokenizer even when the files on the Hub are not suitable for it, which might indicate faulty behavior in how the mistral tokenizer backend is saved.
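
(For context, a minimal, hypothetical illustration of where the misleading `TypeError: not a string` plausibly originates when such a fallback happens; this is not the transformers code path, only the sentencepiece behavior a Llama-style tokenizer would hit when the vocab file it expects is missing.)

```python
# Hypothetical repro of the misleading error, not transformers source code.
# A Llama-style fallback ends up handing sentencepiece a missing (None) vocab file.
import sentencepiece as spm

vocab_file = None  # the repo does not ship the file the fallback expects
sp = spm.SentencePieceProcessor()
try:
    sp.Load(vocab_file)
except TypeError as err:
    print(err)  # "not a string" -- unhelpful compared to a clear ImportError
```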

@Khansa435
Author

Thanks a lot for the update and for taking the time to look into this deeper!
I’ll hold off on any further changes for now.

I really enjoyed working on this and learned a lot through your feedback. This was my first deeper dive into the AutoTokenizer internals.
Since I initially picked this up for Hacktoberfest and spent quite some time on it, is there anything else I could help with, or another good first issue you'd recommend that I could finish today before Hacktoberfest ends?
I’d love to continue contributing to Transformers. 😊
