Update init.pyi: fix 525: SyntaxWarning: invalid escape sequence '\w' #1764

wyattscarpenter · 2025-04-18T13:34:38Z

Using a raw string.

…'\w' with raw string

ArthurZucker

This file is automatically generated! # Generated content DO NOT EDIT as the header. If you want to modify something it should be the doc here: https://github.com/huggingface/tokenizers/blob/main/bindings/python/src/pre_tokenizers.rs#L344

wyattscarpenter · 2025-06-04T08:38:19Z

Thanks. So it's something to do with the transformation from a rust docstring.

/// This pre-tokenizer simply splits using the following regex: `\w+|[^\w\s]+`

This is a valid rust docstring, but whatever turns it into a python docstring needs to be updated to respect the python escape rules. (Or, maybe, it wants to let you use python escapes, and so we must correct this docstring to escape the escapes? Hmm...)

wyattscarpenter · 2025-06-04T08:50:18Z

I don't really know how to deal with that myself, offhand, so I guess I'll just pivot to opening an issue about it instead.

Update __init__.pyi: fix 525: SyntaxWarning: invalid escape sequence …

fc7ea9f

…'\w' with raw string

ArthurZucker reviewed May 27, 2025

View reviewed changes

Merge branch 'main' into patch-1

2eb61a8

wyattscarpenter closed this Jun 4, 2025

wyattscarpenter mentioned this pull request Jun 4, 2025

SyntaxWarning: invalid escape sequence '\w' #1791

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update init.pyi: fix 525: SyntaxWarning: invalid escape sequence '\w' #1764

Update init.pyi: fix 525: SyntaxWarning: invalid escape sequence '\w' #1764

Uh oh!

wyattscarpenter commented Apr 18, 2025

Uh oh!

ArthurZucker left a comment

Uh oh!

wyattscarpenter commented Jun 4, 2025

Uh oh!

wyattscarpenter commented Jun 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Update __init__.pyi: fix 525: SyntaxWarning: invalid escape sequence '\w' #1764

Update __init__.pyi: fix 525: SyntaxWarning: invalid escape sequence '\w' #1764

Uh oh!

Conversation

wyattscarpenter commented Apr 18, 2025

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

wyattscarpenter commented Jun 4, 2025

Uh oh!

wyattscarpenter commented Jun 4, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Update init.pyi: fix 525: SyntaxWarning: invalid escape sequence '\w' #1764

Update init.pyi: fix 525: SyntaxWarning: invalid escape sequence '\w' #1764