Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Database error when abbreviations multi-byte characters #1319

Open
arteymix opened this issue Jan 25, 2025 · 0 comments
Open

Database error when abbreviations multi-byte characters #1319

arteymix opened this issue Jan 25, 2025 · 0 comments
Assignees
Labels
bug database Issues that involve the database
Milestone

Comments

@arteymix
Copy link
Member

MySQL storage limit for text is based on the number of bytes, not the number of characters.

We historically addressed this issue by abbreviating certain strings. This is not enough however, because this operation operates on characters, not bytes.

This problem has started arising in the single-cell branch due to the lengthy subset names that result from pseudo-bulking. I've added a bunch of string utilities to complement Commons Lang that truncate and abbreviate based on the resulting byte representation.

The fix should be back-ported on the next patch release.

@arteymix arteymix added bug database Issues that involve the database labels Jan 25, 2025
@arteymix arteymix added this to the 1.31.13 milestone Jan 25, 2025
@arteymix arteymix self-assigned this Jan 25, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug database Issues that involve the database
Projects
None yet
Development

No branches or pull requests

1 participant