Andj edited this page Apr 13, 2022 · 1 revision

Python internationalisation notes

  1. Encoding
  2. Locales
  3. Boundary analysis: break iterators, text segmentation, and tokenisation (character, grapheme, syllable, word, and sentence boundaries)
  4. Collation
  5. String transformations
    1. Unicode normalisation
    2. Casing operations
    3. Casefolding
    4. Transliteration
  6. Working with digits