The most accurate natural language detection library for Python, suitable for short text and mixed-language text
-
Updated
Mar 20, 2025 - Python
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
A TensorFlow-based spoken language identification
Faster, modernized fork of the language identification tool langid.py
End-to-end spoken language identification out of the box.
📚 Сборник полезных штук из Natural Language Processing: Определение языка текста, Разделение текста на предложения, Получение основного содержимого из html документа
Simple, yet fast, Python scripts to read Kaldi NNet3 models and compute bottleneck features
Source code for: Efficient Self-supervised Learning Representations for Spoken Language Identification
🎏🎌 language recognition script implemented using basic algorithms and spaghetti code
Suite of Python modules to recognise the language of a file
A single-layer neural network written from scratch that predicts the language of the text.
End to End Dialect Identification using Convolutional Neural Network
The European Parliament Proceedings Parallel Corpus (1996-2011) (https://www.statmt.org/europarl/) is a well-known dataset in Natural Language Processing tasks, it contains proceedings of the European Parliament in 21 European languages. In this project we will only extract data from 6 languages (German, French, Spanish, Italian, Polish and Eng…
Modify Markdown fenced code blocks to contain the language name by detecting it from the block contents.
too Simple Language Recognition
Add a description, image, and links to the language-recognition topic page so that developers can more easily learn about it.
To associate your repository with the language-recognition topic, visit your repo's landing page and select "manage topics."