A curated list of NLP Resources for the Nepali Language
- High quality TTS data for Nepali: multi-speaker TTS data for Nepali (ne-NP), ~2,000 sentences (48 kHz, 16-bit, mono, WAV audio).
- Large Nepali ASR training data set: ~157K utterances (16 kHz, 16-bit, mono, FLAC audio).
- 16NepaliNews Corpus
- 65K Nepali Sentences
- 1000 Sport News
- Nagarik News Corpus
- Setopati News Corpus
- Nepali news in 10 different categories
- Nepali News in English Corpus
- Title Article pairs from news
- Nepali News Classification Dataset
- Pretrained fastText Word Vectors (bin, txt), trained on Common Crawl and Wikipedia
- 300-D Word Embeddings (Word2Vec) for Nepali Language, trained on a text corpus of more than 90 million running words
- Nepali Text Extraction
- Laxmi Prasad Devkota Poems
- Nepali Names
- Nepali Ngram
- Nepali Stopwords
- Nepali Word List
- Nepali transliteration
- Nepali Textbooks by Cornell Anthropology
- Nepali Textbooks from grade 1 to 12
- Nepali Spelling Correction Dataset
- Nepali Contemporary Dictionary
- Nepali Names list
- English to Nepali dictionary
Contributions are always welcome!