Assignment 2 for CS 11-731 Machine Translation course.
Updated Nov 6, 2019 - TypeScript
[ACL 2021, Findings] Cognate Prediction Per Machine Translation
NONWESTLIT Project Codebase
A practical introduction to Generative AI and LLMs, equipping professionals with essential skills to apply Gen AI in workflows, data processes, and tool development through hands-on labs and case studies.
Embedding Evaluation Data for South African Languages
Investigating transfer learning in low-resourced languages, specifically in a named entity recognition (NER) task (IJCNLP-AACL 2023). http://arxiv.org/abs/2309.05311
Preparing a corpus of texts in Moksha Mordvin: crawling texts from the Russian-Moksha newspaper website (mokshapr.ru) and splitting them into Russian and Moksha using cluster analysis.
Italian hate speech detection using a transformer.
A web application to test sentence-similarity models for the top 10 Indian languages
A 16M LLM for POS tagging in African languages
A natural language processing and machine learning project for a low-resource language in Zambia.
Natural language processing tool for siSwati
Auto-generated stopwords for South African Bantu Languages
Automating healthcare QA in a noisy multilingual low-resource setting
Example dataset and prompt design of Korean Offensive language Machine Generation (K-OMG), published at IJCNLP-AACL 2023.
Model training and evaluation tools for a Polish-Kashubian translator.
Scripts and files I used throughout my M.Sc. Voice Technology Thesis Project at Rijksuniversiteit Groningen - Campus Fryslân.
Finetuning BERT models on a powerset of different linguistic domains
Jopara (Guarani-dominant mixed with Spanish) sentiment analysis corpus
Fine-tune LLM for early Middle English lemmatization with data from LAEME.