LOCALISATION REQUEST: Karnataka Konkani #4673

Open
1 of 3 tasks
chasingdragonflies opened this issue Oct 26, 2024 · 0 comments

At the request of a professor from Mangalore, I am adding this. Konkani speakers in Karnataka have slightly different grammar and vocabulary from Goan Konkani, and they mainly write in the Kannada script. Karnataka's Konkani-speaking population (~46,00,000) is similar in size to Goa's, so it is vital to include them. Karnataka Konkani also has 42 dialects spanning three religious communities (Hindu, Muslim, Christian).

Welcome to the Common Voice Community!

Common Voice aims to make speech technology accessible to everyone by building an open-source dataset of labelled voice data that is representative of languages, variants and accents spoken across the world. This template helps us to know how your language could participate in the Common Voice Project. This form has three sections; once you have filled out a section, please click its checkbox. If you have any issues, please contact commonvoice@mozilla.org.

Pontoon Set-up

To start a language on Common Voice, volunteers localise our platform via Pontoon and create sentence corpora of CC0 text.

Language name
Karnataka Konkani (ಕರ್ನಾಟಕ ಕೊಂಕಣಿ)

Language code
kok

Language size
46,00,000+

Plural forms
ಶೂನ್ಯ್ 0
ಏಕ್ 1
ದೋನ್ 2
ತೀನ್ 3
ಚಾರ್ 4
ಪಾಂಚ್ 5
ಧಾ 10
ವೀಸ್ 20
ಶೆಂಭೊರ್ 100
ಹಜಾರ್ 1000
ಹಾಂಗಾಂ ಲ್ಹಾನ್ ಫಾತೊರ್ ನಾಂತ್.
ಏಕ್ ಲ್ಹಾನ್ ಫಾತೊರ್ ಹಾಂವ್ ಪಳೆತಾ.
ಹಾಂವೆಂ ಧಾ ಲ್ಹಾನ್ ಫಾತೊರ್ ಪಳಯಿಲ್ಲೆ.
ಹಾಂವ್ ಲ್ಹಾನ್ ಫಾತೊರ್ ಧರ್ಣಿರ್ ಪಳೈತಾ.

Pontoon manager
https://pontoon.mozilla.org/contributors/eXy6lJzx2MjXnEPHQW8alV2qu9E/

Language Script
Kannada

Sentence Collection Requirements

On the Common Voice platform, contributors read out public domain sentences generated through sentence collection. Sentence collection is a crucial part of launching languages on Common Voice. To support the equitable participation of languages on Common Voice, we have introduced three new sentence collection requirement bands.

Sentence Requirement Band

  • Band A
  • Band B
  • Band C

Creating Community

  1. [optional to share] Why do you want to take part in Common Voice?
    To help the Konkani community.
  2. [optional to share] Would you like to have a follow up conversation regarding community building?
    Yes.
ftyers added the Localisation label (New language requests and or issues regarding localisation (l10n)) on Nov 14, 2024