[abk] Abkhaz: initial scrape with just enough data (finally!). (#474)
* Abkhaz scrape.

* Updated summaries.

* Updated.
agutkin authored Nov 27, 2022
1 parent 6c13a08 commit debf20f
Showing 5 changed files with 748 additions and 0 deletions.
1 change: 1 addition & 0 deletions CHANGELOG.md
@@ -54,6 +54,7 @@ Unreleased
- Added Livvi (`olo`). (\#459)
- Added Kalmyk (`xal`). (\#472)
- Added Ternate (`tft`). (\#473)
- Added Abkhaz (`abk`). (\#474)

#### Changed

2 changes: 2 additions & 0 deletions data/scrape/README.md
@@ -1,6 +1,8 @@
| Link | ISO 639-3 Code | ISO 639 Language Name | Wiktionary Language Name | Script | Dialect | Filtered | Narrow/Broad | Case-folding | # of entries |
| :---- | :----: | :----: | :----: | :----: | :----: | :----: | :----: | :----: | ----: |
| [TSV](tsv/aar_latn_broad.tsv) | aar | Afar | Afar | Latin | | False | Broad | True | 1,538 |
| [TSV](tsv/abk_cyrl_broad.tsv) | abk | Abkhazian | Abkhaz | Cyrillic | | False | Broad | True | 124 |
| [TSV](tsv/abk_cyrl_narrow.tsv) | abk | Abkhazian | Abkhaz | Cyrillic | | False | Narrow | True | 619 |
| [TSV](tsv/acw_arab_broad.tsv) | acw | Hijazi Arabic | Hijazi Arabic | Arabic | | False | Broad | False | 1,122 |
| [TSV](tsv/acw_arab_narrow.tsv) | acw | Hijazi Arabic | Hijazi Arabic | Arabic | | False | Narrow | False | 173 |
| [TSV](tsv/ady_cyrl_narrow.tsv) | ady | Adygei; Adyghe | Adyghe | Cyrillic | | False | Narrow | True | 5,119 |
124 changes: 124 additions & 0 deletions data/scrape/tsv/abk_cyrl_broad.tsv
@@ -0,0 +1,124 @@
ааба a a p a
абельльи a b e lʲ lʲ ə j
абна a b n a
абӷьы a b ʁʲ ə
агды a ɡ d ə
агәарҭа a ɡʷ a r tʰ a
ажьа a ʒ a
ажьырныҳәа a ʒ ɨ r n ɨ ħʷ a
акы a kʼ ɨ
амахәар a m a χʷ a r
амени a m e n i
амц a m t͡s
амшынҳәа ɑ m ʂ ɨ n ħʷ ɑ
амшә a m ʃʷ
анашанаҟаҵаҩ a n a ʂ a n a qʼ a t͡sʼ a ɥ
асофра a s o f r a
асууари a s u w a r i j
ах'әшә a χˤʷ ʃʷ
аха a x a
аха a χ a
ахазына a χ a z ə n a
ахамы a χ a m ə
ахарҵәы a χ a r t͡ɕʷ ɨ
ахаҳә a x a ħʷ
ахш a χ ʂ
ахы a χ ə
ахыц a χ ə t sʰ
ахәмарра a χʷ m a r r a
ахәхәа a χʷ χʷ a
ахәшә a χʷ ʃʷ
ац a t͡s
аца a t sʰ a
ацыр a t sʰ ə r
ашьа a ʃ a
аџьыка a d͡ʒ ə kʼ a
ақаанун a kʰ aː n u n
ақәыџьма a kʷ ə d͡ʒ m a
аҟәараӷ a qʼʷ a r a ʁ
аҭаацәа a tʰ a a t͡ɕʷʰ a
аҳа a ħ a
аҳамҭа a ħ a m tʰ a
аҳасабтә a ħ a s a b tʷʼ ə
аҳәа a ħʷ ə
аҵаа a t sʼ a a
аӡа a d z a
аӡы a d͡z ɨ
аӡын a d z ə n
аӡәыцәи a d͡ʑʷ ɨ t͡ɕʰʷ ɨ j
аӷу ɑ ʁ ɨ w
аԥсыӡ a pʰ s ə d͡z
аԥшьырҳа a pʰ ʃ ə r ħ a
аԥҳа a p ħ a
аԥҳал a pʰ ħ a l
бжьба p ʒ p a
гь c
гә
дә
дә
жь ʒ
жә ʒʷ
жәаа ʒʷ a a
жәаба ʒʷ a p a
жәаха ʒʷ a x a
жәаҩа ʒʷ a ɥ a
жәба ʒʷ p a
жәеиза ʒʷ a j z a
жәибжь ʒʷ ɨ j p ʒ
жәиԥшь ʒʷ ɨ j pʰ ʃ
жәохә ʒʷ a xʷ
зеижә z a j ʒʷ
кь
кә kʼʷ
тә tʼʷ
фба f p a
х' χˤ
х'ә χˤʷ
хь ç
хә ʍ
хәба xʷ p a
хԥа x pʰ a
ць t ɕʰ
цә t͡ɕʷʰ
шь ʃ
шә ʃʷ
џ d͡ʐ
џь d͡ʒ
ҕ ɣ
ҕ ʁ
ҕь ɣʲ
ҕь ʁʲ
ҕә ɣʷ
ҕә ʁʷ
ҙ ʑ
ҙә ʑʷ
қь
қә kʷʰ
ҟ
ҟь qʼʲ
ҟә qʼʷ
ҩ ɥ
ҩажәа ɥ a ʒʷ a
ҩба ɥ p a
ҫ ɕ
ҫә ɕʷ
ҭ
ҭә tʷʰ
ҳ ħ
ҳә ħʷ
ҵ t͡sʼ
ҵь t ɕʼ
ҵә t͡ɕʼʷ
ҷ t͡ʃʼ
ҽ t͡ʂʰ
ҿ t͡ʂʼ
ә ʷ
ӡ d͡z
ӡь d ʑ
ӡә d͡ʑʷ
ӷь ʁʲ
ӷь ʝ
ӷә ɣʷ
ӷә ʁʷ
ԥ
ԥшьба pʰ ʃ p a
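
For readers who want to consume a file like `abk_cyrl_broad.tsv`, here is a minimal sketch of a loader. It assumes each line holds a word, then whitespace (a tab in the real file, shown as a space in this rendering), then space-separated IPA segments. The function name `parse_pron_lines` is hypothetical and not part of the repository. A word listed on several lines, such as `аха`, gets one pronunciation per line, and a line with no transcription gets an empty segment list.

```python
from collections import defaultdict


def parse_pron_lines(lines):
    """Map each word to a list of pronunciations (each a list of segments).

    Hypothetical helper: assumes "word<whitespace>seg seg seg" per line.
    """
    lexicon = defaultdict(list)
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        # Split off the word at the first run of whitespace (tab or space).
        parts = line.split(None, 1)
        word = parts[0]
        # The remainder, if any, is a space-separated segment sequence.
        segments = parts[1].split() if len(parts) > 1 else []
        lexicon[word].append(segments)
    return dict(lexicon)


# A few rows copied from the file above.
sample = [
    "абна\ta b n a",
    "аха\ta x a",
    "аха\ta χ a",
    "ҳә\tħʷ",
]
lexicon = parse_pron_lines(sample)
```

Keeping every pronunciation in a list, rather than overwriting, preserves homographs with more than one transcription.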