Skip to content

Commit

Permalink
added extra cleft checking to validate_gd_extras.py and fixed missing…
Browse files Browse the repository at this point in the history
… csubj:cop relations in train. Fixes #43.
  • Loading branch information
colinbatchelor committed Dec 17, 2024
1 parent 69aea50 commit 483536b
Show file tree
Hide file tree
Showing 3 changed files with 675 additions and 642 deletions.
24 changes: 12 additions & 12 deletions gd_arcosg-ud-dev.conllu
Original file line number Diff line number Diff line change
Expand Up @@ -1115,7 +1115,7 @@
# text = uill ‘s iongantach mar a chuala
1 uill uill INTJ I _ 3 discourse _ _
2 ‘s is AUX Wp-i Tense=Pres 3 cop _ _
3 iongantach iongantach ADJ Ap CleftType=Adj 0 root _ _
3 iongantach iongantach ADJ Ap _ 0 root _ _
4 mar mar SCONJ Cs _ 6 mark _ _
5 a a PART Q-r PartType=Vb|PronType=Rel 6 mark:prt _ _
6 chuala cluinn VERB V-s Mood=Ind|Tense=Past|VerbForm=Fin 3 csubj:cop _ _
Expand Down Expand Up @@ -1610,7 +1610,7 @@
21 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 17 xcomp:pred _ _
22 a a PART Q-r PartType=Vb|PronType=Rel 24 obl _ _
23 b’ is AUX Ws Tense=Past 24 cop _ _
24 fhearr math ADJ Apc CleftType=Adj|Degree=Cmp,Sup 18 acl:relcl _ _
24 fhearr math ADJ Apc Degree=Cmp,Sup 18 acl:relcl _ _
25 a a PART Ug PartType=Inf 26 mark:prt _ _
26 chòrdadh còrd VERB V-h Mood=Ind|VerbForm=Fin 24 csubj:cop _ _
27 ri ri ADP Sp _ 28 case _ _
Expand Down Expand Up @@ -1771,7 +1771,7 @@
5 gur is AUX Wpdia ExtPos=AUX|Polarity=Aff|Tense=Pres 8 cop _ _
6 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 5 fixed _ _
7 trì trì NUM Mc NumForm=Word|NumType=Card 8 nummod _ _
8 seachdainnean seachdainn NOUN Ncpfn Case=Nom|Gender=Fem|Number=Plur 4 ccomp _ _
8 seachdainnean seachdainn NOUN Ncpfn Case=Nom|CleftType=Nom|Gender=Fem|Number=Plur 4 ccomp _ _
9 nach nach PART Qn PartType=Cmpl|Polarity=Neg 10 mark:prt _ _
10 bi bi VERB V-f--d Mood=Ind|Tense=Fut|VerbForm=Fin 8 csubj:cleft _ _
11 an an DET Tds Definite=Def|Number=Sing|PronType=Art 12 det _ _
Expand Down Expand Up @@ -3890,7 +3890,7 @@
11 adhbharachadh adhbharaich NOUN Nv VerbForm=Vnoun 1 xcomp:pred _ _
12 gur is AUX Wpdia ExtPos=AUX|Polarity=Aff|Tense=Pres 14 cop _ _
13 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 12 fixed _ _
14 taobh taobh NOUN Ncsmn Case=Nom|Gender=Masc|Number=Sing 11 ccomp _ _
14 taobh taobh NOUN Ncsmn Case=Nom|CleftType=Nom|Gender=Masc|Number=Sing 11 ccomp _ _
15 an an NOUN Ncsmn Case=Nom|Gender=Masc|Number=Sing 14 nmod _ _
16 iar iar NOUN Ncsmn Case=Nom|Gender=Masc|Number=Sing 14 nmod _ _
17 na an DET Tdsfg Case=Gen|Definite=Def|Gender=Fem|Number=Sing|PronType=Art 18 det _ _
Expand Down Expand Up @@ -4234,7 +4234,7 @@
6 an an ADP Sp _ 5 fixed _ _
7 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 5 fixed _ _
8 mar mar ADP Sp _ 9 case _ _
9 thoradh toradh NOUN Ncsmd Case=Dat|Gender=Masc|Number=Sing 0 root _ _
9 thoradh toradh NOUN Ncsmd Case=Dat|CleftType=Obl|Gender=Masc|Number=Sing 0 root _ _
10 air air ADP Sp _ 12 case _ _
11 an an DET Tdsf Definite=Def|Gender=Fem|Number=Sing|PronType=Art 12 det _ _
12 iomagain iomagain NOUN Ncsfd Case=Dat|Gender=Fem|Number=Sing 9 nmod _ _
Expand Down Expand Up @@ -11378,15 +11378,15 @@
# sent_id = pw09_025
# text = Mar eisimpleir tha dùil gun tèid factaraidh giullachd bhradan fhosgladh an Sgalpaigh.
1 Mar mar ADP Sp _ 2 case _ _
2 eisimpleir eisimpleir NOUN Ncsmd Case=Dat|Gender=Masc|Number=Sing 3 obl _ OblType=Conj
2 eisimpleir eisimpleir NOUN Ncsmd Case=Dat|Gender=Masc|Number=Sing 3 obl _ OblType=Man
3 tha bi VERB V-p Mood=Ind|Tense=Pres|VerbForm=Fin 0 root _ _
4 dùil dùil NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 3 nsubj _ _
5 gun gu PART Qa PartType=Cmpl 6 mark:prt _ _
6 tèid rach VERB V-f--d Mood=Ind|Tense=Fut|VerbForm=Fin 4 acl _ _
7 factaraidh factaraidh NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 6 nsubj _ _
8 giullachd giullachd NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 6 obj _ _
5 gun gu PART Qa PartType=Cmpl 10 mark:prt _ _
6 tèid rach AUX V-f--d Mood=Ind|Tense=Fut|VerbForm=Fin 10 aux:pass _ _
7 factaraidh factaraidh NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 10 nsubj:pass _ _
8 giullachd giullachd NOUN Ncsfn Case=Nom|Gender=Fem|Number=Sing 7 nmod _ _
9 bhradan brad NOUN Ncpmg Case=Gen|Gender=Masc|Number=Plur 8 nmod _ _
10 fhosgladh fosgail NOUN Nv VerbForm=Vnoun 6 xcomp _ _
10 fhosgladh fosgail NOUN Nv VerbForm=Vnoun 4 acl _ _
11 an an ADP Sp _ 12 case _ _
12 Sgalpaigh Sgalpaigh PROPN Nt NounType=Top 10 obl _ SpaceAfter=No
13 . . PUNCT Fe _ 3 punct _ _
Expand Down Expand Up @@ -13292,7 +13292,7 @@
17 smaoineachadh smaoinich NOUN Nv VerbForm=Vnoun 14 xcomp:pred _ _
18 gur is AUX Wpdia ExtPos=AUX|Polarity=Aff|Tense=Pres 20 cop _ _
19 e e PRON Pp3sm Gender=Masc|Number=Sing|Person=3|PronType=Prs 18 fixed _ _
20 sin sin PRON Pd PronType=Dem 17 ccomp _ _
20 sin sin PRON Pd CleftType=Nom|PronType=Dem 17 ccomp _ _
21 a a PART Q-r PartType=Vb|PronType=Rel 23 nsubj _ _
22 b' is AUX Ws Tense=Past 23 cop _ _
23 aobhar aobhar NOUN Ncsmn Case=Nom|Gender=Masc|Number=Sing 20 csubj:cleft _ _
Expand Down
Loading

0 comments on commit 483536b

Please sign in to comment.