forked from UniversalDependencies/UD_Hebrew-HTB
-
Notifications
You must be signed in to change notification settings - Fork 0
/
eval.log
113 lines (113 loc) · 8.48 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
Running the following version of UD tools:
commit ca3b862e9e2871c76cd91bf78ffb089c0c60184d
Author: Dan Zeman <zeman@ufal.mff.cuni.cz>
Date: Thu May 6 20:16:19 2021 +0200
Evaluating the following revision of UD_Hebrew-HTB:
commit 75c1fbfd022265f3c616a23d130f6742a99acb7a
Merge: bdc43a8 f425049
Author: Dan Zeman <zeman@ufal.mff.cuni.cz>
Size: counted 161411 of 161411 words (nodes).
Size: min(0, log((N/1000)**2)) = 10.167907814338.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Found more than 10000 training words.
Split: Found at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.8.
Universal POS tags: 15 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 95256 out of 161411 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 29 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 0.8.
Udapi:
TOTAL 13937
Udapi: found 13937 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 161411 words.
Genres: found 1 out of 17 known.
validate.py --lang he --max-err=10 UD_Hebrew-HTB/he_htb-ud-dev.conllu
[Line 503 Sent 13 Node 33]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (33:מוכנים:aux --> 32:ה:mark)
[Line 504 Sent 13 Node 33]: [L5 Morpho aux-lemma] 'מוכן' is not an auxiliary verb in language [he]
[Line 522 Sent 14 Node 6]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (6:יש:aux --> 2:דבר_:obl)
[Line 526 Sent 14 Node 6]: [L5 Morpho aux-lemma] 'יש' is not an auxiliary verb in language [he]
[Line 582 Sent 15 Node 14]: [L5 Syntax cop-lemma] 'הוא' is not a copula in language [he]
[Line 1061 Sent 27 Node 27]: [L5 Morpho aux-lemma] 'ייתכן' is not an auxiliary verb in language [he]
[Line 1078 Sent 28 Node 15]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (15:יכול:aux --> 3:אלה:obl)
[Line 1093 Sent 28 Node 15]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (15:יכול:aux --> 14:אני:nsubj)
[Line 1125 Sent 29 Node 8]: [L3 Syntax leaf-cc] 'cc' not expected to have children (8:ו:cc --> 1:ו:cc)
[Line 1322 Sent 34 Node 34]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'PRON'
[Line 1338 Sent 35 Node 9]: [L3 Syntax rel-upos-aux] 'aux' should be 'AUX' but it is 'VERB'
[Line 1374 Sent 36 Node 18]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (18:צריך:aux --> 15:מלכתחילה:advmod)
...suppressing further errors regarding Syntax
[Line 1377 Sent 36 Node 18]: [L5 Morpho aux-lemma] 'צריך' is not an auxiliary verb in language [he]
[Line 1421 Sent 37 Node 16]: [L5 Morpho aux-lemma] 'אינו' is not an auxiliary verb in language [he]
[Line 1938 Sent 54 Node 3]: [L5 Morpho aux-lemma] 'מוכן' is not an auxiliary verb in language [he]
[Line 1998 Sent 55 Node 2]: [L5 Morpho aux-lemma] 'מוכן' is not an auxiliary verb in language [he]
[Line 2129 Sent 57 Node 2]: [L5 Morpho aux-lemma] 'מוכן' is not an auxiliary verb in language [he]
[Line 2533 Sent 65 Node 13]: [L5 Morpho aux-lemma] 'אינו' is not an auxiliary verb in language [he]
...suppressing further errors regarding Morpho
Morpho errors: 74
Syntax errors: 218
*** FAILED *** with 292 errors
Exit code: 1
validate.py --lang he --max-err=10 UD_Hebrew-HTB/he_htb-ud-test.conllu
[Line 19 Sent 5726 Node 14]: [L5 Morpho aux-lemma] 'אמור' is not an auxiliary verb in language [he]
[Line 111 Sent 5728 Node 15]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'NUM'
[Line 117 Sent 5729 Node 10]: [L3 Syntax leaf-cc] 'cc' not expected to have children (10:ו:cc --> 1:אולם:cc)
[Line 230 Sent 5733 Node 6]: [L3 Syntax rel-upos-advmod] 'advmod' should be 'ADV' but it is 'NUM'
[Line 303 Sent 5735 Node 14]: [L3 Syntax rel-upos-det] 'det' should be 'DET' or 'PRON' but it is 'NOUN'
[Line 517 Sent 5741 Node 14]: [L5 Syntax cop-lemma] 'הוא' is not a copula in language [he]
[Line 568 Sent 5744 Node 1]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (1:אפשר:aux --> 2:כבר:advmod)
[Line 567 Sent 5744 Node 1]: [L5 Morpho aux-lemma] 'אפשר' is not an auxiliary verb in language [he]
[Line 656 Sent 5748 Node 2]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (2:יכולה:aux --> 1:ראשל"ץ:nsubj)
[Line 660 Sent 5748 Node 2]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (2:יכולה:aux --> 5:עתה:advmod)
[Line 662 Sent 5748 Node 2]: [L3 Syntax leaf-aux-cop] 'aux' not expected to have children (2:יכולה:aux --> 7:חזק:advmod)
...suppressing further errors regarding Syntax
[Line 1173 Sent 5772 Node 7]: [L5 Morpho aux-lemma] 'אמור' is not an auxiliary verb in language [he]
[Line 1676 Sent 5789 Node 6]: [L5 Morpho aux-lemma] 'צריך' is not an auxiliary verb in language [he]
[Line 1799 Sent 5794 Node 1]: [L5 Morpho aux-lemma] 'כדאי' is not an auxiliary verb in language [he]
[Line 1825 Sent 5794 Node 22]: [L5 Morpho aux-lemma] 'יכול' is not an auxiliary verb in language [he]
[Line 2356 Sent 5813 Node 14]: [L5 Morpho aux-lemma] 'אינו' is not an auxiliary verb in language [he]
[Line 2407 Sent 5815 Node 7]: [L5 Morpho aux-lemma] 'סביר' is not an auxiliary verb in language [he]
[Line 3322 Sent 5845 Node 4]: [L5 Morpho aux-lemma] 'יש' is not an auxiliary verb in language [he]
...suppressing further errors regarding Morpho
Morpho errors: 63
Syntax errors: 240
*** FAILED *** with 303 errors
Exit code: 1
validate.py --lang he --max-err=10 UD_Hebrew-HTB/he_htb-ud-train.conllu
[Line 287 Sent 497 Node 1]: [L5 Morpho aux-lemma] 'קשה' is not an auxiliary verb in language [he]
[Line 369 Sent 502 Node 1]: [L3 Syntax rel-upos-det] 'det' should be 'DET' or 'PRON' but it is 'ADV'
[Line 401 Sent 504 Node 1]: [L3 Syntax rel-upos-det] 'det' should be 'DET' or 'PRON' but it is 'ADV'
[Line 412 Sent 504 Node 9]: [L3 Syntax rel-upos-det] 'det' should be 'DET' or 'PRON' but it is 'ADV'
[Line 479 Sent 505 Node 26]: [L3 Syntax leaf-fixed] 'fixed' not expected to have children (26:בית_:fixed --> 25:ל:case)
[Line 482 Sent 505 Node 26]: [L3 Syntax leaf-fixed] 'fixed' not expected to have children (26:בית_:fixed --> 28:_הוא:nmod)
[Line 484 Sent 505 Node 26]: [L3 Syntax leaf-fixed] 'fixed' not expected to have children (26:בית_:fixed --> 30:ראש:nmod)
[Line 859 Sent 516 Node 14]: [L3 Syntax rel-upos-det] 'det' should be 'DET' or 'PRON' but it is 'NUM'
[Line 1530 Sent 541 Node 22]: [L3 Syntax leaf-fixed] 'fixed' not expected to have children (22:שוק:fixed --> 21:ל:case)
[Line 1534 Sent 541 Node 22]: [L3 Syntax leaf-fixed] 'fixed' not expected to have children (22:שוק:fixed --> 24:ירקות:compound)
...suppressing further errors regarding Syntax
[Line 3027 Sent 588 Node 3]: [L5 Morpho aux-lemma] 'אינו' is not an auxiliary verb in language [he]
[Line 3039 Sent 588 Node 13]: [L5 Morpho aux-lemma] 'מסוגל' is not an auxiliary verb in language [he]
[Line 3042 Sent 588 Node 16]: [L5 Morpho aux-lemma] 'מסוגל' is not an auxiliary verb in language [he]
[Line 3052 Sent 588 Node 25]: [L5 Morpho aux-lemma] 'חשוב' is not an auxiliary verb in language [he]
[Line 3151 Sent 590 Node 10]: [L5 Morpho aux-lemma] 'אפשר' is not an auxiliary verb in language [he]
[Line 3232 Sent 591 Node 23]: [L5 Morpho aux-lemma] 'צריך' is not an auxiliary verb in language [he]
[Line 3309 Sent 595 Node 13]: [L5 Morpho aux-lemma] 'אינו' is not an auxiliary verb in language [he]
[Line 3614 Sent 602 Node 12]: [L5 Morpho aux-lemma] 'אינו' is not an auxiliary verb in language [he]
...suppressing further errors regarding Morpho
Morpho errors: 910
Syntax errors: 2929
*** FAILED *** with 3839 errors
Exit code: 1
Validity: 0.01
(weight=0.0769230769230769) * (score{features}=0.8) = 0.0615384615384615
(weight=0.0769230769230769) * (score{genres}=0.0588235294117647) = 0.00452488687782805
(weight=0.0769230769230769) * (score{lemmas}=0.8) = 0.0615384615384615
(weight=0.256410256410256) * (score{size}=0.735977709377992) = 0.188712233173844
(weight=0.0512820512820513) * (score{split}=1) = 0.0512820512820513
(weight=0.0769230769230769) * (score{tags}=0.705882352941177) = 0.0542986425339367
(weight=0.307692307692308) * (score{udapi}=0.13655203176983) = 0.0420160097753323
(weight=0.0769230769230769) * (score{udeprels}=0.627027027027027) = 0.0482328482328482
(TOTAL score=0.512143594952764) * (availability=1) * (validity=0.01) = 0.00512143594952764
STARS = 0
UD_Hebrew-HTB 0.00512143594952764 0