-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
együttjárhatnak: incorrect POS tags #8
Comments
Balázs, can you check this issue, please? :) |
Did you checked the training corpus? Do we have a bugtracker for it at all? I checked the corpus:
@DavidNemeskey: @vinczev : |
@dlazesz Sorry, but I think this should be done by the owner of the corpus and the tagger, not a third party. :) That said, the above three are all I have discovered; though I did not specifically look for these differences, I added a mapping for the erroneous tags, and this is the list I ended up with:
|
I am not the owner of the corpus. @DavidNemeskey: |
I am not blaming anybody, I just don't know where this error stems from. I have already listed all errors I found. @vinczev I second the notion of having a bug-tracker for the corpus. The errors I sent a few weeks earlier (disagreement between the old and new-style tags, |
The analysis of együttjárhatnak (
QT,HFSTLemm,ML3-PosLem-hfstcode
) is[V][_Mod][Prs.NDef.3Pl]
, which is incorrect: the tags[V]
and[_Mod]
should be[/V]
and[_Mod/V]
, respectively.HFST does not recognize the word (probably because it should be written separately), so it might be some fallback module that produces this analysis?
Similar invalid analyses are
[N][All]
[Num][Nom]
(interestingly enough, HFST returns an analysis for tíz-, so why doesn't it appear in GATE? This word was at the beginning of the sentence, hence the capitalization, but usually it is not a problem)The text was updated successfully, but these errors were encountered: