xpos abbreviations for English #155

Closed
czopluoglu opened this issue Sep 10, 2021 · 1 comment

Comments

@czopluoglu

Is there a reference that explains what the abbreviations returned in the xpos column mean for English?

I can find the ones for upos at this link (https://universaldependencies.org/u/pos/index.html); however, the ones under xpos require a lot of guesswork.

@martinpopel
Member

From the UDPipe point of view, XPOS is just another tag to predict, and the predicted values are based on whatever was used for XPOS in the training data.

In CoNLL-U, XPOS is a treebank-specific PoS tag, so there is no unified reference. XPOS is optional in UD (some treebanks have none). It usually contains legacy, pre-UD PoS tags. For example, English uses Penn Treebank PoS tags and Czech uses PDT tags. In both cases the XPOS tags encode not only the part of speech but also other morphological features, which are also encoded in the CoNLL-U FEATS column. For other languages/treebanks, search their documentation (the metadata should at least contain a note on XPOS such as "manual native").
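
To make the column layout concrete, here is a minimal Python sketch that splits one CoNLL-U token line into its ten fields and looks the XPOS value up in a small table of Penn Treebank tags. The sample line, the helper names (`parse_token_line`, `describe_xpos`), and the hand-written `PTB_TAGS` table are assumptions made for illustration. The table is only a partial subset, not the full tag set, and none of this is part of UDPipe.

```python
# The ten CoNLL-U columns, in the order the format defines them.
CONLLU_FIELDS = ["ID", "FORM", "LEMMA", "UPOS", "XPOS",
                 "FEATS", "HEAD", "DEPREL", "DEPS", "MISC"]

# Partial, hand-written subset of Penn Treebank tags (illustration only).
PTB_TAGS = {
    "NN": "noun, singular or mass",
    "NNS": "noun, plural",
    "NNP": "proper noun, singular",
    "VB": "verb, base form",
    "VBD": "verb, past tense",
    "VBZ": "verb, 3rd person singular present",
    "JJ": "adjective",
    "DT": "determiner",
    "IN": "preposition or subordinating conjunction",
    "RB": "adverb",
}


def parse_token_line(line):
    """Split one tab-separated CoNLL-U token line into a field dict."""
    values = line.rstrip("\n").split("\t")
    if len(values) != len(CONLLU_FIELDS):
        raise ValueError(f"expected {len(CONLLU_FIELDS)} columns, got {len(values)}")
    return dict(zip(CONLLU_FIELDS, values))


def describe_xpos(tag):
    """Return a description for a Penn Treebank tag, if it is in the subset."""
    return PTB_TAGS.get(tag, "unknown (not in this partial table)")


# Hypothetical token line, written by hand for this example.
sample = "2\tdogs\tdog\tNOUN\tNNS\tNumber=Plur\t3\tnsubj\t_\t_"
token = parse_token_line(sample)
print(token["FORM"], token["UPOS"], token["XPOS"], describe_xpos(token["XPOS"]))
```

In this sample the XPOS tag NNS carries the plural marking that FEATS also records as Number=Plur, which is the overlap described above. A lookup table like this only makes sense for treebanks whose XPOS column actually uses Penn Treebank tags.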
