Create test data and measure quality #5

Open
vidraj opened this issue Oct 14, 2017 · 0 comments
vidraj commented Oct 14, 2017

We need an objective measure of quality so that we can see the impact of individual code changes. A reasonable choice is the combination of precision, recall and F-measure, obtained by comparing the segmenter output against gold-standard data.
To do this, we first need the gold-standard data, which someone has to create by manually annotating morph boundaries in a piece of text.
If we obtain a larger amount of data, we could also use it to learn how to segment; for example, the probabilities of various phonemic changes could be extracted from it.
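
A rough sketch of how the boundary-level precision/recall/F-measure could be computed once gold data exists. Everything here is an assumption made for illustration and not part of the project: the function names `boundaries` and `boundary_prf`, the `-` separator between morphs, and the choice to score boundary positions rather than whole morphs.

```python
from typing import List, Set, Tuple


def boundaries(segmented: str, sep: str = "-") -> Set[int]:
    """Return the character offsets (in the unsegmented word) of morph boundaries.

    The separator and the input format are hypothetical.
    """
    positions: Set[int] = set()
    offset = 0
    morphs = segmented.split(sep)
    # Every morph except the last one is followed by a boundary.
    for morph in morphs[:-1]:
        offset += len(morph)
        positions.add(offset)
    return positions


def boundary_prf(
    gold: List[str], predicted: List[str], sep: str = "-"
) -> Tuple[float, float, float]:
    """Micro-averaged precision, recall and F1 over morph boundaries."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    tp = fp = fn = 0
    for g, p in zip(gold, predicted):
        # Both segmentations must describe the same underlying word.
        if g.replace(sep, "") != p.replace(sep, ""):
            raise ValueError(f"word mismatch: {g!r} vs {p!r}")
        gold_b, pred_b = boundaries(g, sep), boundaries(p, sep)
        tp += len(gold_b & pred_b)  # boundaries found correctly
        fp += len(pred_b - gold_b)  # boundaries predicted but not in gold
        fn += len(gold_b - pred_b)  # gold boundaries that were missed

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if precision + recall
        else 0.0
    )
    return precision, recall, f1
```

For instance, calling `boundary_prf(["un-happi-ness", "cat-s"], ["un-happiness", "ca-ts"])` counts one correct boundary, one spurious one and two missed ones. Scoring boundaries instead of whole morphs gives partial credit for partly correct segmentations, which may or may not be what we want here.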

@vidraj vidraj self-assigned this Oct 14, 2017