This repository has been archived by the owner on Mar 6, 2019. It is now read-only.

Possibly Improved Sequence Tagger #9

Open
ghost opened this issue Oct 23, 2016 · 2 comments

Comments

@ghost

ghost commented Oct 23, 2016

Hi,
This is a pretty awesome project, thanks for posting it! I've been experimenting with structured prediction methods, and decided to use this project to compare CRFs with Learning to Search (L2S) methods. Pending a more rigorous evaluation (fingers crossed), I'm seeing roughly 96% per-token accuracy and 95.55% sentence-level accuracy with L2S and vw (Vowpal Wabbit), taking 22 minutes total for read + train + test. This is on an 80/20 train/test split of the full dataset, using the output of bin/generate. Once I've cleaned up the source, I'll be happy to send over a pull request.

- Arthur
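
For readers comparing numbers, here is a minimal sketch of how the two metrics mentioned above are usually computed. It is not the evaluation code used for the figures in this comment; the function name `tagging_accuracy` and the tag labels in the usage example are illustrative assumptions.

```python
from typing import Sequence, Tuple


def tagging_accuracy(
    gold: Sequence[Sequence[str]],
    pred: Sequence[Sequence[str]],
) -> Tuple[float, float]:
    """Return (per-token accuracy, sentence-level accuracy).

    A sentence counts as correct only if every one of its tokens
    received the gold tag.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must contain the same number of sentences")

    total_tokens = 0
    correct_tokens = 0
    correct_sentences = 0

    for gold_tags, pred_tags in zip(gold, pred):
        if len(gold_tags) != len(pred_tags):
            raise ValueError("each sentence must have one predicted tag per token")
        matches = sum(g == p for g, p in zip(gold_tags, pred_tags))
        total_tokens += len(gold_tags)
        correct_tokens += matches
        correct_sentences += int(matches == len(gold_tags))

    token_acc = correct_tokens / total_tokens if total_tokens else 0.0
    sentence_acc = correct_sentences / len(gold) if gold else 0.0
    return token_acc, sentence_acc


if __name__ == "__main__":
    # Illustrative tags only; the real label set comes from the training data.
    gold = [["QTY", "UNIT", "NAME"], ["QTY", "NAME"]]
    pred = [["QTY", "UNIT", "NAME"], ["QTY", "COMMENT"]]
    print(tagging_accuracy(gold, pred))
```

In the usage example, four of five tokens match and one of two sentences is fully correct, so the two metrics differ even on the same predictions.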

@tettoffensive

@Zintinio, did you ever clean up the source? I'm curious to see your improvements. I'm seeing a few cases where I get "name": "Salt and pepper" instead of two separate ingredients. Would your improvements help with this sort of problem?

Also, for ingredients like "Basil" and "Basil leaves", I think it would be better if both were recognized as the same ingredient. But that might be much more challenging ;)
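
One way to picture the two problems described here is as a post-processing step on the tagger's "name" output. The sketch below is a naive heuristic written only for illustration: it is not part of this project and not the improvement discussed in this thread, and the `DESCRIPTORS` word list and the function name `split_and_normalize` are hypothetical.

```python
import re
from typing import List

# Hypothetical descriptor words to drop when normalizing a name.
# A real list would need to be curated from the data.
DESCRIPTORS = {"leaves", "leaf", "sprigs", "sprig"}

# Split on " and ", "&", or commas (case-insensitive, no capture groups).
_SEPARATORS = re.compile(r"\s+and\s+|\s*&\s*|\s*,\s*", flags=re.IGNORECASE)


def split_and_normalize(name: str) -> List[str]:
    """Split a conjoined ingredient name and normalize each part.

    Each part is lowercased and stripped of descriptor words, so that
    "Basil" and "Basil leaves" map to the same string.
    """
    ingredients = []
    for part in _SEPARATORS.split(name.strip()):
        words = [w for w in part.lower().split() if w not in DESCRIPTORS]
        if words:  # skip empty pieces left by leading/trailing separators
            ingredients.append(" ".join(words))
    return ingredients


if __name__ == "__main__":
    print(split_and_normalize("Salt and pepper"))
    print(split_and_normalize("Basil leaves"))
```

A rule like this will also split names that should stay together (for example "macaroni and cheese"), which is part of why this is harder than it looks.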

@ghost
Author

ghost commented Dec 1, 2017 via email
