Parser for Acknowledgments #510

Closed
tantikristanti wants to merge 42 commits into master from AcknowledgmentParser_Desir2ndCodeSprint

@tantikristanti (Collaborator) commented Oct 16, 2019

A branch containing a model for parsing the acknowledgment section, both from raw text strings and from PDF files. This is a result of the second DESIR code sprint.

# Conflicts:
#	grobid-core/src/main/java/org/grobid/core/engines/Engine.java
#	grobid-trainer/src/main/java/org/grobid/trainer/TrainerRunner.java


Former-commit-id: eed3c30
Former-commit-id: 5a503bf
…[label]>[result]<ref/> to <rs type=[label]>[result]<rs/>

Former-commit-id: 8c0934e
# Conflicts:
#	grobid-core/src/main/java/org/grobid/core/document/TEIFormatter.java


Former-commit-id: 68c080a

@lfoppiano (Collaborator) left a comment

Thank you for this PR. I went through it and added some comments. Please have a look at them.

Also, why do you need the clearNLP dictionary data/clearNLP/dictionary-1.3.1.zip?

Thank you

Review comments (outdated, resolved) were left on the following files:
.travis.yml
.zshrc
Brewfile
grobid-home/models/acknowledgment/TrainResult.txt
grobid-home/models/date/backup/model.wapiti
grobid-home/models/date/model.wapiti

@lfoppiano changed the title from "Acknowledgment parser desir2nd code sprint" to "Parser for Acknowledgments" on Oct 18, 2019

@coveralls commented Oct 18, 2019

Coverage decreased (-0.1%) to 37.19% when pulling cef620b on AcknowledgmentParser_Desir2ndCodeSprint into 5d3ad03 on master.

…x all class linked with it, fix the test class as well

Former-commit-id: 21a4ef8
…gingTokenClusteror implementation in AcknowledgmentParser class; 2) add new feature vectors for improving acknowledgement model; 3) automatically split data (80/20)-train-evaluate a new acknowledgment model; 4) add new n-fold evaluation results. Old results are still kept for comparison purposes.

Former-commit-id: c85abd9
… model 0 and correct them), until identifier: 87409333-188e-4307-bc83-91ef9c691aae
@tantikristanti force-pushed the AcknowledgmentParser_Desir2ndCodeSprint branch 2 times, most recently from 1f9f0c5 to 60fcfa7, on November 21, 2019 14:49
@lfoppiano marked this pull request as draft on April 17, 2020 05:28

@kermitt2 (Owner) commented

So this is finally covered by PR #1046, with some redesign, re-annotation and more training data.

@kermitt2 closed this on Feb 13, 2024
4 participants