acquiring data on the lengths of spans #1111

Open
jeswan opened this issue Sep 17, 2020 · 1 comment


jeswan commented Sep 17, 2020

Issue by lovodkin93
Monday Aug 24, 2020 at 12:12 GMT
Originally opened as nyu-mll/jiant#1111


Hello, I've been working with your project for a while now.
I've mostly worked on the Coreference and SRL tasks, both of which have two spans, and I've been analyzing the impact of the distance between these spans. For that, I've been using the "stratifier" column in the dataframe acquired from the OntoNotes dataset (using the tsv file generated by the analyze_runs.py file located in the probing directory). Recently, I have also wanted to examine the impact of span length in the NER task, but I can't find how to get that information from the tsv file generated by analyze_runs.py.
Does anyone happen to know how I can acquire that information?
Thanks!


jeswan commented Sep 17, 2020

Comment by iftenney
Monday Aug 24, 2020 at 20:09 GMT


Hi,

I don't think the span length is output by default, but you could add a couple lines to compute it in analysis.py:
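A minimal sketch of what such a computation might look like, not the actual code from analysis.py. It assumes each example carries a span stored as a `[start, end)` pair of token indices with an exclusive end; the field name `span1` and the helper names `span_length` and `add_span_lengths` are hypothetical and would need to be adapted to whatever the analysis code really exposes.

```python
from typing import Dict, List, Sequence


def span_length(span: Sequence[int]) -> int:
    """Number of tokens in a span given as (start, end), end exclusive (assumed)."""
    start, end = span
    return end - start


def add_span_lengths(records: List[Dict], span_key: str = "span1") -> List[Dict]:
    """Return copies of the records with an extra '<span_key>_length' field."""
    return [
        {**rec, f"{span_key}_length": span_length(rec[span_key])}
        for rec in records
    ]


# Hypothetical NER-style targets, one span per example.
examples = [
    {"label": "PERSON", "span1": [0, 2]},
    {"label": "GPE", "span1": [5, 6]},
]
with_lengths = add_span_lengths(examples)
```

If the spans live in a pandas DataFrame column instead, the same helper could probably be applied with `df["span1"].apply(span_length)` and the result stored as a new column to stratify on.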
