Adding GLUE to PyTorch #5

Open
PattynR opened this issue Dec 18, 2018 · 1 comment
Comments


PattynR commented Dec 18, 2018

Hi, I am currently adding some files to the PyTorch project that would enable it to import the GLUE datasets directly. However, I am facing a problem with the QQP and SNLI datasets: on some lines, the number of tabs does not match the number of columns declared in the first line of the file. For example, in the train.tsv file of QQP, line 97,931 is:

"\tWas Muhammad a real historical figure? What is the evidence for his existence?\t0

So that line has only 3 columns, whereas the header of the file declares 6 columns.
How should I handle those lines?

Thank you.

@sleepinyourhat
Contributor

Hi P,

We have some notes on this issue here: https://groups.google.com/forum/#!topic/glue-benchmark-discuss/J5p3oTpqogY

Also, for a reference implementation of GLUE data loading/prediction writing, I'd look at jiant rather than this codebase: https://github.com/jsalt18-sentence-repl/jiant
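
For illustration only, here is a minimal sketch of one possible way to deal with such rows: skip any row whose field count differs from the header and keep a record of what was skipped. This is an assumption about a reasonable approach, not necessarily what jiant or the linked discussion recommends. The function name `read_tsv_skipping_malformed` and the sample data are hypothetical. Quoting is disabled because the problematic line begins with a stray `"`, which Python's `csv` module would otherwise treat as the start of a quoted field.

```python
import csv
import io


def read_tsv_skipping_malformed(handle, delimiter="\t"):
    """Read a TSV stream, keeping only rows that match the header width.

    Returns (header, rows, skipped), where rows is a list of dicts and
    skipped is a list of (line_number, raw_fields) for malformed rows.
    """
    # QUOTE_NONE: treat '"' as an ordinary character, so a stray quote
    # cannot swallow the following lines into one field.
    reader = csv.reader(handle, delimiter=delimiter, quoting=csv.QUOTE_NONE)
    header = next(reader)
    rows, skipped = [], []
    # The header is line 1, so data rows start at line 2.
    for line_no, fields in enumerate(reader, start=2):
        if len(fields) == len(header):
            rows.append(dict(zip(header, fields)))
        else:
            skipped.append((line_no, fields))
    return header, rows, skipped


# Hypothetical sample: a 6-column header, one well-formed row, and the
# 3-column line quoted in the issue.
sample = (
    "id\tqid1\tqid2\tquestion1\tquestion2\tis_duplicate\n"
    "0\t1\t2\tHow do I read a TSV file?\tHow can I parse a TSV file?\t1\n"
    "\"\tWas Muhammad a real historical figure? "
    "What is the evidence for his existence?\t0\n"
)

header, rows, skipped = read_tsv_skipping_malformed(io.StringIO(sample))
print(len(rows), "rows kept;", len(skipped), "rows skipped")
for line_no, fields in skipped:
    print(f"line {line_no}: expected {len(header)} fields, got {len(fields)}")
```

Whether dropping such rows is acceptable depends on the task; logging the skipped line numbers at least makes the decision visible and easy to revisit.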
