-
Notifications
You must be signed in to change notification settings - Fork 404
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bugs in "textattack train” #488
Comments
I tried running the above commands on my Machine and observed the same errors. (Ubuntu 20.04, Python 3.8.11, Tensorflow 2.6, PyTorch 1.9.1) |
For the first Issue with the snli dataset. From HuggingFace SNLI Dataset page , regarding the snli labels .
Since the SNLI labels have four possible values [ -1 , 0 , 1 , 2 ] , we need to first filter -1 out of the dataset and then set --model-num-labels to 3. so the textattack command to run should be
The model trains without errors with the new arguments. One potential enhancement would be that in Do let me know if the enhancement looks good , I can go ahead and open a PR for the same. |
For the Second Issue with the T5 Models, T5 Models have their own wrappers for the model and tokenizer defined in In These Assert Statements need to be updated with classes respective to t5_for_text_to_text and t5_tokenizer . Current Assert Statements
These Assert Statements should be updated to
@jxmorris12 |
Hey @VijayKalmath. The changes to T5 look great. For the SNLI case, can you we a check to make sure all the labels are accounted for, and throw an |
@jxmorris12 Thank you for confirming the T5 changes. Can you please elaborate more on what you mean by |
She ran this command: Instead, we should check the number of labels, and if it's not right through |
e.g.
The text was updated successfully, but these errors were encountered: