Skip to content

Commit

Permalink
Task tweaks (#1149)
Browse files Browse the repository at this point in the history
* qasrl

* set logic

* update readme

* ANLI documentation

Co-authored-by: jeswan <57466294+jeswan@users.noreply.github.com>
  • Loading branch information
zphang and jeswan authored Oct 13, 2020
1 parent 82ed396 commit 392976c
Show file tree
Hide file tree
Showing 4 changed files with 11 additions and 8 deletions.
11 changes: 6 additions & 5 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -77,8 +77,8 @@ downloader.download_data(["mrpc"], "/content/data")
# Set up the arguments for the Simple API
args = run.RunConfiguration(
run_name="simple",
exp_dir="/content/exp",
data_dir="/content/data",
exp_dir="/path/to/exp",
data_dir="/path/to/exp/tasks",
model_type="roberta-base",
tasks="mrpc",
train_batch_size=16,
Expand All @@ -91,15 +91,16 @@ run.run_simple(args)

Bash version:
```bash
BASE_PATH=/path/to/exp
python jiant/scripts/download_data/runscript.py \
download \
--tasks mrpc \
--output_path /content/data
--output_path /path/to/exp/tasks
python jiant/proj/simple/runscript.py \
run \
--run_name simple \
--exp_dir /content/data \
--data_dir /content/data \
--exp_dir /path/to/exp \
--data_dir /path/to/exp/tasks \
--model_type roberta-base \
--tasks mrpc \
--train_batch_size 16 \
Expand Down
2 changes: 1 addition & 1 deletion guides/tasks/supported_tasks.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,7 +29,7 @@
| MultiRC | multirc ||| multirc | SuperGLUE |
| MRPC | mrpc ||| mrpc | GLUE |
| QAMR | qamr ||| qamr | |
| QA-SRL | qa-srl ||| qa-srl | |
| QA-SRL | qasrl ||| qasrl | |
| EP-NER | ner || | ner | Edge-Probing |
| PAWS-X | `pawsx_{lang}` ||| pawsx | XTREME, multi-lang |
| WikiAnn | `panx_{lang}` ||| panx | XTREME, multi-lang |
Expand Down
4 changes: 3 additions & 1 deletion guides/tasks/task_specific.md
Original file line number Diff line number Diff line change
Expand Up @@ -2,7 +2,9 @@

### Adversarial NLI

[Adversarial NLI](https://arxiv.org/pdf/1910.14599.pdf) has 3 rounds of adversarial data creation. A1/A2/A3 are expanding supersets of the previous round.
[Adversarial NLI](https://arxiv.org/pdf/1910.14599.pdf) has 3 rounds of adversarial data creation. A1, A2 and A3 are different rounds of data creation. When downloading, you can use the task names `adversarial_nli_r1`, `adversarial_nli_r2`, `adversarial_nli_r3` to point the the different rounds.

When doing training on the full ANLI dataset, which is SNLI+MNLI+A1+A2+A3, perform training in a multi-task manner with proportional sampling, and be sure to set the `task_to_taskmodel_map` to have all tasks point to the same NLI head.


### Masked Language Modeling (MLM)
Expand Down
2 changes: 1 addition & 1 deletion jiant/scripts/download_data/runscript.py
Original file line number Diff line number Diff line change
Expand Up @@ -16,7 +16,7 @@

# DIRECT_DOWNLOAD_TASKS need to be directly downloaded because the nlp
# implementation differs from the original dataset format
NLP_DOWNLOADER_TASKS = GLUE_TASKS | SUPERGLUE_TASKS | OTHER_NLP_TASKS - DIRECT_DOWNLOAD_TASKS
NLP_DOWNLOADER_TASKS = (GLUE_TASKS | SUPERGLUE_TASKS | OTHER_NLP_TASKS) - DIRECT_DOWNLOAD_TASKS
SUPPORTED_TASKS = NLP_DOWNLOADER_TASKS | XTREME_TASKS | SQUAD_TASKS | DIRECT_DOWNLOAD_TASKS


Expand Down

0 comments on commit 392976c

Please sign in to comment.