Add HuffPost #750

KhalidAlt · 2022-04-27T13:27:09Z

No description provided.

jon-tow · 2022-04-28T04:32:35Z

Hey, @KhalidAlt . Is there any chance you could add all the categories (listed here) to the answer_choices fields so that rank/accuracy scoring could easily be performed? Thanks!

KhalidAlt · 2022-04-28T08:21:32Z

I add choices_in_prompt as @jon-tow suggested. However, the categories are listed as string instead of labels and I can not use {{answer_choice[category]}} as suggested in the prompt guide. Therefore, I just wrote {{category}} as the target. Does it still work correctly with rank/accuracy regardless of this issue or should I edit the dataset to include categories as labels instead of just string.

thinkzink · 2022-04-28T13:28:42Z

I think the following could be reworded to make the description more separable from the surrounding text but I think the others look fine: 'Given the following article headline: "{{headline}}" and the following
short passage "{{short_description}}" What is the category of this news report?
||| {{category}}'

For consistency it would be important to edit the dataset for that {{answer_choice[category]}} ability in including categories as labels.

cjlovering · 2022-05-05T17:37:40Z

@thinkzink @KhalidAlt It looks like the changes are approved. Should this be merged?

KhalidAlt · 2022-05-06T07:13:55Z

@thinkzink @KhalidAlt It looks like the changes are approved. Should this be merged?

Yes, I think so.

awebson

Thanks for the PR! A few issues left:

None of your prompts have the answer choices in the input, so (a) models or even humans will never know to answer "impact" or "divorce" as a valid news category. (b) This may be mitigated by rank classification of the answer choices, but it's far from guaranteed. At the very lest, you need to set the choices_in_prompts accordingly.
Please lowercase or title case your answer choices. Most models are case-sensitive now and are a priori unlikely to generate all caps vocabulary.
The classes are imbalanced, so you probably don't really want to report naive accuracy. Fine for now, but please pay attention in you evaluation.
What is the file change of promptsource/templates/tydiqa/primary_task/.templates.yaml.swp?

awebson · 2022-05-09T01:26:31Z

promptsource/templates/khalidalt/HuffPost/templates.yaml

+      NEWS|||  ARTS & CULTURE|||  ENVIRONMENT|||  COLLEGE|||  LATINO VOICES|||  CULTURE
+      & ARTS|||  EDUCATION
+    id: 19d2449b-1a52-4079-aaec-301517ee6ee1
+    jinja: 'What can you infer from the headline and short description given about


This is ungrammatical. How about "Given these headline and description, what category can you infer this piece of news belongs to?"

Thanks for the PR! A few issues left:

1. None of your prompts have the answer choices in the input, so (a) models or even humans will never know to answer "impact" or "divorce" as a valid news category. (b) This may be mitigated by rank classification of the answer choices, but it's far from guaranteed. At the very lest, you need to set the `choices_in_prompts` accordingly. 2. Please lowercase or title case your answer choices. Most models are case-sensitive now and are a priori unlikely to generate all caps vocabulary. 3. The classes are imbalanced, so you probably don't really want to report naive accuracy. Fine for now, but please pay attention in you evaluation. 4. What is the file change of `promptsource/templates/tydiqa/primary_task/.templates.yaml.swp`?

1- I can add answer choices as list in the input as follow:

Given the following news headline and short description: {{headline}} {{short_description}} What is a correct category class from the following list {{answer_choices}} ? ||| {{answer_choices[label]}}

However, It may seem unnatural to use this style with humans. Also, I set up choices_in_prompt to false.

2- Fixed. I titled case all answer choices.

3- I will pay attention for this in the evaluation code. Thank you!

4- Fixed. It was a mistake and I removed the file.

promptsource/templates/khalidalt/HuffPost/templates.yaml

awebson

Thank you!

I can add answer choices as list in the input as follow:

Given the following news headline and short description: {{headline}} {{short_description}} What is a correct category class from the following list {{answer_choices}} ? ||| {{answer_choices[label]}}

However, It may seem unnatural to use this style with humans.

I think it's pretty natural to ask people and models something like

choose from the list of options below:

politics

sports

...

But I will leave it up to you. In our experience, including the answer choices could either make a huge difference or nothing at all. You should definitely try it both ways in your evaluation. Let me know if you want me to merge this now or wait until you include some prompts with answer choices.

… add_HuffPost update

KhalidAlt · 2022-05-13T19:27:36Z

Now it is done! I added answer_choices in 3 of the prompts and kept two without any answer_choices in the prompts. Please review last commits and merge it if everything is OK.

awebson · 2022-05-14T02:20:16Z

Beautiful. Thanks so much!

KhalidAlt added 3 commits April 27, 2022 11:42

add username to INCLUDED_USERS list

5313590

add 5 prompts

75f6920

change prompt name

f45ca4a

KhalidAlt changed the title ~~Add huff post~~ Add huffPost Apr 27, 2022

KhalidAlt changed the title ~~Add huffPost~~ Add HuffPost Apr 27, 2022

awebson assigned thinkzink Apr 27, 2022

add choices_in_prompt and fix minor issues

2b89499

KhalidAlt mentioned this pull request Apr 28, 2022

Add HuffPo Text Classification to Full Benchmark bigscience-workshop/evaluation#31

Open

add labels and fix a prompt

f133830

KhalidAlt requested a review from thinkzink April 29, 2022 10:46

thinkzink approved these changes May 4, 2022

View reviewed changes

thinkzink approved these changes May 8, 2022

View reviewed changes

awebson requested changes May 9, 2022

View reviewed changes

awebson self-assigned this May 9, 2022

KhalidAlt added 3 commits May 9, 2022 16:44

title case answer choices

7ec85d5

fix grammatical error

c177727

add other metrics

2e80514

KhalidAlt force-pushed the add_HuffPost branch from b5bf556 to 2e80514 Compare May 9, 2022 15:36

remove file

b1c924a

KhalidAlt force-pushed the add_HuffPost branch from a45a167 to b1c924a Compare May 9, 2022 16:58

KhalidAlt and others added 5 commits May 9, 2022 19:59

fix issues

c861f0a

Merge branch 'eval-hackathon' into add_HuffPost

6e32c60

Update templates.py

5157a94

Update templates.py

d82c88a

set choices_in_prompt to false

dd3f5bb

KhalidAlt added 2 commits May 10, 2022 18:16

remove unmerged

c8efc7a

add file

c4d70d4

KhalidAlt requested a review from awebson May 11, 2022 13:54

awebson approved these changes May 12, 2022

View reviewed changes

KhalidAlt and others added 5 commits May 13, 2022 19:43

add answer_choices in the prompts

a5bba7e

fix minor issues

145c1c5

Merge branch 'eval-hackathon' into add_HuffPost

2be8d00

make prompts more natural

9023175

Merge branch 'add_HuffPost' of github.com:KhalidAlt/promptsource into…

cd6391c

… add_HuffPost update

awebson merged commit 4f0051f into bigscience-workshop:eval-hackathon May 14, 2022

KhalidAlt deleted the add_HuffPost branch May 14, 2022 07:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add HuffPost #750

Add HuffPost #750

KhalidAlt commented Apr 27, 2022

jon-tow commented Apr 28, 2022

KhalidAlt commented Apr 28, 2022 •

edited

Loading

thinkzink commented Apr 28, 2022

cjlovering commented May 5, 2022

KhalidAlt commented May 6, 2022 •

edited

Loading

awebson left a comment

awebson May 9, 2022

KhalidAlt May 9, 2022 •

edited

Loading

awebson left a comment

KhalidAlt commented May 13, 2022 •

edited

Loading

awebson commented May 14, 2022

Add HuffPost #750

Add HuffPost #750

Conversation

KhalidAlt commented Apr 27, 2022

jon-tow commented Apr 28, 2022

KhalidAlt commented Apr 28, 2022 • edited Loading

thinkzink commented Apr 28, 2022

cjlovering commented May 5, 2022

KhalidAlt commented May 6, 2022 • edited Loading

awebson left a comment

Choose a reason for hiding this comment

awebson May 9, 2022

Choose a reason for hiding this comment

KhalidAlt May 9, 2022 • edited Loading

Choose a reason for hiding this comment

awebson left a comment

Choose a reason for hiding this comment

KhalidAlt commented May 13, 2022 • edited Loading

awebson commented May 14, 2022

KhalidAlt commented Apr 28, 2022 •

edited

Loading

KhalidAlt commented May 6, 2022 •

edited

Loading

KhalidAlt May 9, 2022 •

edited

Loading

KhalidAlt commented May 13, 2022 •

edited

Loading