-
Notifications
You must be signed in to change notification settings - Fork 275
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CLEVA Schema #1864
Merged
Merged
CLEVA Schema #1864
Changes from 228 commits
Commits
Show all changes
229 commits
Select commit
Hold shift + click to select a range
b063368
Add CLEVA text classification task
lyy1994 6b3658a
Add CLEVA opinion mining task
lyy1994 96b7b15
Add Pinyin Transliteration Task and Two Related Subtasks
Jianqiao-Zhao 0b5c4b1
Amend requirements-freeze.txt
Jianqiao-Zhao 1544260
1. Add chinese_bleu_1 Metric in BasicMetric
Jianqiao-Zhao 5e50966
Amend Pinyin Transliteration max_tokens
Jianqiao-Zhao 145f914
Fix Typo and Add Examples for CLEVAPinyinTransliterationScenario
Jianqiao-Zhao 64141ac
Add Classical Chinese Understanding Scenario
Jianqiao-Zhao d45fc97
Add Sentiment Analysis Scenario
Jianqiao-Zhao 0a770e8
Add Instruction Following Scenario
Jianqiao-Zhao 9835f38
Merge pull request #1 from lyy1994/pinyin
lyy1994 bb7c088
Add CLEVA robustness perturbations
lyy1994 3ee04ba
Fix Bug & Clean up
Jianqiao-Zhao cebb101
Put name in to CLEVAScenario to avoid creating unnecessary directory
Jianqiao-Zhao db4c953
Merge pull request #5 from lyy1994/scenarios/instruction_following
lyy1994 0a00421
Merge branch 'stanford-crfm:main' into main
lyy1994 872e810
Add Disinformation Scenario
Jianqiao-Zhao b1c0104
Merge pull request #4 from lyy1994/perturbations/robustness
Jianqiao-Zhao 26b0e86
Load Prompt Setting From File
Jianqiao-Zhao c1bb31c
Fix Python style
lyy1994 ac61947
Update style
lyy1994 25a53b0
Fix checking
lyy1994 d778690
Merge pull request #8 from lyy1994/hotfix/style
Jianqiao-Zhao 12d8329
Add Gender Perturbation
lyy1994 4716053
1. Merge branch 'main' to pass the auto checks
Jianqiao-Zhao ac6754a
Fix checking
Jianqiao-Zhao fa8bbbf
Fix checking
Jianqiao-Zhao 43e9815
Fix checking
Jianqiao-Zhao 46f0a18
Merge pull request #7 from lyy1994/load_prompt_templates
lyy1994 b759579
Initialize Translation Scenario
Jianqiao-Zhao a98aa4f
Implement ClevaMachineTranslationMetric to support Chinese and variab…
Jianqiao-Zhao 6d57bbd
Add Chinese person name perturbation
lyy1994 e68edf0
Add Simplified to Traditional perturbation
lyy1994 b7e70c9
Add Mandarin to Cantonese perturbation
lyy1994 3372a2a
Add CLEVA data augmentation setup
lyy1994 7ab1bf6
Update type annotation
lyy1994 387fd26
1. Cleva -> CLEVA in translation metric
Jianqiao-Zhao 7047cc1
Merge pull request #9 from lyy1994/scenarios/translation
lyy1994 854555b
Merge branch 'main' into perturbations/fairness
Jianqiao-Zhao 1c45137
Merge pull request #10 from lyy1994/perturbations/fairness
zd11024 2f30a44
Add Intent Understanding Scenario
Jianqiao-Zhao 6cfaa79
Replace hardcoded file names
lyy1994 04ca882
Add CLEVA harms metrics
lyy1994 5c13686
Rename
lyy1994 66442f8
Minor Debug
Jianqiao-Zhao d086460
Minor Debug
Jianqiao-Zhao 9ae9b64
Merge pull request #12 from lyy1994/scenarios/intent_understanding
Jianqiao-Zhao c179d32
Add Coreference Resolution Scenario
Jianqiao-Zhao 78e97fc
Minor Debug
Jianqiao-Zhao 0025efc
Add Dialog Generation Scenario
zd11024 0d14c42
Add Subject Knowledge Scenario
zd11024 94acb9a
Merge branch 'main' into add_dialog_scenario
zd11024 699a9a3
Merge pull request #11 from lyy1994/metrics/harms
zd11024 ec95b0d
Remove extra knowledge instances
zd11024 b830464
Overwrite get_instances for Dialog Scenario
zd11024 69222d4
Truncate labels for Subject Knowledge
zd11024 1e8547a
Merge branch 'main' into add_dialog_scenario
zd11024 cd801aa
Merge pull request #14 from lyy1994/scenarios/dialogue_generation
lyy1994 babcc84
Add CLEVA cultural knowledge scenario
lyy1994 331ea7c
Fix style
lyy1994 565422d
Add Reading Comprehension Scenario
Jianqiao-Zhao 8763099
Add summarization and closed-book QA scenarios
zd11024 e1aa830
Merge pull request #15 from lyy1994/scenarios/cultural_knowledge
Jianqiao-Zhao be29c69
Minor Improvements After Review
Jianqiao-Zhao 634f48d
Merge branch 'main' into scenarios/coreference_resolution
Jianqiao-Zhao 19f5ef1
Merge pull request #13 from lyy1994/scenarios/coreference_resolution
Jianqiao-Zhao 0d82bb7
Merge branch 'main' into scenarios/reading_comprehension
Jianqiao-Zhao 3620568
Minor Debug
Jianqiao-Zhao 4e824ea
Merge pull request #16 from lyy1994/scenarios/reading_comprehension
lyy1994 226d16e
add toxicity_detection & paraphrase_generation
HenryHZY 66c15fc
Merge branch 'main' into scenarios/toxicity_detection
HenryHZY 6813fab
Reformat files
HenryHZY 445ab6d
Add annotations for Subject Knowledge datasets
zd11024 f04dbb3
Add chinese_rouge_2 metric
zd11024 9e6b991
Merge branch 'main' into scenarios/closed_book_qa
zd11024 ad00afb
Merge branch 'stanford-crfm:main' into main
lyy1994 b6ce87b
Merge pull request #17 from lyy1994/scenarios/closed_book_qa
lyy1994 f4c8858
Update class CLEVAToxicityDetectionScenario
HenryHZY 22d4e3f
Add Paraphrase Identification Scenario
Jianqiao-Zhao d7f27fc
Minor Debug
Jianqiao-Zhao 5064e7b
Merge branch 'main' into scenarios/paraphrase_identification
Jianqiao-Zhao 327df88
Merge pull request #20 from lyy1994/scenarios/paraphrase_identification
lyy1994 34f9489
Fix class CLEVAParaphraseGenerationMetric
HenryHZY 126c9ad
minor update for paraphrase generation
HenryHZY 3102ba9
Add mathematic calculation and inductive reasoning
zd11024 0f20c57
minor update for paraphrase generation
HenryHZY 29dd0e3
Merge pull request #18 from lyy1994/scenarios/toxicity_detection
lyy1994 3c00bfd
Add Bias Scenario and Related Subtasks
Jianqiao-Zhao 87a2b98
Implement Population_F1 for Multiple Choice Adapter
Jianqiao-Zhao a64737f
Improve Classification Logic
Jianqiao-Zhao 3b6aac0
Cleanup
Jianqiao-Zhao e2aa3f9
Add reasoning primitive scenario
zd11024 5b8fc01
Merge branch 'main' into scenarios/reasoning
zd11024 43b3c2c
Update Subtask Names
Jianqiao-Zhao 151267e
Merge pull request #21 from lyy1994/scenarios/bias
lyy1994 37f7257
Add CLEVA reasoning scenarios
lyy1994 8db3ac1
Add Deductive Reasoning Scenario and One Related Subtask
Jianqiao-Zhao bd05af6
Merge branch 'stanford-crfm:main' into main
lyy1994 347e2a9
Minor Improvements for Review
Jianqiao-Zhao 919a431
Add CLEVA copyright scenario
lyy1994 99f395c
Fix typos
lyy1994 0abced5
Merge pull request #23 from lyy1994/deductive_reasoning
lyy1994 c7f2c85
Merge branch 'main' into scenarios/commonsense_reasoning
Jianqiao-Zhao 82e1b38
Merge pull request #22 from lyy1994/scenarios/commonsense_reasoning
Jianqiao-Zhao b29f99b
Add Chinese copyright metric
lyy1994 f97883a
Merge branch 'main' into scenarios/copyright
lyy1994 cdaff79
Update Chinese tokenizer default value
lyy1994 bc1cd7c
Merge branch 'stanford-crfm:main' into main
lyy1994 42add25
Remove label truncation
zd11024 6f046d7
Merge branch 'main' into scenarios/reasoning
zd11024 ae9b0ad
Add reasoning tag
zd11024 2b91f75
Merge pull request #25 from lyy1994/scenarios/reasoning
zd11024 940811a
Merge pull request #24 from lyy1994/scenarios/copyright
Jianqiao-Zhao 1935bfc
Add Data-to-Text Generation Scenario
Jianqiao-Zhao a2d609f
Add CLEVA to HELM prompt conversion
lyy1994 7c347b6
Fetch inference parameters online
lyy1994 d01eca9
Reformat
lyy1994 c696945
Add Mathematical Reasoning Scenario and Related Metrics
Jianqiao-Zhao c510861
Minor Debug
Jianqiao-Zhao c2ce65b
Add Language Modeling Scenario
Jianqiao-Zhao 12719f6
Minor Debug
Jianqiao-Zhao 7b46000
Minor Adjustments after Review Discussion
Jianqiao-Zhao e926f8d
Merge pull request #27 from lyy1994/scenarios/d2t_maths_language
lyy1994 f769fd0
Merge pull request #26 from lyy1994/adaptation
lyy1994 1a4368a
Fix bugs
lyy1994 e66c15e
Add CLEVA paraphrase generation multiple prompts
lyy1994 d36ed3b
Fix typo
lyy1994 2ec2a97
Merge pull request #28 from lyy1994/prompts/paraphrase
Jianqiao-Zhao 0dbe02d
Adjust prompts for all multiple-choice scenarios
Jianqiao-Zhao d5e5d2a
1. Adjust all the generation scenarios to use CLEVA prompts
Jianqiao-Zhao d21cc1a
Minor Improvements After Review Discussion
Jianqiao-Zhao 4247a59
Minor Debug
Jianqiao-Zhao 0224f60
Adapt CLEVACodeSynthesisScenario to HELM Style
Jianqiao-Zhao 05d4635
1. Fix Mathematical Reasoning Scenario Prompt Issue in Test Split Cau…
Jianqiao-Zhao fb99b8f
Make CLEVA dependency optional
lyy1994 19179fe
Handle optional dependency error
lyy1994 60a3188
Handle optional dependency
lyy1994 105be1d
1. Fix Mathematical Reasoning Testing Labels
Jianqiao-Zhao 9649dcc
Merge pull request #30 from lyy1994/dependency
Jianqiao-Zhao c57ec27
Update process_instance in Language Modeling and Paraphrase Generatio…
Jianqiao-Zhao dfbb30e
Merge pull request #29 from lyy1994/adjust_prompts
lyy1994 581e00a
Make CLEVAScenario an abstract class
lyy1994 a56755b
Minor debug
lyy1994 fbfa44e
Merge pull request #31 from lyy1994/abstract
Jianqiao-Zhao 0a341dd
Add financial_question subtask for CLEVAParaphraseIdentificationScenario
Jianqiao-Zhao 43e4d54
Add KeyphraseExtractionScenario and other subtasks
Jianqiao-Zhao 771e0cc
Fix task-subtask conflicts
lyy1994 c8f647d
Fix dialogue processing
zd11024 adeee53
Merge branch 'scenarios/dialogue' into increase_data
zd11024 f81d01f
Add More data
Jianqiao-Zhao 43a605a
Save working progress
Jianqiao-Zhao 59fe8d0
Merge remote-tracking branch 'origin/increase_data' into increase_data
Jianqiao-Zhao 5b59ccb
Increase to Full-Scale Data
Jianqiao-Zhao 3972aec
Minor debug
lyy1994 f57c944
Merge pull request #32 from lyy1994/increase_data
lyy1994 6599f07
Fix converter bug
zd11024 20e53cb
Merge pull request #33 from lyy1994/hotfix/converter
lyy1994 3a0e7a2
Fix bugs
lyy1994 e38a265
Merge pull request #34 from lyy1994/hotfix/minor
Jianqiao-Zhao 5509035
Update asset links
lyy1994 7ffbd6b
Update data link
lyy1994 4988330
Merge pull request #35 from lyy1994/update_link
lyy1994 fa092f6
Optimize Some Names
Jianqiao-Zhao fa3904d
Add Type Annotations to Perturbation Functions
Jianqiao-Zhao f2e5ce0
Make mypy Happy
Jianqiao-Zhao 82f6a20
Refactor get_words_with_similar_pinyin in ChineseTyposPerturbation
Jianqiao-Zhao e350a89
Include rare_char_prob, consider_tone, and word_level_perturb in Chin…
Jianqiao-Zhao 64a0f15
Add One Missing Type Annotation
Jianqiao-Zhao f6f82d4
Simplify data structure
lyy1994 52481d9
Merge pull request #36 from lyy1994/minor_improvement
Jianqiao-Zhao c22396d
1. Optimize ChineseTokenizer structure in cleva_harms_metric.py and b…
Jianqiao-Zhao 6e2b142
Merge pull request #37 from lyy1994/minor_improvement
lyy1994 572c72d
Simplify code structure; Increase readability; Refine comments and me…
Jianqiao-Zhao 806f034
Add CLEVA RunSpecs
lyy1994 c16b911
Fix Type Annotations
Jianqiao-Zhao 00d3b07
Merge remote-tracking branch 'origin/scenario_improvement' into scena…
Jianqiao-Zhao b080817
Fix typos
Jianqiao-Zhao 1979ddc
Add more assertions to ensure correct type
Jianqiao-Zhao 350e520
Continue add type annotations
Jianqiao-Zhao 5f24185
Fix type annotations
lyy1994 92f38af
Merge pull request #38 from lyy1994/scenario_improvement
lyy1994 acbc061
Add more comments
lyy1994 cbd4197
Add one more example
lyy1994 3c737c1
Replace examples
lyy1994 3ad2db6
Revise examples
lyy1994 05e7358
Merge pull request #39 from lyy1994/template_improvement
lyy1994 858be52
Fix incorrect assert
lyy1994 e4fd565
Update from `full_functionality_text` to `text`
lyy1994 997dcf0
Minor debug
lyy1994 f7c0576
Merge pull request #40 from lyy1994/debug
Jianqiao-Zhao ea34b43
Minor update for clearer presentation
lyy1994 2e562f6
Minor debug
Jianqiao-Zhao 5877b19
Rephrase a comment
Jianqiao-Zhao e4d08a0
Merge pull request #41 from lyy1994/minor_improvement
lyy1994 f2d01e2
Resolve upstream conflicts
lyy1994 4cd0f8d
Merge pull request #42 from lyy1994/conflicts
lyy1994 7da19db
Minor fix
lyy1994 e87e1c8
Merge branch 'main' into main
lyy1994 13f280d
Merge branch 'stanford-crfm:main' into main
lyy1994 dd50a9d
Merge branch 'stanford-crfm:main' into main
lyy1994 2ec19cb
Add CLEVA accuracy metric descriptions
lyy1994 9cfc54f
Minor update
lyy1994 f7d7643
Update metric groups
lyy1994 125c639
Update taxonomy & metric_groups
lyy1994 43fb081
Reorganize
lyy1994 ec2d535
Update CLEVA harms, others and a few scenarios
lyy1994 95f3de4
Update CLEVA language, knowledge, reasoning schema
lyy1994 1f5c56f
Correct some fields
lyy1994 309a7e9
Complete All Schema in schema.yaml
Jianqiao-Zhao 9b30bf2
Refine schema.yaml
Jianqiao-Zhao ac5f025
Merge pull request #43 from lyy1994/schema
Jianqiao-Zhao d8e68cb
Update All Download Links
Jianqiao-Zhao a3270ae
Minor Debug
Jianqiao-Zhao ec1aed2
Minor Debug
Jianqiao-Zhao 1b33806
Add Type Annotation
Jianqiao-Zhao 5c33af0
Update One File Name
Jianqiao-Zhao da63e61
Merge pull request #44 from lyy1994/download_link
lyy1994 3d53210
Update schema.yaml
lyy1994 008160c
Update run_specs.py
lyy1994 fd3a2e8
Correct an error
lyy1994 06308af
Merge pull request #45 from lyy1994/schema
lyy1994 ed42663
Merge branch 'stanford-crfm:main' into main
lyy1994 2281fd5
Fix most comments
lyy1994 4b318ee
Reformat
lyy1994 e57ef22
Add data collection time
lyy1994 b913851
Merge pull request #46 from lyy1994/fix_schema
Jianqiao-Zhao 521bfdb
Fix Typos: math_world_problem -> math_word_problem
Jianqiao-Zhao 5e0ddd5
Merge pull request #47 from lyy1994/fix_typos
Jianqiao-Zhao 52d2c69
Update metric name
lyy1994 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In general, it is better to have this be a domain name due to (1) HTTPS security and (2) it's easier to lose an allocated IP address than a domain name. Is there any plan to change the URL back to a HTTPS domain name?
For instance, these assets could be hosted in a GitHub repository.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Because of the Great Firewall, Google Drive is not accessible in China. For this reason, and to ensure our project's long-term stability, we have rented a cloud server with a two-year contract. This means that this IP address and related service will remain stable in the foreseeable future.
In the meantime, we are applying for a domain and an SSL certificate. We estimate the process should be completed within 2 weeks. Immediately after, we will update the URLs to use the new HTTPS domain name.
Please let me know how you feel about this plan. We truly value your suggestions. Thank you!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for sharing your plans. Sounds good to me.