Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Questions about the data preprocessing method #4

Open
mingyang-wang26 opened this issue Jul 10, 2023 · 1 comment
Open

Questions about the data preprocessing method #4

mingyang-wang26 opened this issue Jul 10, 2023 · 1 comment

Comments

@mingyang-wang26
Copy link

mingyang-wang26 commented Jul 10, 2023

Hi,

thanks for your excellent paper! I have a questions about the data preprocessing in the implementation code.

A small observation about data processing, for the 'yahoo', 'amazon', 'agnews' and 'dbpedia' dataset, besides the 'content' column, they also include the 'title' (and 'answer' for 'yahoo') column. In the implementation in IDBR and MbPA++, they only use 'content' for training and discard other information such as 'title' and 'answer'. In Progressive Prompts all texts are taken into account, I'm wondering if this will leads to some performance difference and did you compare them?

Many thanks in advance for your clarification!

@JHL328
Copy link

JHL328 commented Jul 26, 2023

Hi, @ddd26

have you addressed the problem above for the value of "select_k_per_class"?

@mingyang-wang26 mingyang-wang26 changed the title Argument "select_k_per_class" should be 16 for T5 experiments as stated in the paper? Questions about the argument "select_k_per_class" and the number of samples used in the experiments Aug 18, 2023
@mingyang-wang26 mingyang-wang26 changed the title Questions about the argument "select_k_per_class" and the number of samples used in the experiments Questions about the data preprocessing method Oct 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants