
Questions about the paper #1

Open
Gyuyeong opened this issue Sep 1, 2023 · 2 comments

Comments

@Gyuyeong

Gyuyeong commented Sep 1, 2023

After reading the paper, I have some questions:

  1. Is CARP applied to an already fine-tuned LLM like ChatGPT? If so, and if I want to apply this approach to a model that has not been fine-tuned at all (for example, the GPT variants available on Hugging Face), how should I prepare the training data to fine-tune the LLM so that CARP can be applied effectively?
  2. I do not understand what the paper says about how the training set is used. As I understand it, there is a training set, and SimCSE is used to sample some of its examples as few-shot demonstrations (a retrieval sketch follows below). However, I do not see where the training set is used other than for sampling those few-shot examples. Was it used to fine-tune the LLM, or is it only kept around for retrieval?

I apologize if I am asking anything that was already covered in the paper and I missed it. Thank you in advance.
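
For context, here is a minimal sketch (not the paper's code) of how SimCSE-based demonstration retrieval is commonly implemented: embed the training texts and the test input, then take the k nearest training examples as in-context demonstrations. The checkpoint is the public SimCSE model; `train_set`, `retrieve_demos`, and the prompt format are hypothetical placeholders.

```python
# Sketch: kNN demonstration sampling with SimCSE embeddings (assumed workflow).
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

CKPT = "princeton-nlp/sup-simcse-bert-base-uncased"  # public SimCSE checkpoint
tok = AutoTokenizer.from_pretrained(CKPT)
enc = AutoModel.from_pretrained(CKPT)

def embed(texts):
    """Return L2-normalized [CLS] embeddings, the pooling SimCSE uses."""
    batch = tok(texts, padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        cls = enc(**batch).last_hidden_state[:, 0]
    return F.normalize(cls, dim=-1)

def retrieve_demos(train_set, query, k=8):
    """Pick the k training pairs most similar to the query.

    Note there is no gradient update anywhere: the training set acts
    purely as a retrieval pool for in-context demonstrations.
    """
    texts = [t for t, _ in train_set]
    sims = (embed(texts) @ embed([query]).T).squeeze(1)  # cosine similarity
    top = sims.topk(min(k, len(texts))).indices.tolist()
    return [train_set[i] for i in top]

# Hypothetical usage: assemble a few-shot prompt from the retrieved pairs.
train_set = [("the movie was a delight", "positive"),
             ("a tedious, joyless slog", "negative")]
demos = retrieve_demos(train_set, "an utterly charming film", k=2)
prompt = "\n".join(f"Text: {t}\nLabel: {l}" for t, l in demos)
```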

@PeterXiaTian

After reading the paper, my impression is that the authors did not fine-tune any large model on the training set; they only selected a subset of samples for few-shot prompting. But the sample selection confuses me: for example, SimCSE is used to select texts similar to the query, but where do the query samples come from? (Are some labeled samples masked?) And if there are no prior samples at all, how should this be handled?

@pranerd

pranerd commented Feb 1, 2024

Table 4 in the paper shows that as the training set grows (16, 128, 256, 512, 1024 examples), CARP's accuracy increases. How is the larger training set used? Was the LLM fine-tuned on it?
[screenshot of Table 4 from the paper]
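
If the training set really serves only as a retrieval pool (as suggested above), then growing it from 16 to 1024 examples just enlarges the pool that kNN sampling draws from, and no fine-tuning is involved. A hedged sketch of that reading, reusing the hypothetical `retrieve_demos` helper from the earlier sketch:

```python
# Sketch: Table 4's "training set" sizes read as retrieval-pool sizes.
# Reuses the hypothetical retrieve_demos() defined above; note that no
# fine-tuning step appears anywhere in this loop.
import random

full_train_set = train_set  # placeholder for the full labeled training set

for n in (16, 128, 256, 512, 1024):
    pool = random.sample(full_train_set, min(n, len(full_train_set)))
    demos = retrieve_demos(pool, "an utterly charming film", k=8)
    # A larger pool gives the sampler more candidates, so the retrieved
    # demonstrations tend to be closer to the query, which is one
    # plausible explanation for accuracy rising with pool size.
```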
