Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How did you get instructions to train Base Decoder on Chart Perception task #11

Open
nguyenquangtan opened this issue Apr 11, 2024 · 2 comments

Comments

@nguyenquangtan
Copy link

Hi,
Congratulations on your great work! I am curious about how you created the instructions for the Chart Perception task since the component datasets included in this task (e.g. Chart2Text) do not originally come with the instructions. Did you set one fixed instruction for all samples or utilize a generative model to generate multiple instructions? If it is the latter case, did you generate different instructions for each sample or construct a set of instructions, like about 1000, and use them alternately?

Thank you for your attention to this matter.

@renqiux0302
Copy link
Contributor

Hi @nguyenquangtan,

We set one fixed instruction for all summarization samples during traing auxiliary decoder.
However, to adapt various user instructions, we trained the instruction adapter to classify tasks according to user instructions. Hence, the initial input to the model (instruction adapter) is various user instruction, but the input to the auxiliary decoder is fixed task-specific instruction.

Best regards

@nguyenquangtan
Copy link
Author

Hi @renqiux0302,
Thank you for your thoughtful replies! May I ask about the prompts that you use to generate the data for the instruction adapter?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants