added Qwen2.5 to generate QA pairs. #96
Merged
Integrate Qwen2.5 7B Model for Question Generation
Changes
Rationale
The Qwen2.5 7B model provides more advanced question generation capabilities than the previous T5 model. By generating questions only, without answers, we streamline the process for scenarios where RAG is not performed end-to-end.
How to Run
Ensure you have the required dependencies installed:
Place your `knowledge_dataset.csv` file in the same directory as the script. A mock dataset is already included, so you can run the script without supplying your own.
Run the script with the following command:
Adjust the `batch_size` and `sample_size` as needed. The `output_dir` specifies where the generated questions will be saved.
The script will process the dataset, generate questions, and save the results in the specified output directory.
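As a rough illustration of how `batch_size`, `sample_size`, and `output_dir` might fit together, here is a minimal sketch. It is not the script from this PR: the function names, the `text` column, the prompt wording, and the `generated_questions.csv` file name are all assumptions made for the example. The model call is replaced by a stand-in callable so the sketch runs without downloading Qwen2.5.

```python
import csv
import random
from pathlib import Path
from typing import Callable, Iterator, List


def read_passages(csv_path: str, column: str = "text") -> List[str]:
    """Load non-empty passages from one CSV column (the column name is an assumption)."""
    with open(csv_path, newline="", encoding="utf-8") as f:
        return [row[column] for row in csv.DictReader(f) if row.get(column)]


def batched(items: List[str], batch_size: int) -> Iterator[List[str]]:
    """Yield consecutive slices of at most batch_size items."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def build_prompt(passage: str) -> str:
    """Hypothetical question-only prompt; the PR's actual prompt may differ."""
    return (
        "Write one question that the following passage answers.\n\n"
        f"Passage: {passage}\nQuestion:"
    )


def generate_questions(
    csv_path: str,
    output_dir: str,
    generate: Callable[[List[str]], List[str]],
    batch_size: int = 8,
    sample_size: int = 100,
    seed: int = 0,
) -> Path:
    """Sample passages, generate one question per passage in batches, save as CSV."""
    passages = read_passages(csv_path)
    # Draw at most sample_size passages; a fixed seed keeps runs reproducible.
    sample = random.Random(seed).sample(passages, min(sample_size, len(passages)))

    out_dir = Path(output_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / "generated_questions.csv"  # assumed file name

    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["passage", "question"])
        for batch in batched(sample, batch_size):
            # `generate` stands in for the model: a list of prompts in,
            # a list of generated questions out, in the same order.
            questions = generate([build_prompt(p) for p in batch])
            for passage, question in zip(batch, questions):
                writer.writerow([passage, question.strip()])
    return out_path
```

In this sketch, `generate` is where a wrapper around the Qwen2.5 7B model would be plugged in. Passing a simple function such as `lambda prompts: ["?" for _ in prompts]` is enough to check the sampling, batching, and output logic before involving the model.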
Notes