share sft-dataset #1

yyht · 2024-06-28T04:27:13Z

hello, nice work. could share the sft-dataset in hf?

X-Lai · 2024-06-28T05:49:46Z

Sure, it will be released soon. Please stay tuned.

yapdianang · 2024-07-10T23:41:03Z

Hi authors, following up on this thread to stay updated when the SFT datasets are released. Thanks and nice work!

kleinzcy · 2024-08-19T03:10:02Z

Hi authors, it is a nice work to advance the off-policy method for enhancing reason ability of LLM. I am following up on this thread to stay updated when the SFT datasets are released. Thanks!

yyht · 2024-08-20T02:53:06Z

hello everyone, https://huggingface.co/datasets/yingyingzhang/metamath-qwen2-math .
I use qwen2-math-instruct and open-source-datasets such as metamath-qa and numina-cot to construct a high quality sft-dataset.
When finetuning on qwen2-general-base or qwen2-math-base, the sft model could achieve comparable results to qwen2-instruct-7b\72b and qwen2-math-7b-instruct.
The whole datasets contains metamath-qwen2-math and none-synthetic datasets from https://huggingface.co/datasets/AI-MO/NuminaMath-CoT.
Please enjoy it.

Claude121381011 mentioned this issue Aug 29, 2024

I followed the steps in the README file to train the model, but I got an error. Here is the error message. #16

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

share sft-dataset #1

share sft-dataset #1

yyht commented Jun 28, 2024

X-Lai commented Jun 28, 2024

yapdianang commented Jul 10, 2024

kleinzcy commented Aug 19, 2024

yyht commented Aug 20, 2024

share sft-dataset #1

share sft-dataset #1

Comments

yyht commented Jun 28, 2024

X-Lai commented Jun 28, 2024

yapdianang commented Jul 10, 2024

kleinzcy commented Aug 19, 2024

yyht commented Aug 20, 2024