Skip to content

[Bug]: Qwen2.5-7B-Instruct 支持 128K tokens,为啥 config 里面写 32k呢,如果我基于qwen2.5 要训练一个大于 32k 的模型,需要怎么做呢 #1134

Unanswered
tao-githup asked this question in Q&A
Discussion options

You must be logged in to vote

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@tao-githup
Comment options

@jklj077
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants
Converted from issue

This discussion was converted from issue #1133 on December 17, 2024 12:23.