Qwen2.5 Instruct uses a multi-turn ChatML format. During SFT, do the position ids need to be reset for each turn? #1119

Closed Answered by jklj077
zzzm83 asked this question in Q&A
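No. The position ids run from 0 to the end of the whole sequence (0...N), without a reset at each turn.
You generally don't need to specify them manually. In addition, Qwen uses RoPE, a relative position encoding, so the absolute position values matter little.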
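
To illustrate the point about RoPE, here is a minimal sketch in plain Python. It is not Qwen's actual implementation: the `rope` helper, the turn lengths, and the toy query/key vectors are all made up for the example, and the rotation pairs adjacent dimensions for simplicity. It shows that position ids simply count up across all turns, and that the attention score between a rotated query and key depends only on their offset, not on where they sit in the sequence.

```python
import math

def rope(vec, pos, base=10000.0):
    """Rotate each adjacent pair of `vec` by an angle proportional to `pos`.

    Simplified, illustrative RoPE; real implementations may pair
    dimensions differently, but the relative-offset property is the same.
    """
    dim = len(vec)
    out = []
    for i in range(0, dim, 2):
        theta = pos * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out += [x * c - y * s, x * s + y * c]
    return out

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

# Hypothetical token counts for three ChatML turns in one training sample.
turn_lengths = [5, 7, 4]
# One continuous range over the whole sample, no per-turn reset.
position_ids = list(range(sum(turn_lengths)))

# Toy query/key vectors (assumed values, 4 dimensions).
q = [0.3, -1.2, 0.8, 0.5]
k = [1.0, 0.4, -0.7, 0.2]

# Same offset (3) at two very different absolute positions.
score_a = dot(rope(q, 9), rope(k, 6))
score_b = dot(rope(q, 109), rope(k, 106))
print(abs(score_a - score_b) < 1e-9)
```

Because the two scores agree up to floating-point error, shifting every position by a constant leaves attention unchanged, which is why the absolute starting value is not critical.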

Answer selected by zzzm83