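Hello, after reading the context_manager code I have a few questions. Could you help answer them?
(The code has very few comments, so at first glance it is hard to tell what the parameters mean.)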
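The order has no effect on the result; the computations are equivalent. global_h_q has already been rotated, and leaving global_h_k without RoPE is equivalent to rotating it by 0 degrees. If you only need code that corresponds to the algorithm in the paper, the initial version is easier to read; the current version has been optimized for performance.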
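To illustrate the point about the 0-degree rotation, here is a minimal sketch in plain Python. It is not the repository's code: `rotate`, `dot`, and the toy vectors `q` and `k` are hypothetical names standing in for the RoPE operation, `global_h_q`, and `global_h_k`, and the adjacent-pair rotation layout and `base=10000.0` are assumptions.

```python
import math

def rotate(vec, position, base=10000.0):
    """Apply a RoPE-style rotation to `vec` (even length) at `position`.

    Hypothetical helper: rotates each adjacent pair of components by an
    angle proportional to `position`.
    """
    dim = len(vec)
    out = []
    for i in range(0, dim, 2):
        theta = position * base ** (-i / dim)
        c, s = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out.extend([x * c - y * s, x * s + y * c])
    return out

def dot(a, b):
    """Plain dot product, standing in for one attention score."""
    return sum(x * y for x, y in zip(a, b))

q = [0.3, -1.2, 0.7, 0.5]   # toy stand-in for global_h_q
k = [1.1, 0.4, -0.6, 0.9]   # toy stand-in for global_h_k

# Position 0 means a rotation of 0 degrees, so the key is left unchanged.
k_unrotated = rotate(k, 0)

# Rotating only the query by a relative distance d gives the same score as
# rotating both vectors at absolute positions whose difference is d.
d = 5
score_query_only = dot(rotate(q, d), k)
score_both = dot(rotate(q, 12), rotate(k, 7))
```

Up to floating-point error, `score_query_only` and `score_both` agree, which is why skipping RoPE on the key side and rotating only the query is an equivalent computation.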
OK, thanks! Have you run any experiments that apply a rotation to global_h_k?
Not so far, because with ReRoPE-style length extension the same rotation angle should be used. You are welcome to try other rotation methods.
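As a rough illustration of what "the same rotation angle" can mean, the sketch below caps the relative distance at a window size, so every key beyond the window shares one distance and therefore one rotation. This reflects a general reading of ReRoPE-style position clipping, not code from this repository; `clipped_relative_position`, `window`, and the example values are hypothetical.

```python
def clipped_relative_position(query_pos, key_pos, window):
    """Relative distance between query and key, capped at `window`.

    Hypothetical helper: keys farther away than `window` all receive the
    same distance, hence the same rotation angle.
    """
    return min(query_pos - key_pos, window)

window = 4
query_pos = 10

# Distances from the query at position 10 to every key at positions 0..10.
distances = [
    clipped_relative_position(query_pos, key_pos, window)
    for key_pos in range(query_pos + 1)
]
```

Here the keys at positions 0 through 6 all end up with distance 4, while the four nearest keys keep their true distances 3, 2, 1, and 0.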