We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
我修改了下代码,基于句号对长文本切分以后把多个文本放到一个batch里(扩大了batch_size,原始代码默认为1),但测试发现并行相比串行耗时几乎没有减少。我没太清楚是GPU算子在串行下已达到极限的原因吗?还是其他原因,请问还有什么优化空间吗?
GPU 4070 Windows 10 操作系统 使用300MB-SFT模型
The text was updated successfully, but these errors were encountered:
没研究过加速相关,可以再群里问问
Sorry, something went wrong.
No branches or pull requests
我修改了下代码,基于句号对长文本切分以后把多个文本放到一个batch里(扩大了batch_size,原始代码默认为1),但测试发现并行相比串行耗时几乎没有减少。我没太清楚是GPU算子在串行下已达到极限的原因吗?还是其他原因,请问还有什么优化空间吗?
GPU 4070
Windows 10 操作系统
使用300MB-SFT模型
The text was updated successfully, but these errors were encountered: