Docker run error: multiproc_worker_utils.py:226] RuntimeError: Cannot re-initialize CUDA in forked subprocess. To use CUDA with multiprocessing, you must use the 'spawn' start method #305
Comments
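Does it start on a single GPU?
With a single GPU the error above no longer appears, but there is a new one: it reports insufficient GPU memory. The odd part is that I have a 4090 with 24 GB of VRAM, and the VRAM is empty. No services are running and nothing is occupying it.
Deploying the LLM on its own works fine, but deploying the LLM and the embedding model together triggers this error. I opened a new issue: #308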
The following items must be checked before submission
Type of problem
Model inference and deployment
Operating system
Linux
Detailed description of the problem
Deploying Qwen2-7B-Instruct with docker-compose on Ubuntu fails with an error.
The key information is:
RuntimeError: Cannot re-initialize CUDA in forked subprocess. To use CUDA with multiprocessing, you must use the 'spawn' start method
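For context, the message refers to Python's multiprocessing start methods: a child created with `fork` inherits the parent's already-initialized CUDA state and cannot initialize it again, while a child created with `spawn` starts in a fresh interpreter. Below is a minimal standard-library sketch of what "use the 'spawn' start method" means. It is not this project's code: `spawn_context`, `worker`, and `run_workers` are hypothetical names, and the spot where a real worker would touch the GPU is only marked with a comment.

```python
import multiprocessing as mp


def spawn_context():
    # Request a "spawn" context explicitly instead of relying on the
    # platform default (which is "fork" on most Linux setups).
    return mp.get_context("spawn")


def worker(rank: int) -> None:
    # Hypothetical placeholder: a real worker would initialize CUDA here,
    # inside the child process, never in the parent before the child starts.
    print(f"worker {rank} started")


def run_workers(num_workers: int = 2, timeout: float = 30.0):
    ctx = spawn_context()
    procs = [ctx.Process(target=worker, args=(rank,)) for rank in range(num_workers)]
    for proc in procs:
        proc.start()
    for proc in procs:
        # Bounded wait so a stuck child cannot block the parent forever.
        proc.join(timeout)
    # An exit code of 0 means the child finished cleanly.
    return [proc.exitcode for proc in procs]


if __name__ == "__main__":
    # The guard is required with "spawn": each child re-imports this module,
    # and unguarded top-level code would try to start workers again.
    print(run_workers())
```

Whether the start method can be changed from outside the container depends on the serving stack in use, which the log excerpt alone does not show.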
Dependencies
Runtime logs or screenshots