-
Notifications
You must be signed in to change notification settings - Fork 8.8k
Issues: deepseek-ai/DeepSeek-R1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
How to reproduce deepseek R1-671b model performance on AIME2024
#336
opened Feb 8, 2025 by
xiaoycolor
How does the Deepseekr1-zero model distillation work with base models like LLaMA or Qwen?
#335
opened Feb 8, 2025 by
QuantaoYao
Potential Memory Utilization Issue with vLLM Deployment of DeepSeek-R1-AWQ Model in Long Context Scenarios
#334
opened Feb 8, 2025 by
le0820
Where can I download the dataset and training weights of DeepSeek - R1?
#324
opened Feb 8, 2025 by
lizhichao999
请问R1训练是不是只做了论文中写的2.3.3和2.3.4的内容,而2.3.1和2.3.2实际上只是为了生成数据给2.3.3用的?
#320
opened Feb 8, 2025 by
ciaoyizhen
Problem: Unstable Response When Calling deepseek-reasoner Model (DeepSeek-R1) via API
#314
opened Feb 7, 2025 by
DavidMastrenet
About the doubts regarding the data processing of the first phase SFT section
#311
opened Feb 7, 2025 by
mlshenkai
Why is the CoT length shorter in DeepSeek R1 API than in the chatbot?
#309
opened Feb 7, 2025 by
fengyao12
Previous Next
ProTip!
Adding no:label will show everything without a label.