Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Single GPU feasibility #5

Open
gusye1234 opened this issue Mar 2, 2023 · 1 comment
Open

Single GPU feasibility #5

gusye1234 opened this issue Mar 2, 2023 · 1 comment

Comments

@gusye1234
Copy link

Hi.
Note that the demo command will launch the training in 8 GPU. Have you tested running this task on a single GPU, and how long will it take?

I tried to follow up on this work but only got one GPU... So if the training speed is hard to bear, I might consider adding more GPUs.

@ArvinZhuang
Copy link
Owner

Hi @gusye1234! thanks for using our code!
You definitely could try one GPU with --gradient_accumulation_steps first, DSI with QG actually converges pretty fast with 8 gpus, so I think speed on one GPU is acceptable.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants