-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot start evaluation after running Train_MambaBCD.py #37
Comments
Hi, thank you for your question. Did you solve your issue? Evaluation stage takes time. If that still doesn't work, you can try lowering the batch size. |
类似的问题,训练一开始是正常的,但是在首次starting evaluation时报cuda内存错误,调整batchsize没有效果。看起来不像是GPU内存不足的问题,请问作者这种情况是不是需要增加一些关于cuda内存管理的代码。CUDA out of memory. Tried to allocate 24.00 MiB (GPU 0; 11.76 GiB total capacity; 10.47 GiB already allocated; 33.62 MiB free; 10.49 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF |
请问这个问题你解决了吗?我也遇到这个问题了 |
还没有,加了清理128MB的碎片设置、还试了tiny版的模型也是一样 |
Hi guys, Thank you so much for your question. May I ask which dataset are you running? Best, |
levircd and SYSU
…---原始邮件---
发件人: "Sapere ***@***.***>
发送时间: 2024年6月17日(周一) 晚上6:04
收件人: ***@***.***>;
抄送: ***@***.******@***.***>;
主题: Re: [ChenHongruixuan/MambaCD] Train_MambaBCD.py运行后无法进行starting evaluation (Issue #37)
Hi guys,
Thank you so much for your question. May I ask which dataset are you running?
Best,
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you commented.Message ID: ***@***.***>
|
Are you running into this issue on both datasets, or just LEVIR-CD+? |
both datasets |
Hi, That's quite weird. For the LEVIR-CD+ dataset, since the image size in it is 1024x1024, the problem may occur. Thus, you may need to crop it into smaller size by yourself. But evalution on the SYSU dataset should not have that problem. We have updated the code, please try to train again with the current version of the code. Best, |
Thank you for your help and hard work! I'll try the new code. And good luck for your paper.
…---原始邮件---
发件人: "Sapere ***@***.***>
发送时间: 2024年6月18日(周二) 凌晨4:37
收件人: ***@***.***>;
抄送: ***@***.******@***.***>;
主题: Re: [ChenHongruixuan/MambaCD] Train_MambaBCD.py运行后无法进行starting evaluation (Issue #37)
Hi,
That's quite weird. For the LEVIR-CD+ dataset, since the image size in it is 1024x1024, the problem may occur. Thus, you may need to crop it into smaller size by yourself. But evalution on the SYSU dataset should not have that problem. We have updated the code, please try to train again with the current version of the code.
Best,
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you commented.Message ID: ***@***.***>
|
你好,非常感谢你的帮助,我已经从成功在SYSU数据集上复现了BCD代码,且精度与论文相符,LEVIR-CD数据集我会裁剪成256*256版本后再去尝试。 |
Hi,
Glad to hear that!
The evaluation code is placed on the outside to get the final accuracy. To increase the speed of evaluation, you need to increase eval_batch_size. the current setting is 1.
Thank you for pointing this out. We will fix this error soon. Best, |
在500个iter过后,开始starting evaluation,然后就卡在那里不动了,源代码中似乎是用test测试集进行评估的,不知道哪里出现问题了。
The text was updated successfully, but these errors were encountered: