Hello, chuanguang
This is a PR that adds fp16 training. I used this code to run your CIRKD on four RTX 3090 GPUs without any hyperparameter changes. The performance still matches the original results, while training is faster and GPU memory usage is lower.
I implemented fp16 in a straightforward way using if-else branches. If you have any suggestions on code style, please let me know. BTW, I just started my internship at Horizon, so you can find me on Feishu.
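For readers who have not seen the diff, here is a minimal sketch of what an if-else style fp16 training step might look like, assuming PyTorch's automatic mixed precision (`torch.autocast` and `torch.cuda.amp.GradScaler`). The names `train_step` and `use_fp16` are hypothetical and chosen for illustration only; they are not taken from the code in this PR.

```python
import torch
import torch.nn as nn


def train_step(model, batch, target, optimizer, criterion, scaler, use_fp16):
    """Run one optimization step, in mixed precision if use_fp16 is set.

    Hypothetical helper for illustration; not the code from this PR.
    """
    optimizer.zero_grad()
    if use_fp16:
        # Forward pass under autocast so eligible ops run in half precision.
        with torch.autocast(device_type="cuda", dtype=torch.float16):
            loss = criterion(model(batch), target)
        # Scale the loss to reduce the risk of fp16 gradient underflow,
        # then step through the scaler, which unscales the gradients first.
        scaler.scale(loss).backward()
        scaler.step(optimizer)
        scaler.update()
    else:
        # Plain fp32 path, unchanged from ordinary training.
        loss = criterion(model(batch), target)
        loss.backward()
        optimizer.step()
    return loss.item()
```

In this sketch the scaler would be created once before the training loop, for example with `torch.cuda.amp.GradScaler(enabled=use_fp16)`. Since both the scaler and `torch.autocast` accept an `enabled` argument, the two branches could probably also be merged into a single code path if that style is preferred.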
Looking forward to your reply. Thanks!
Best Regards!
Yun