
[Question] The gradients for the additional tunable parameters are None. #1846

Open · LiZhangMing opened this issue Feb 28, 2025 · 0 comments

LiZhangMing commented Feb 28, 2025

Question

First of all, thank you very much for your work. I want to add some tunable parameters to the CLIP attention during LLaVA fine-tuning. These parameters have requires_grad set to True and are included in optimizer_grouped_parameters. However, during training I noticed that the gradients of these added parameters in the optimizer are None (p.grad is None). Could you please advise how I should modify the project? Many thanks!
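
For context, here is a minimal sketch of the kind of setup I mean. The class, function, and parameter names (TunableAttnWrapper, tune_bias, collect_tunable) are my own illustrative choices, not LLaVA's or CLIP's actual identifiers:

```python
import torch
import torch.nn as nn

class TunableAttnWrapper(nn.Module):
    """Wraps an existing CLIP attention module and adds a learnable bias."""

    def __init__(self, attn: nn.Module, embed_dim: int):
        super().__init__()
        self.attn = attn
        # Extra tunable parameter; nn.Parameter has requires_grad=True by default.
        self.tune_bias = nn.Parameter(torch.zeros(embed_dim))

    def forward(self, hidden_states, *args, **kwargs):
        out = self.attn(hidden_states, *args, **kwargs)
        # HF CLIP attention usually returns a tuple (attn_output, attn_weights).
        if isinstance(out, tuple):
            return (out[0] + self.tune_bias,) + out[1:]
        return out + self.tune_bias

def collect_tunable(model: nn.Module):
    """Collect only the added parameters for a separate optimizer group."""
    return [p for n, p in model.named_parameters() if "tune_bias" in n]

# optimizer_grouped_parameters = [
#     {"params": collect_tunable(model), "weight_decay": 0.0},
# ]
# optimizer = torch.optim.AdamW(optimizer_grouped_parameters, lr=1e-4)
```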

(Note: I have commented out the @torch.no_grad() decorator in the clip_encoder.py file; is there anything else that needs to be changed?)
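
To narrow it down, I ran a few checks on whether gradients reach the new parameters at all. A sketch of those checks is below; it assumes a HF-style forward that returns an object with a .loss field, and "tune_bias" is the hypothetical parameter name from the sketch above:

```python
import torch
import torch.nn as nn

def check_tunable_grads(model: nn.Module, batch: dict, key: str = "tune_bias"):
    """Run one forward/backward and report which added params receive no grad."""
    # 1. The new parameters exist, are trainable, and sit on the right device.
    for name, p in model.named_parameters():
        if key in name:
            print(name, p.requires_grad, p.device)

    # 2. After one backward pass, do gradients actually reach them?
    model.train()
    model.zero_grad(set_to_none=True)
    loss = model(**batch).loss
    loss.backward()
    for name, p in model.named_parameters():
        if key in name and p.grad is None:
            print("no grad reaches:", name)
```

Two things I am unsure about: whether the vision-tower features are frozen or detached somewhere else besides the @torch.no_grad() decorator (a .detach() or requires_grad_(False) anywhere on that path would also leave p.grad as None), and whether training under DeepSpeed ZeRO matters here, since ZeRO can report p.grad as None even when gradients do flow, because they are kept in the engine's partitioned state.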
