I see that the LoRA fine-tuning implementation in your code only tunes the parameters of the image encoder in SAM. Is it important to also adapt the downstream prompt encoder and mask decoder?
When I tried to extend the fine-tuning to those blocks, the results came out like this.
Should the training run be longer than when fine-tuning with LoRA on the encoder only?
You are right, my implementation adapts the encoder only. I thought it would be wise to adapt the feature extractor, which is the encoder. I would expect adapting the mask decoder to give similar results, so I don't understand why your results look like this.
For the prompt encoder, I am not sure adaptation is necessary.
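In case it helps you reproduce the comparison, here is a minimal sketch of how the same LoRA idea could be extended to the mask decoder. It assumes a PyTorch SAM model whose decoder attention blocks expose `q_proj`/`v_proj` `nn.Linear` projections; the names `LoRALinear`, `add_lora_to_mask_decoder`, and the `r`/`alpha` defaults are illustrative, not part of this repo's API.

```python
import torch.nn as nn


class LoRALinear(nn.Module):
    """Wraps a frozen nn.Linear with a low-rank update: y = Wx + (alpha/r) * B(Ax)."""

    def __init__(self, base: nn.Linear, r: int = 4, alpha: int = 4):
        super().__init__()
        self.base = base
        self.base.weight.requires_grad_(False)
        if self.base.bias is not None:
            self.base.bias.requires_grad_(False)
        self.lora_a = nn.Linear(base.in_features, r, bias=False)
        self.lora_b = nn.Linear(r, base.out_features, bias=False)
        nn.init.zeros_(self.lora_b.weight)  # start as a no-op on top of the base layer
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * self.lora_b(self.lora_a(x))


def add_lora_to_mask_decoder(sam, r: int = 4, alpha: int = 4):
    """Freeze the mask decoder and wrap its q/v projections with LoRA (sketch only)."""
    decoder = sam.mask_decoder
    for p in decoder.parameters():
        p.requires_grad_(False)

    # Collect targets first, then replace, to avoid mutating modules while iterating.
    targets = [
        (parent, name, child)
        for parent in decoder.modules()
        for name, child in parent.named_children()
        if isinstance(child, nn.Linear) and name in ("q_proj", "v_proj")
    ]
    for parent, name, child in targets:
        setattr(parent, name, LoRALinear(child, r=r, alpha=alpha))
    return sam
```

With the base weights frozen and `lora_b` initialized to zero, training should start from exactly the pretrained behavior; only the small A/B matrices (and whatever you leave unfrozen elsewhere) receive gradients, so the learning rate and schedule from encoder-only LoRA may still need retuning when the decoder is included.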