the training parameters of your single branch convnext encoder #4

yuecao0119 · 2024-03-08T12:32:27Z

Hello, your job is so great.

But I would like to ask, is it convenient to disclose the training parameters of your single branch convnext encoder? I am not very able to understand the following part of the code.

def feature_select(self, image_forward_outs):
        if self.select_layer>100:
            image_features = image_forward_outs[-4:]
        else:
            image_features = image_forward_outs[-1]
        return image_features

The text was updated successfully, but these errors were encountered:

luogen1996 · 2024-03-08T12:39:42Z

These codes of image_features = image_forward_outs[-4:] are not actually used. We directly select the last layer of ConvNeXT to extract visual features. We will revise our codes soon.

yuecao0119 · 2024-03-08T14:33:36Z

Thank you for your answer.
How should the single-branch convnext in your paper be trained? Because I tried to use your code to train single-branch convnext, the loss effect in the pretrain stage was not very good.

luogen1996 · 2024-03-08T14:37:00Z

Your loss looks actually good. Single-branch LLaVA-HR performs worse, see our paper.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the training parameters of your single branch convnext encoder #4

the training parameters of your single branch convnext encoder #4

yuecao0119 commented Mar 8, 2024

luogen1996 commented Mar 8, 2024

yuecao0119 commented Mar 8, 2024

luogen1996 commented Mar 8, 2024

the training parameters of your single branch convnext encoder #4

the training parameters of your single branch convnext encoder #4

Comments

yuecao0119 commented Mar 8, 2024

luogen1996 commented Mar 8, 2024

yuecao0119 commented Mar 8, 2024

luogen1996 commented Mar 8, 2024