Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some questions about finetuning and LISA++ #169

Open
chris5zk opened this issue Dec 18, 2024 · 0 comments
Open

Some questions about finetuning and LISA++ #169

chris5zk opened this issue Dec 18, 2024 · 0 comments

Comments

@chris5zk
Copy link

chris5zk commented Dec 18, 2024

Hello,
Your work is truly impressive! I’m particularly interested in the instance segmentation performance presented in the LISA++ paper.
I’m trying to reproduce the LISA++ results using the pretrained LISA model with my own data, but I’ve encountered the following issues:

  1. Fine-tuning LISA with my own data:
    When using LISA as a pretrained model to fine-tune on my data, I ran into the error:
    AttributeError: 'LISAForCausalLM' object has no attribute 'ce_loss_weight'.
    Thanks to the solution provided in AttributeError: 'LISAForCausalLM' object has no attribute 'ce_loss_weight' #85 , modifying the LISA.py file by replacing:

    if not hasattr(config,  train_mask_decoder"):

    with

    if hasattr(config, "train_mask_decoder"):

    then the code now runs successfully. However, I’m unsure why this error occurs. Could you clarify if this adjustment might cause any issues, or if it’s simply a typo in the code?

  2. Instruction Data formatting and tokenizer compatibility:
    Following the instructions in the LISA++ paper, I formatted the reasoning problem answers as class_name [ID] to serve as instruction data. However, this seems a bit unreasonable since the tokenizer only includes the [SEG] token. I noticed that in LISA.py, the [SEG] IDs are counted to decide whether to generate a prompt for the mask decoder. This causes a mismatch between gt_mask and pred_mask. Are there additional details I should be aware of to avoid such issues? or should I add an [SEG] behind the class ID?

Will you be providing more details or guidance on this? I would deeply appreciate any help or clarification on these matters.

Thank you so much!

@chris5zk chris5zk changed the title Some question about finetuning and LISA++ Some questions about finetuning and LISA++ Dec 18, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant