Official PyTorch implementation of the paper "Exemplar Masking for Multimodal Incremental Learning".
You can visit our project website here.
Multimodal incremental learning needs to digest information from multiple modalities while concurrently learning new knowledge without forgetting what was previously learned. This task poses several challenges, chiefly the larger storage footprint of multimodal data in exemplar-based methods and the computational cost of fine-tuning large multimodal models. In this paper, we leverage a parameter-efficient tuning scheme to reduce the burden of fine-tuning and propose an exemplar masking framework to efficiently replay old knowledge. Specifically, non-important tokens are masked based on the attention weights and the correlation across modalities, significantly reducing the storage size of each exemplar and consequently allowing more exemplars to be saved within the same memory buffer. Moreover, we design a multimodal data augmentation technique to diversify exemplars for replaying prior knowledge. In experiments, we not only evaluate our method on existing multimodal datasets but also extend the ImageNet-R dataset to a multimodal dataset as a real-world application, where captions are generated by querying multimodal large language models (e.g., InstructBLIP). Extensive experiments show that our exemplar masking framework is more efficient and more robust to catastrophic forgetting under the same limited memory buffer.
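To illustrate the core idea of storing masked exemplars, here is a minimal sketch (not the released implementation): it assumes per-token attention scores from the backbone (e.g., [CLS]-to-token attention averaged over heads) are available, and keeps only the top-ranked tokens of an exemplar so that the same memory buffer can hold more exemplars. The function name, tensor shapes, and `keep_ratio` value are illustrative assumptions.

```python
import torch

def mask_exemplar(tokens: torch.Tensor, cls_attn: torch.Tensor, keep_ratio: float = 0.25):
    """Sketch of attention-based exemplar masking.

    tokens:     (N, D) token/patch features of one exemplar
    cls_attn:   (N,)   attention weights from the [CLS] token to each token
                       (assumed to be exposed by the backbone)
    keep_ratio: fraction of tokens to retain in the memory buffer
    """
    num_keep = max(1, int(keep_ratio * tokens.size(0)))
    keep_idx = torch.topk(cls_attn, num_keep).indices       # most-attended tokens
    mask = torch.zeros(tokens.size(0), dtype=torch.bool)
    mask[keep_idx] = True
    # Only the kept tokens and their positions are stored, so roughly
    # 1 / keep_ratio times more exemplars fit in the same buffer.
    return tokens[keep_idx], keep_idx, mask

# usage sketch with hypothetical shapes (14x14 ViT patches, 768-dim features)
tokens = torch.randn(196, 768)
cls_attn = torch.rand(196)
kept, idx, mask = mask_exemplar(tokens, cls_attn, keep_ratio=0.25)
```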
To Do
This code is based on ViLT.
If you find this work useful for your research, please cite:
@article{lee2024exemplarmasking,
  title={Exemplar Masking for Multimodal Incremental Learning},
  author={Yi-Lun Lee and Chen-Yu Lee and Wei-Chen Chiu and Yi-Hsuan Tsai},
  journal={arXiv preprint arXiv:2412.09549},
  year={2024}
}