HCNet

Remote sensing image captioning aims to describe the key objects in remote sensing images in natural language. Generating high-quality captions is challenging because of the multi-scale targets in remote sensing images and the cross-modality gap between image and text features. To address these problems, this paper presents HCNet, an approach that generates captions through hierarchical feature aggregation and cross-modality feature alignment. Specifically, a hierarchical feature aggregation module (HFAM) is proposed to obtain a comprehensive representation of visual features. Considering the disparities among features of different modalities, a cross-modality feature interaction module (CFIM) is designed in the decoder to facilitate feature alignment. Meanwhile, a cross-modality alignment loss is introduced to align image and text features. Extensive experiments on three public captioning datasets show that HCNet achieves strong performance. In particular, it improves the CIDEr score by +14.15% on the NWPU dataset compared with existing approaches.
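This README does not spell out the form of the cross-modality alignment loss. Since the code builds on CLIP (see the credits below), a symmetric InfoNCE-style contrastive objective is one plausible reading; the following is a minimal sketch under that assumption, not the authors' implementation, and all names (`align_loss`, `image_feats`, `text_feats`, `temperature`) are hypothetical.

```python
# Hedged sketch: a CLIP-style symmetric contrastive loss that pulls matching
# image/text pairs together in a batch and pushes non-matching pairs apart.
# This is an assumed illustration of a "cross-modality alignment loss",
# not the HCNet authors' code.
import torch
import torch.nn.functional as F

def align_loss(image_feats: torch.Tensor,
               text_feats: torch.Tensor,
               temperature: float = 0.07) -> torch.Tensor:
    """Symmetric InfoNCE loss over L2-normalized features.

    image_feats, text_feats: (batch, dim) tensors from the two encoders;
    row i of each tensor is assumed to form a positive pair.
    """
    image_feats = F.normalize(image_feats, dim=-1)
    text_feats = F.normalize(text_feats, dim=-1)

    # Cosine-similarity logits, scaled by the temperature.
    logits = image_feats @ text_feats.t() / temperature
    targets = torch.arange(logits.size(0), device=logits.device)

    # Average the image-to-text and text-to-image cross-entropy terms.
    loss_i2t = F.cross_entropy(logits, targets)
    loss_t2i = F.cross_entropy(logits.t(), targets)
    return 0.5 * (loss_i2t + loss_t2i)
```

Pairing the two directions of the loss is the standard way CLIP aligns image and text embeddings in a shared space, which matches the alignment goal described in the abstract.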

First, follow MLAT to generate the required data in `data\UCM_images1`.

Then, run `python train_HCNet_UCM.py` to generate the weights in `best_UCM_weights`.

Finally, run `python eval_HCNet_UCM.py` to evaluate the model.
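Put together, the workflow looks like the following (the first step is manual data preparation via MLAT rather than a single command):

```
# 1. Prepare the UCM data following MLAT and place it in data\UCM_images1.
# 2. Train; the best weights are saved to best_UCM_weights.
python train_HCNet_UCM.py
# 3. Evaluate the trained model.
python eval_HCNet_UCM.py
```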

This code is based on MLAT and CLIP.
