In this work, we study the task of 3D object localization using referring natural language expressions. We use RGB-D scans of indoor scenes, represented as 3D point clouds, from the recently introduced ScanRefer dataset. The corresponding ScanRefer baseline model treats each object in isolation and therefore lacks context awareness. Our key technical contribution is an approach that leverages a graph neural network and a language self-attention mechanism to better capture the relationships between objects within a scene. We show that our model achieves a better understanding of both the language expressions and the interactions between objects.
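To make the architecture concrete, the sketch below shows minimal PyTorch versions of the two components mentioned above: self-attention over the words of a referring expression, and one round of message passing between object proposals on a fully connected scene graph. The module names, feature dimensions, and graph structure are illustrative assumptions, not the exact SRGA implementation.

```python
import torch
import torch.nn as nn

class LanguageSelfAttention(nn.Module):
    """Self-attention over the word features of a referring expression."""
    def __init__(self, dim, num_heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)

    def forward(self, word_feats, pad_mask=None):
        # word_feats: (B, L, dim); pad_mask: (B, L), True at padded positions
        out, weights = self.attn(word_feats, word_feats, word_feats,
                                 key_padding_mask=pad_mask)
        return out, weights  # `weights` are the attention maps, e.g. for visualization

class ObjectGraphLayer(nn.Module):
    """One round of message passing between object proposals in a scene."""
    def __init__(self, dim):
        super().__init__()
        self.message = nn.Sequential(nn.Linear(2 * dim, dim), nn.ReLU())
        self.update = nn.Linear(2 * dim, dim)

    def forward(self, obj_feats):
        # obj_feats: (B, N, dim) -- one feature vector per object proposal
        B, N, D = obj_feats.shape
        recv = obj_feats.unsqueeze(2).expand(B, N, N, D)  # [b, i, j] = receiver i
        send = obj_feats.unsqueeze(1).expand(B, N, N, D)  # [b, i, j] = sender j
        msgs = self.message(torch.cat([send, recv], dim=-1)).mean(dim=2)
        return torch.relu(self.update(torch.cat([obj_feats, msgs], dim=-1)))

# Hypothetical shapes: batch of 2 scenes, 32 proposals, 20 words, 256-dim features
obj_feats = ObjectGraphLayer(256)(torch.randn(2, 32, 256))
word_feats, attn = LanguageSelfAttention(256)(torch.randn(2, 20, 256))
```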
For information on setup, training commands, and the ScanRefer baseline model, see: https://github.com/daveredrum/ScanRefer
To train the current SRGA model with multiview features, point normals, and pretrained VoteNet weights:
python scripts/train.py --use_multiview --use_normal --use_pretrained
To visualize the learned attention weights:
python scripts/visualize_attention.py --folder <foldername> --use_multiview --use_normal
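As a reference for interpreting the output, self-attention weights over a referring expression can be rendered as a word-by-word heatmap. The following is a minimal matplotlib sketch with made-up tokens and weights; it is independent of the script above and does not reproduce its exact output format.

```python
import matplotlib.pyplot as plt
import numpy as np

# Hypothetical data: an (L, L) matrix of self-attention weights over L words
tokens = "the chair next to the table".split()
L = len(tokens)
rng = np.random.default_rng(0)
weights = rng.random((L, L))
weights /= weights.sum(axis=-1, keepdims=True)  # rows sum to 1, like softmax output

fig, ax = plt.subplots()
im = ax.imshow(weights, cmap="viridis")
ax.set_xticks(range(L))
ax.set_xticklabels(tokens, rotation=45, ha="right")
ax.set_yticks(range(L))
ax.set_yticklabels(tokens)
fig.colorbar(im, ax=ax, label="attention weight")
plt.tight_layout()
plt.show()
```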
We would like to thank Dave Zhenyu Chen for his continued support of this project and for providing the ScanRefer codebase.