Where am I?: Scene Retrieval with Language

This is the repository that contains source code for the work Where am I?: Scene Retrieval with Language.

Evaluation

First download the model weights from here and place it in /playground/graph_models/model_checkpoints/graph2graph/. The necessary data files also need to be downloaded from here and placed into /playground/graph_models/. Then run the run_eval.sh script in /shell/.

Training

Run the run.sh script in /shell/.

Baselines

The CLIP2CLIP baseline can be found and run in the /baselines/CLIP2CLIP/ folder. And the Text2Pos baseline can be found in this fork. The model weights for the fine-tuned version of Text2Pos can be found here, and for the version trained from scratch on the 3DSSG dataset is here. In order to run the Text2Pos models, you can run run_text2pos.sh for training and run_eval_text2pos.sh for evaluation.

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
baselines/CLIP-to-CLIP		baselines/CLIP-to-CLIP
data_distribution_analysis		data_distribution_analysis
playground/graph_models		playground/graph_models
shell		shell
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Where am I?: Scene Retrieval with Language

Evaluation

Training

Baselines

About

Releases

Packages

Languages

jiaqchen/whereami-text2sgm

Folders and files

Latest commit

History

Repository files navigation

Where am I?: Scene Retrieval with Language

Evaluation

Training

Baselines

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages