This is the code repository for reproducing the paper FILM: Following Instructions in Language with Modular Methods on ReALFRED.
Our code is largely built upon the FILM codebase.
$ git clone https://github.com/snumprlab/realfred.git
$ cd realfred/film
$ export ALFRED_ROOT=$(pwd)
$ conda create -n refilm python=3.6
$ conda activate refilm
$ cd $ALFRED_ROOT
$ pip install --upgrade pip
$ pip install -r requirements.txt
You also need to install PyTorch for your system, e.g., PyTorch v1.8.1 + CUDA 11.1.
Refer to the official PyTorch installation page.
pip install torch==1.8.1+cu111 torchvision==0.9.1+cu111 torchaudio==0.8.1 -f https://download.pytorch.org/whl/torch_stable.html
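To sanity-check the install, you can verify that the expected PyTorch version is picked up and that CUDA is visible (a quick optional check, not part of the original setup):
$ python -c "import torch; print(torch.__version__, torch.cuda.is_available())"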
If you want to visualize semantic segmentation outputs, install detectron2 according to your system configuration. (You do not need to install this unless you want to visualize segmentation outputs on the egocentric RGB frame, as in the "segmented RGB" example here.) If you are using conda:
python -m pip install detectron2 -f \
https://dl.fbaipublicfiles.com/detectron2/wheels/cu111/torch1.8/index.html
Please make sure that ai2thor's version is 4.3.0.
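If you are unsure which version you have, you can pin and verify it with pip (assuming a pip-managed install of ai2thor):
$ pip install ai2thor==4.3.0
$ pip show ai2thor | grep Version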
- Download the annotation files from the Hugging Face repo.
git clone https://huggingface.co/datasets/SNUMPR/realfred_json alfred_data_all
- Now, go to the 'FILM' directory (this repository) and run,
$ export FILM=$(pwd)
- Go to a directory where you would like to clone "alfred". Then run,
$ git clone https://github.com/askforalfred/alfred.git
$ export ALFRED_ROOT=$(pwd)/alfred
- Now run
$ cd $ALFRED_ROOT
$ python models/train/train_seq2seq.py --data $FILM/alfred_data_all/Re_json_2.1.0 --model seq2seq_im_mask --dout exp/model:{model},name:pm_and_subgoals_01 --splits $FILM/alfred_data_all/splits/oct24.json --gpu --batch 8 --pm_aux_loss_wt 0.1 --subgoal_aux_loss_wt 0.1 --preprocess
This will take 2~5 minutes. Once the progress bars for preprocessing are all filled, the code will stop with an error message (you can ignore it and proceed).
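As an optional sanity check that preprocessing finished, you can look for the generated "pp" folders under the data directory (this layout is an assumption based on the standard ALFRED preprocessing; adjust the path if yours differs):
$ find $FILM/alfred_data_all/Re_json_2.1.0 -type d -name "pp" | head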
- Now run,
$ cd $FILM
$ mkdir alfred_data_small
$ ln -s $FILM/alfred_data_all/splits $FILM/alfred_data_small/splits
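You can confirm the link resolves before moving on (the split file below is the one the evaluation command uses):
$ ls -l $FILM/alfred_data_small/splits/oct24.json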
- Download "Pretrained_Models_FILM" from this link and unzip it.
unzip Pretrained_Models_realfred.zip
- Semantic segmentation
mv mrcnn_objects_best.pt $FILM
mv mrcnn_receptacles_best.pt $FILM
- Depth prediction
mv depth_models $FILM/models/depth/depth_models
- Semantic Search Policy
mv best_model_multi.pt $FILM/models/semantic_policy/best_model_multi.pt
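Before evaluating, it may help to confirm the weights landed where the steps above put them:
$ ls $FILM/mrcnn_objects_best.pt $FILM/mrcnn_receptacles_best.pt
$ ls $FILM/models/depth/depth_models
$ ls $FILM/models/semantic_policy/best_model_multi.pt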
Caveat: Multiprocessing (using --num_processes > 1) will slow down the construction of the semantic map. We recommend using "--num_processes 1" (or a number around 2) and simply running several jobs, e.g., one job with episodes 0 to 200, another with episodes 200 to 400, and so on (see the sketch after the evaluation commands below).
For example, run the following commands to evaluate FILM.
$ export SPLIT=valid_seen
$ export FROM_IDX=0
$ export TO_IDX=50
$ python main.py \
--max_episode_length 1000 \
--num_local_step 25 \
--num_processes 1 \
--eval_split $SPLIT \
--from_idx $FROM_IDX \
--to_idx $TO_IDX \
--max_fails 10 \
--debug_local \
--set_dn realfred-film-${SPLIT} \
--which_gpu 0 \
--sem_gpu_id 0 \
--sem_seg_gpu 0 \
--depth_gpu 0 \
--x_display 1 \
--data "alfred_data_all/Re_json_2.1.0" \
--splits "alfred_data_small/splits/oct24.json" \
--learned_depth \
--use_sem_policy \
--use_sem_seg \
--appended
Or you could run
bash eval_total_valid.sh
bash eval_total_tests.sh
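Following the caveat above, a simple way to parallelize is to launch several single-process jobs over disjoint episode ranges. A minimal sketch (the chunk size and the per-chunk --set_dn suffix are illustrative; append the remaining flags from the full command above to each call):
export SPLIT=valid_seen
for FROM_IDX in 0 200 400; do
    TO_IDX=$((FROM_IDX + 200))
    python main.py --num_processes 1 --eval_split $SPLIT \
        --from_idx $FROM_IDX --to_idx $TO_IDX \
        --set_dn realfred-film-${SPLIT}-${FROM_IDX} &  # append the remaining flags here
done
wait  # block until all background jobs finish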
The outputs of your runs are saved as pickles in "results/analyze_recs/".
To compute the success rate, change directory to "results/analyze_recs/" and run,
$ python val_sr.py
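If you want to inspect the raw records rather than the aggregated success rate, each pickle can be loaded directly. A minimal sketch (the structure of the records is not documented here, so print and adjust to what you find):
$ python3 -c "import glob, pickle; p = sorted(glob.glob('*.p'))[0]; r = pickle.load(open(p, 'rb')); print(p, type(r))"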
Move to the following directory and clone the data. Then, rename the folder to alfred_data.
cd models/instructions_processed_LP/BERT/data
git clone https://huggingface.co/SNUMPR/realfred_film_BERT_data
mv realfred_film_BERT_data alfred_data
Change directory:
$ cd models/instructions_processed_LP/BERT
To train the BERT type classification model ("base.pt"),
$ python3 train_bert_base.py -lr 1e-5
(Use --no_appended to train on high-level instructions only.)
To train BERT argument classification,
$ python3 train_bert_args.py --no_divided_label --task mrecep -lr 5e-5
(Use --no_appended to train on high-level instructions only.) Similarly, train models for --task object, --task parent, --task toggle, and --task sliced (see the loop below).
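For convenience, all five argument classifiers can be trained in one loop with the same hyperparameters as above:
for TASK in mrecep object parent toggle sliced; do
    python3 train_bert_args.py --no_divided_label --task $TASK -lr 5e-5
done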
To generate the final output for "agent/sem_exp_thor.py",
$ python3 end_to_end_outputs.py -sp YOURSPLIT -m MODEL_SAVED_FOLDER_NAME -o OUTPUT_PICKLE_NAME
(Again, use --no_appended to use high-level instructions only.)
After you get the output pickle files (e.g., instruction2_params_tests_seen_appended_new_split_oct24.p), put them in the instruc2params folder. Then, edit read_test_dict in models/instructions_processed_LP/ALFRED_task_helper.py to use your files.
You can check the accuracy of the trained models' argument predictions.
cd ..
python3 compare_BERT_pred_with_GT.py
Alternatively, you can download the pretrained models (the "appended" versions).
cd models/instructions_processed_LP/BERT
git clone https://huggingface.co/SNUMPR/realfred_film_BERT_pretrained
Trained and Tested on:
- GPU - RTX A6000
- CPU - Intel(R) Core(TM) i7-12700K CPU @ 3.60GHz
- RAM - 64GB
- OS - Ubuntu 20.04
ReALFRED
@inproceedings{kim2024realfred,
author = {Kim, Taewoong and Min, Cheolhong and Kim, Byeonghwi and Kim, Jinyeon and Jeung, Wonje and Choi, Jonghyun},
title = {ReALFRED: Interactive Instruction Following Benchmark in Photo-Realistic Environment},
booktitle = {ECCV},
year = {2024}
}
FILM
@inproceedings{min2021film,
author = {Min, So Yeon and Chaplot, Devendra Singh and Ravikumar, Pradeep and Bisk, Yonatan and Salakhutdinov, Ruslan},
title = {FILM: Following Instructions in Language with Modular Methods},
booktitle = {ICLR},
year = {2022}
}