QWA

Code for the paper "Perceiving the World: Question-guided Reinforcement Learning for Text-based Games"

Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Tianyi Zhou and Chengqi Zhang

Figure: an overview of the decision-making process.

Figure: model architecture.

Installation

Our code depends heavily on xingdi-eric-yuan/GATA-public. Additional dependencies are listed in requirements.txt.
Download the word embeddings:

wget "https://bit.ly/2U3Mde2"

Datasets for pre-training the task selector and the action validator are provided at this link. The other datasets can be downloaded with:

# AP
wget https://aka.ms/twkg/ap.0.2.zip

# RL
wget https://aka.ms/twkg/rl.0.2.zip

Training

Modify the paths in the config files, e.g. "word_embedding_path", to match your local setup.
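If you prefer to script the path change rather than edit each file by hand, a minimal sketch follows. It is not part of this repository: the helper name set_config_value is hypothetical, and it assumes the key sits on its own line in the form `word_embedding_path: <value>`. It edits the file as plain text, so an inline comment on the matched line is dropped and values containing double quotes are not handled.

```python
import re
from pathlib import Path


def set_config_value(config_file, key, value):
    """Rewrite every `key: ...` line in a YAML file as `key: "value"`.

    Hypothetical helper (not part of the repository). Works on the raw
    text, so indentation and all other lines are left untouched.
    Returns the number of lines that were changed.
    """
    path = Path(config_file)
    text = path.read_text(encoding="utf-8")
    # Capture the indentation, the key, and the colon; replace the rest of the line.
    pattern = re.compile(
        rf"^([ \t]*{re.escape(key)}[ \t]*:[ \t]*).*$", flags=re.MULTILINE
    )
    # A function replacement avoids backslash-escape surprises in `value`.
    new_text, count = pattern.subn(lambda m: f'{m.group(1)}"{value}"', text)
    if count:
        path.write_text(new_text, encoding="utf-8")
    return count
```

For example, `set_config_value("config/config_pretrainAP.yaml", "word_embedding_path", "/path/to/embeddings")` would update that key in the action prediction config; the embedding path shown here is a placeholder. A return value of 0 means the key was not found and the file was left unchanged.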
Action prediction (providing initialization for the encoders):

python train_ap.py config/config_pretrainAP.yaml

Task selector (pre-training phase):

python train_vt.py config/config_pretrainVT.yaml

Action validator (pre-training phase):

python train_va.py config/config_pretrainVA.yaml

Action selector (reinforcement learning phase):

# Medium games
python train_rl_medium.py config/config_trainRL_medium.yaml

# Hard games
python train_rl_hard.py config/config_trainRL_hard.yaml
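The stages above can also be launched from one script. The sketch below is not part of this repository: it assumes the stages are run in the order listed, from the repository root, and that each stage locates the outputs of earlier stages through the paths set in its config file. The names STAGES, build_commands and run_all are hypothetical. By default it only prints the commands.

```python
import subprocess
import sys

# Script and config pairs copied from the commands above, in the listed order.
STAGES = [
    ("action prediction", "train_ap.py", "config/config_pretrainAP.yaml"),
    ("task selector", "train_vt.py", "config/config_pretrainVT.yaml"),
    ("action validator", "train_va.py", "config/config_pretrainVA.yaml"),
    ("action selector, medium games", "train_rl_medium.py",
     "config/config_trainRL_medium.yaml"),
    ("action selector, hard games", "train_rl_hard.py",
     "config/config_trainRL_hard.yaml"),
]


def build_commands(python=sys.executable, stages=STAGES):
    """Return one argument list per stage, e.g. [python, script, config]."""
    return [[python, script, config] for _, script, config in stages]


def run_all(dry_run=True):
    """Print each command; when dry_run is False, also execute it."""
    for (name, _, _), cmd in zip(STAGES, build_commands()):
        print(f"[{name}] {' '.join(cmd)}")
        if not dry_run:
            # check=True stops the pipeline at the first failing stage.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(dry_run=True)
```

Calling `run_all(dry_run=False)` runs the five commands one after another. To train on only one difficulty level, remove the other action selector entry from STAGES.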

Citation

@inproceedings{xu-etal-2022-perceiving,
    title = "Perceiving the World: Question-guided Reinforcement Learning for Text-based Games",
    author = "Xu, Yunqiu  and
      Fang, Meng  and
      Chen, Ling  and
      Du, Yali  and
      Zhou, Joey  and
      Zhang, Chengqi",
    booktitle = "Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = may,
    year = "2022",
    address = "Dublin, Ireland",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.acl-long.41",
    doi = "10.18653/v1/2022.acl-long.41",
    pages = "538--560"
}

License

MIT License
