PPLM

This repository contains code to run the Plug and Play Language Model (PPLM), as described in this blog post and arXiv paper. A demo and Colab notebook are also available.

PPLM is also integrated into the 🤗/Transformers repository.

Plug and Play Language Models: a Simple Approach to Controlled Text Generation

Authors: Sumanth Dathathri, Andrea Madotto, Janice Lan, Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, and Rosanne Liu

PPLM allows a user to flexibly plug in one or more tiny attribute models representing the desired steering objective into a large, unconditional language model (LM). The method has the key property that it uses the LM as is—no training or fine-tuning is required—which enables researchers to leverage best-in-class LMs even if they do not have the extensive hardware required to train them.

See also our arXiv paper, blog post, and try it out for yourself with no setup using the Colab notebook.

Setup

pip install -r requirements.txt

Citation

@article{dathathri2019plug,
    title={Plug and Play Language Models: a Simple Approach to Controlled Text Generation},
    author={Sumanth Dathathri and Andrea Madotto and Janice Lan and Jane Hung and Eric Frank and Piero Molino and Jason Yosinski and Rosanne Liu},
    journal={arXiv preprint arXiv:1912.02164},
    year={2019},
}

PPLM-BoW

Example command for bag-of-words control

python run_pplm.py -B military --cond_text "The potato" --length 50 --gamma 1.5 --num_iterations 3 --num_samples 10 --stepsize 0.03 --window_length 5 --kl_scale 0.01 --gm_scale 0.99 --colorama --sample

Tuning hyperparameters for bag-of-words control

Increase --stepsize to intensify topic control, and decrease its value to soften the control. --stepsize 0 recovers the original uncontrolled GPT-2 model.
If the language being generated is repetitive (For e.g. "science science experiment experiment"), there are several options to consider:
a) Reduce the --stepsize
b) Increase --kl_scale (the KL-loss coefficient) or decrease --gm_scale (the gm-scaling term)
c) Add --grad-length xx where xx is an (integer <= length, e.g. --grad-length 30).

PPLM-Discrim

Example command for discriminator based sentiment control

python run_pplm.py -D sentiment --class_label 2 --cond_text "My dog died" --length 50 --gamma 1.0 --num_iterations 10 --num_samples 10 --stepsize 0.04 --kl_scale 0.01 --gm_scale 0.95 --sample

Tuning hyperparameters for discriminator control

Increase --stepsize to intensify topic control, and decrease its value to soften the control. --stepsize 0 recovers the original uncontrolled GPT-2 model.
Use --class_label 3 for negative, and --class_label 2 for positive

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
imgs		imgs
.fossa.yml		.fossa.yml
.travis.yml		.travis.yml
LICENSE		LICENSE
README.md		README.md
pplm_classification_head.py		pplm_classification_head.py
requirements.txt		requirements.txt
run_pplm.py		run_pplm.py
run_pplm_discrim_train.py		run_pplm_discrim_train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PPLM

Plug and Play Language Models: a Simple Approach to Controlled Text Generation

Setup

Citation

PPLM-BoW

Example command for bag-of-words control

Tuning hyperparameters for bag-of-words control

PPLM-Discrim

Example command for discriminator based sentiment control

Tuning hyperparameters for discriminator control

About

Releases

Packages

Languages

License

aceport/PPLM

Folders and files

Latest commit

History

Repository files navigation

PPLM

Plug and Play Language Models: a Simple Approach to Controlled Text Generation

Setup

Citation

PPLM-BoW

Example command for bag-of-words control

Tuning hyperparameters for bag-of-words control

PPLM-Discrim

Example command for discriminator based sentiment control

Tuning hyperparameters for discriminator control

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages