Code for the EMNLP 2022 (Findings) paper "P3LM: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training".
- python==3.6.8
- pytorch==1.5.0+cu101
- Pre-process the GLGE data and put it in the ./P2DeNet/glge/ folder.
- Download the trained models and place them in ./P2DeNet/glge/models/[DATASET NAME]/, one directory per dataset.
- Run one of the following commands to test or train a model on a GLGE dataset:
python finetune.generation.py test [DATASET NAME]
python finetune.generation.py train [DATASET NAME]
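For example, to evaluate a downloaded model and then fine-tune your own on a single dataset (the identifier `xsum` below is illustrative; use the actual dataset name matching your folders under ./P2DeNet/glge/):

python finetune.generation.py test xsum
python finetune.generation.py train xsum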
If you use this code, please cite our paper:
@inproceedings{bao-etal-2022-p3lm,
title = "{P3LM}: Probabilistically Permuted Prophet Language Modeling for Generative Pre-Training",
author = "Bao, Junwei and
Wang, Yifan and
Ying, Jiangyong and
Gong, Yeyun and
Zhao, Jing and
Wu, Youzheng and
He, Xiaodong and
Zhou, Bowen",
booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP)",
year = "2021",
abstract = "Conventional left-to-right (L2R) generative pre-training methods face two issues during decoding: limited to unidirectional target sequence modeling, and constrained on strong local dependencies. In this paper, we propose P2DeNet, a permutation over prophet decoding net, which strengthens the modeling of bi-directional information and long token dependencies in target sequences, for generative pre-training. Specifically, P2DeNet learns to generate tokens in permuted order upon an order-aware transformer decoder, as well as the corresponding future N tokens with a multi-stream attention mechanism. Extensive experiments are conducted on the GLGE benchmark, which includes four datasets for summarization, two for question generation, one for conversational question answering, and one for dialog response generation, where P2DeNet achieves state-of-the-art results compared with published methods.",
}
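As a rough illustration of the decoding scheme described in the abstract, below is a minimal, hypothetical sketch (not the code in this repository) of order-aware attention masking for permuted decoding: a permutation of the target positions is sampled, and each position may only attend to positions generated at earlier decoding steps. The function name and tensor shapes are assumptions made for this example.

```python
# Minimal, hypothetical sketch of order-aware masking for permuted decoding.
# This is NOT the repository's implementation; it only illustrates the idea
# from the abstract: tokens are generated in a sampled permuted order, so each
# position may only attend to positions generated at earlier decoding steps.
import torch


def permuted_attention_mask(seq_len):
    """Sample a generation order and build the matching self-attention mask.

    Returns (order, mask), where mask[i][j] is True iff position j is
    generated strictly before position i, i.e. i may attend to j.
    """
    order = torch.randperm(seq_len)                # e.g. [2, 0, 3, 1]
    step = torch.empty(seq_len, dtype=torch.long)  # step[pos] = decoding step of pos
    step[order] = torch.arange(seq_len)
    mask = step.unsqueeze(1) > step.unsqueeze(0)   # attend only to earlier steps
    return order, mask


if __name__ == "__main__":
    order, mask = permuted_attention_mask(4)
    print("generation order:", order.tolist())
    print(mask.int())
    # P2DeNet/P3LM additionally predicts the next N future tokens along this
    # permuted order via a multi-stream attention mechanism (not shown here).
```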