Skip to content

Latest commit

 

History

History
113 lines (95 loc) · 6.72 KB

README.md

File metadata and controls

113 lines (95 loc) · 6.72 KB

Fine-grained Contrastive Learning for Relation Extraction

Description

This repo contains the source code for the EMNLP 2022 paper Fine-grained Contrastive Learning for Relation Extraction.

Project Structure

├── configs                <- Configuration files
│   ├── experiment               <- Experiment configs
│   ├── mode                     <- Mode configs
│   ├── trainer                  <- Trainer configs
│   └── config.yaml              <- Main config 
│
├── data                   <- Project data
│
├── logs                   <- Logs and saved checkpoints
│
├── preprocess             <- Preprocessing scripts
│
├── saved_models           <- Saved models
│
├── src                    <- Source code
│   ├── stage_1                  <- Stage 1 code: record learning order
│   ├── stage_2                  <- Stage 2 code: contrastive pre-training
│   └── stage_3                  <- Stage 2 code: fine-tuning
│       ├── doc_level            <- Fine-tune for doc-level relation extraction
│       └── sentence_level       <- Fine-tune for sentence-level relation extraction
│
├── requirements.txt       <- File for installing python dependencies
├── run.py                 <- Controller
└── README.md

Instructions

📓 Pre-run Notes:

  • For convenience, we provide the complete preprocessed datasets, saved models, and logs for all of our experiments.
  • By using the preprocessed data and saved models, you can jump to any stage detailed below without needing to run the previous stages.
  • This repo uses Hyrda to run and configure experiments. You can override any parameter from command line like this: CUDA_VISIBLE_DEVICES=0 python run.py trainer.learning_rate=1e-5.
  • Please see the FineCL paper for details about each stage of training and evaluation.
  • The code in this repo was developed using Python (v3.9), PyTorch (v1.12.1), Hugginface transformers (v2.5.0), and CUDA (v11.6)

⓪ Initalize

  • Install dependencies from requirements.txt
  • Install Apex
  • Download the data and saved models via the command line:
    gdown --no-check-certificate --folder https://drive.google.com/drive/u/1/folders/13-iTHhde8B5BQPNk8bCA0z6dxxo42ov1
  • Unzip data.zip and then move both data and saved_models into the project's root directory.
  • [OPTIONAL] All the preprocessed data is provided, but if you'd like to preprocess the data yourself, run:
    python run.py mode=preprocess.yaml

① Stage 1 – Record learning order:

  • The exact learning order data used in the paper is provided in data/erica_data/order_dict_augmented.json. However, if you'd like to generate a learning order of relation instances from scratch, do the following:
  • There are 10 separate distantly labeled training data files numbered 0 through 9. Run:
    CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_1.yaml erica_file_num=[FILE_NUM] for each of the 10 files, replacing [FILE_NUM] with the appropriate file number for each run.
  • Each run outputs a json file containing the epoch and a list of learned unique identifiers (UIDs). A UID identifies a relational instance in the dataset.
  • Merge the outputs of each run into a single json file with the format: {str(UID): int(epoch_leared), ...}. For example: {"39582": 2, "49243": 12, ...}
  • This merged file will replace data/erica_data/order_dict_augmented.json in Stage 2 of training.

② Stage 2 – Fine-grained contrastive pre-training:

  • To pre-train a model using Fine-grained contrastive learning (FineCL), run:
CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_2.yaml

③ Stage 3 – Downstream fine-tuning:

  • Document-level RE fine-tuning:

    • For document-level fine-tuning, we load the pre-trained model from Stage 2 and train on annotated data from the DocRED dataset.
      CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_3_doc_re.yaml
      
      To evaluate the performance of the fine-tuned model from Stage 3, run:
      CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_3_eval.yaml
      
  • Sentence-level RE fine-tuning:

    • For sentence-level fine-tuning, we again load the pre-trained model from Stage 2 and train on the following annotated datasets:

      CUDA_VISIBLE_DEVICES=0 python run.py experiment=stage_3_sentence_re.yaml
      




Credits: This work began as a fork of the ERICA repository. If you found our code useful, please consider citing:

@inproceedings{hogan-etal-2022-fine,
    title = "Fine-grained Contrastive Learning for Relation Extraction",
    author = "Hogan, William  and
      Li, Jiacheng  and
      Shang, Jingbo",
    booktitle = "Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing",
    month = dec,
    year = "2022",
    address = "Abu Dhabi, United Arab Emirates",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.emnlp-main.71",
    pages = "1083--1095",
    abstract = "Recent relation extraction (RE) works have shown encouraging improvements by conducting contrastive learning on silver labels generated by distant supervision before fine-tuning on gold labels. Existing methods typically assume all these silver labels are accurate and treat them equally; however, distant supervision is inevitably noisy{--}some silver labels are more reliable than others. In this paper, we propose fine-grained contrastive learning (FineCL) for RE, which leverages fine-grained information about which silver labels are and are not noisy to improve the quality of learned relationship representations for RE. We first assess the quality of silver labels via a simple and automatic approach we call {``}learning order denoising,{''} where we train a language model to learn these relations and record the order of learned training instances. We show that learning order largely corresponds to label accuracy{--}early-learned silver labels have, on average, more accurate labels than later-learned silver labels. Then, during pre-training, we increase the weights of accurate labels within a novel contrastive learning objective. Experiments on several RE benchmarks show that FineCL makes consistent and significant performance gains over state-of-the-art methods.",
}