Chinese-Summary-Generation

Text summarization based on T5, BART, and CPT

├── README.md
├── requirements.txt
├── scripts
│   ├── predict.sh
│   └── train.sh
├── src
│   ├── main.py
│   ├── modeling_cpt.py
│   ├── predict.py
│   └── SFT_utils.py

Setup

pip install -r requirements.txt

Train

cd scripts
bash train.sh

Inference

cd scripts
bash predict.sh

Others

SFT stands for squeeze and fine-tuning: copy a subset of layers from a pretrained model, then fine-tune the copied layers.
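
The layer-copying step can be illustrated with a minimal sketch. This is not the code in src/SFT_utils.py; the function name `squeeze_state_dict`, the `encoder.layers.` key prefix, and the toy weights are all assumptions made for illustration. It works on a plain dict shaped like a model state dict, keeps only the chosen layer indices, and renumbers them so they fit a shallower model.

```python
import re


def squeeze_state_dict(state_dict, keep_layers, prefix="encoder.layers."):
    """Keep only the listed layer indices under `prefix`, renumbered 0..k-1.

    Hypothetical helper: keys that do not match `prefix` (embeddings,
    output heads, and so on) are copied unchanged.
    """
    # Map each kept layer index to its new, contiguous index.
    remap = {old: new for new, old in enumerate(keep_layers)}
    pattern = re.compile(r"^" + re.escape(prefix) + r"(\d+)\.(.+)$")

    squeezed = {}
    for key, value in state_dict.items():
        match = pattern.match(key)
        if match is None:
            squeezed[key] = value  # not a per-layer parameter
            continue
        old_index = int(match.group(1))
        if old_index in remap:
            new_key = f"{prefix}{remap[old_index]}.{match.group(2)}"
            squeezed[new_key] = value
        # Parameters of layers that are not kept are dropped.
    return squeezed


# Toy "pretrained" weights: 4 layers, with strings standing in for tensors.
pretrained = {"embed.weight": "E"}
for i in range(4):
    pretrained[f"encoder.layers.{i}.fc.weight"] = f"W{i}"

# Keep the first and last layer; they become layers 0 and 1.
small = squeeze_state_dict(pretrained, keep_layers=[0, 3])
print(sorted(small))
```

With real PyTorch weights, a dict produced this way could presumably be loaded into a shallower model of the same architecture via `load_state_dict`, after which the copied layers are fine-tuned as usual. Which layers to keep is a design choice that this sketch leaves open.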