HET-MC

This is the implementation of Summarizing Medical Conversations via Identifying Important Utterances at COLING 2020.

You can e-mail Yuanhe Tian at yhtian@uw.edu, if you have any questions.

🔥 News 🔥

We recently released a large language model for the Chinese medical domain named ChiMed-GPT, which is trained on the medical dialog data. For more information, please visit our GitHub Repo.

Citation

If you use or extend our work, please cite our paper at COLING 2020.

@inproceedings{song-etal-2020-summarizing,
    title = "Summarizing Medical Conversations via Identifying Important Utterances",
    author = "Song, Yan and Tian, Yuanhe and Wang, Nan and Xia, Fei",
    booktitle = "Proceedings of the 28th International Conference on Computational Linguistics",
    month = dec,
    year = "2020",
    address = "Barcelona, Spain (Online)",
    pages = "717--729",
}

Requirements

Our code works with the following environment.

python=3.7
pytorch=1.3

Dataset

To obtain the data, you can go to data_preprocessing directory for details.

Downloading BERT, ZEN and HET-MC

In our paper, we use BERT (paper) and ZEN (paper) as the encoder.

For BERT, please download pre-trained BERT-Base Chinese from Google or from HuggingFace. If you download it from Google, you need to convert the model from TensorFlow version to PyTorch version.

For ZEN, you can download the pre-trained model from here.

For HET-MC, you can download the models we trained in our experiments from here (passcode: b1w1).

Run on Sample Data

Run run_sample.sh to train a model on the small sample data under the sample_data directory.

Training and Testing

You can find the command lines to train and test models in run.sh.

Here are some important parameters:

--do_train: train the model.
--do_test: test the model.
--use_bert: use BERT as token encoder.
--use_zen: use ZEN as token encoder.
--bert_model: the directory of pre-trained BERT/ZEN model.
--use_memory: use memories.
--utterance_encoder: the utterance encoder to be used (should be one of none, LSTM, and biLSTM).
--lstm_hidden_size: the size of hidden state in the LSTM/biLSTM utterance encoder.
--decoder: the decoder to be used (can be either crf or softmax).
--use_party: use the speaker role information.
--use_department: use the department information.
--use_disease: use disease information
--model_name: the name of model to save.

To-do List

Release the code to get the data.
Regular maintenance.

You can leave comments in the Issues section, if you want us to implement any functions.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
data_preprocessing		data_preprocessing
pytorch_pretrained_bert		pytorch_pretrained_bert
pytorch_pretrained_zen		pytorch_pretrained_zen
rouge		rouge
sample_data		sample_data
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
hetmc_eval.py		hetmc_eval.py
hetmc_helper.py		hetmc_helper.py
hetmc_model.py		hetmc_model.py
hetmc_run.py		hetmc_run.py
run.sh		run.sh
run_sample.sh		run_sample.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HET-MC

🔥 News 🔥

Citation

Requirements

Dataset

Downloading BERT, ZEN and HET-MC

Run on Sample Data

Training and Testing

To-do List

About

Releases

Packages

Languages

License

synlp/HET-MC

Folders and files

Latest commit

History

Repository files navigation

HET-MC

🔥 News 🔥

Citation

Requirements

Dataset

Downloading BERT, ZEN and HET-MC

Run on Sample Data

Training and Testing

To-do List

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages