EVA (Extensive training set and Video-helpful evaluation set for Ambiguous subtitles translation) is a multimodal machine translation (MMT) dataset containing 852k Japanese-English parallel subtitle pairs, 520k Chinese-English parallel subtitle pairs, and corresponding video clips collected from movies and TV episodes. It provides:
- An extensive training set
- A video-helpful evaluation set in which subtitles are ambiguous and videos are guaranteed helpful for disambiguating the source subtitles
An example from the evaluation set (SRC: ambiguous source subtitle; REF: reference translation; NMT: text-only neural machine translation output; VMT: video-guided translation output). The Japanese source きれいだ ("beautiful") omits its subject, which the video helps recover:

- SRC: きれいだ
- REF: You look beautiful
- NMT: Beautiful
- VMT: You look beautiful
| Split | Train | Validation | Test |
|---|---|---|---|
| #samples (Ja-En) | 848,164 | 2,138 | 2,138 |
| #samples (Zh-En) | 516,733 | 1,470 | 1,470 |
| Video-helpful | | ✓ | ✓ |
The JSON files provide the mapping from video clips to parallel subtitle pairs, with the following structure:
"train"/"test"/"val": {
video_file_name: {
{ "ja": Japanese_subtitle },
{ "en": English_subtitle }
}
}
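A minimal sketch of reading this mapping in Python, assuming a file name such as `eva_ja-en.json` (the actual file names may differ in the release you receive):

```python
import json

# Load the mapping from video clips to parallel subtitle pairs.
# NOTE: the file name below is hypothetical; adjust it to the
# actual JSON file distributed with the dataset.
with open("eva_ja-en.json", encoding="utf-8") as f:
    mapping = json.load(f)

# Each split maps a video clip file name to its subtitle pair.
for video_file_name, pair in mapping["test"].items():
    src = pair["ja"]  # Japanese source subtitle
    ref = pair["en"]  # English reference translation
    print(f"{video_file_name}: {src} -> {ref}")
```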
Due to the copyright of the included videos, we are unable to provide direct download links. However, if you are interested in accessing the dataset, please send an email to l763541405@gmail.com and include your academic affiliation; we will respond as soon as possible.
Please note that by downloading the dataset, you agree to the following conditions:
- Do not re-distribute the dataset without our permission.
- The dataset can only be used for research purposes. Any other use is explicitly prohibited.
If you find this dataset helpful, please cite our publication "Video-Helpful Multimodal Machine Translation":
```bibtex
@inproceedings{li-etal-2023-video,
    title = "Video-Helpful Multimodal Machine Translation",
    author = "Li, Yihang and
      Shimizu, Shuichiro and
      Chu, Chenhui and
      Kurohashi, Sadao and
      Li, Wei",
    editor = "Bouamor, Houda and
      Pino, Juan and
      Bali, Kalika",
    booktitle = "Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing",
    month = dec,
    year = "2023",
    address = "Singapore",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2023.emnlp-main.260",
    doi = "10.18653/v1/2023.emnlp-main.260",
    pages = "4281--4299",
    abstract = "Existing multimodal machine translation (MMT) datasets consist of images and video captions or instructional video subtitles, which rarely contain linguistic ambiguity, making visual information ineffective in generating appropriate translations. Recent work has constructed an ambiguous subtitles dataset to alleviate this problem but is still limited to the problem that videos do not necessarily contribute to disambiguation. We introduce EVA (Extensive training set and Video-helpful evaluation set for Ambiguous subtitles translation), an MMT dataset containing 852k Japanese-English parallel subtitle pairs, 520k Chinese-English parallel subtitle pairs, and corresponding video clips collected from movies and TV episodes. In addition to the extensive training set, EVA contains a video-helpful evaluation set in which subtitles are ambiguous, and videos are guaranteed helpful for disambiguation. Furthermore, we propose SAFA, an MMT model based on the Selective Attention model with two novel methods: Frame attention loss and Ambiguity augmentation, aiming to use videos in EVA for disambiguation fully. Experiments on EVA show that visual information and the proposed methods can boost translation performance, and our model performs significantly better than existing MMT models.",
}
```
If you have any questions about this dataset, please contact l763541405@gmail.com.