This repository contains Dutch-language clinical corpora for training and validating biomedical natural language processing (NLP) models.
The following resources are currently available:
Name | Task | Original dataset |
---|---|---|
MedMentions | UMLS concept extraction | LINK |
Mantra | UMLS concept extraction | LINK |
Both MedMentions and Mantra were translated from English to Dutch using the Google Cloud Translation API and the OpenAI GPT-4 API. The procedure is described in detail in our preprint.
License: Creative Commons Attribution 4.0 International (CC BY 4.0)