We reproduce here some of the datasets on which PARIS was tested.
The Ontology Alignment Evaluation Initiative provides several knowledge bases for alignment. We used:
- OAEI Persons
- The ontologies are available at the homepage of the OAEI 2010 team. We reproduce them here.
- The gold standard for instances, classes, and relations can trivially be determined from the URIs.
- OAEI Restaurants
- The ontologies are available at the homepage of the OAEI 2010 team. We reproduce them here. Note that this version has been modified by us to fix errors in the structure of the dataset.
- The gold standard for instances, classes, and relations can trivially be determined from the URIs.
YAGO and DBpedia are two large general-purpose knowledge bases.
Since the sources have changed in both format and content, and since the original data sources are no longer available, we are currently unable to reproduce the original experimental results!
For YAGO, we used the core version in native text format. For DBpedia, we combined "DBpedia Ontology" + "Ontology Infobox Types" + "Ontology Infobox Properties". The gold standard for the instances can be established by simply comparing the URIs.
The gold standard for relations is here. It contains a tab-separated list of subproperty-superproperty pairs. Note that this gold standard is incomplete and can serve only for precision!
The gold standard for classes is here. It contains a tab-separated list of subclass-superclass pairs, together with the notion TRUE or FALSE. Note that this gold standard is incomplete and can serve only for computing precision, not recall!
We also matched YAGO and IMDb, a large movie database.
Again, these sources have changed and we are currently unable to reproduce the original experimental results!
The IMDB dataset is available upon request.
We also make available
- The gold standard for relations. The file contains a tab-separated list of subproperty-superproperty pairs.
- The gold standard for instances. The file is a tab-separated list of YAGO names and IMDB person/movie identifiers.
- The gold standard for classes. The file contains a tab-separated list of subclass-superclass pairs, together with the notion TRUE or FALSE. Note that this gold standard is incomplete and can serve only to compute precision, not recall!
We provide here the mappings between the concepts, instances and relations of YAGO and DBpedia, as computed by PARIS in 2012. These mappings are not 100%, as they are the output of an automated process.
- Matchings of instances/individuals between YAGO and DBpedia as TSV with precision values.
- Mappings of the classes/concepts between YAGO and DBpedia as TSV with precision, as well as in RDF/TTL cut at 60% precision. DBpedia uses multiple types of classes. The mappings computed by PARIS concern the classes of the manually constructed DBpedia ontology with YAGO classes. These mappings are asymmetric rdfs:subclassOf-mappings in both directions. These mappings are not of very good quality.
- Mappings of the relations/properties between YAGO and DBpedia as TSV with precision, as well as in RDF/TTL cut at 40% precision. These are asymmetric rdfs:subPropertyOf mappings between the YAGO relations and the relations of the manual ontology of DBpedia.