Alias merging #82
Comments
What about reading in a dictionary of synonyms? This should be pretty straightforward and we can curate the list later? So, maybe: accept an optional
in real data
It technically counts as a feature, but it's a nice improvement for the data, so I would rebrand it as a bug fix? 💃 |
We probably can't curate a list for all entities, but creating such a list manually once seems like an easy start |
The idea would then be to have the file crowdsourced, ideally opening a repo dedicated to it :D you know, leverage the power of github :P :P |
This is a great idea! We, the mentors, could definitely provide a hand on creating the list manually. When can we start? |
@goldbergtatyana right now. https://github.com/MusicConnectionMachine/dictionaries/blob/master/artist_synonyms you can edit directly in the browser, and then open a PR, which we can later merge :) |
We will need a more authoritative source than manually curating the dictionary. A list of pseudonyms, synonyms and canonical names is available from https://portal.dnb.de. To obtain a list of synonyms for an artist we will need to:
The tasks are:
|
For now, just implement a way to read in the dictionary as previously described. @gyachdav's suggested workflow will hopefully be implemented in some spare time at the hackathon, and it will output a file that conforms to the format of the dictionary file (i.e. preferred name, synonym1, synonym2, ...) |
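To make the "read in the dictionary" step concrete, here is a minimal sketch in Python. It assumes one entity per line in the form `preferred name, synonym1, synonym2, ...`, as described above. The function names `load_synonyms` and `canonical_name`, the case-insensitive matching, and the inline sample line are illustrative assumptions, not the project's actual code or the real contents of the dictionary file.

```python
import csv
import io


def load_synonyms(lines):
    """Map every alias (case-folded) to the preferred name on its line.

    Assumed format, one entity per line:
        preferred name, synonym1, synonym2, ...
    """
    alias_to_preferred = {}
    for row in csv.reader(lines, skipinitialspace=True):
        names = [name.strip() for name in row if name.strip()]
        if not names:
            continue  # skip blank lines
        preferred = names[0]
        for name in names:
            # The first line that mentions an alias wins; later lines do not override it.
            alias_to_preferred.setdefault(name.casefold(), preferred)
    return alias_to_preferred


def canonical_name(name, alias_to_preferred):
    """Return the preferred name for a known alias, else the name unchanged."""
    return alias_to_preferred.get(name.strip().casefold(), name)


# Hypothetical sample line; the real file lives in the dictionaries repo.
sample = io.StringIO(
    "Wolfgang Amadeus Mozart, Mozart, "
    "Joannes Chrysostomus Wolfgangus Theophilus Mozart\n"
)
aliases = load_synonyms(sample)
print(canonical_name("Mozart", aliases))  # prints "Wolfgang Amadeus Mozart"
```

With a real file, `load_synonyms(open(path, encoding="utf-8", newline=""))` would work the same way. Names that are not in the dictionary pass through unchanged, so the lookup is safe to apply to every extracted entity.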
Aliases don't have to be merged into a single DB entry for our use case (@MusicConnectionMachine/group-2). It would suffice if all aliases had the same |
What's the status on merging different aliases for entities? (e.g. "Mozart", "Wolfgang Amadeus Mozart", "Joannes Chrysostomus Wolfgangus Theophilus Mozart")
Do we have any logic in place for this? What's the strategy here?