This project is an interactive entity resolution plugin for Elasticsearch, based on [Duke](https://github.com/larsga/Duke). It uses [Bayesian probabilities](http://en.wikipedia.org/wiki/Bayesian_probability) to estimate how likely it is that two records refer to the same entity, so you can use it as an interactive deduplication engine.
To understand the basics, see the Duke project documentation.
A list of [available comparators](https://github.com/larsga/Duke/wiki/Comparator) can be found on the Duke wiki.
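
To give a feel for the Bayesian approach mentioned above, here is a minimal sketch in Python of how per-field match probabilities can be folded into a single score with a naive Bayes update. It is an illustration only, not the plugin's code: the function names `combine` and `match_probability` are hypothetical, the input probabilities are made up, and it assumes each comparator has already been mapped to a probability strictly between 0 and 1.

```python
def combine(p1: float, p2: float) -> float:
    """Naive Bayes combination of two independent match probabilities.

    Assumes 0 < p < 1 for both inputs; the pair (0, 1) would divide by zero.
    """
    both_match = p1 * p2
    both_differ = (1.0 - p1) * (1.0 - p2)
    return both_match / (both_match + both_differ)


def match_probability(field_probabilities):
    """Fold per-field probabilities into one score, starting from a neutral 0.5."""
    probability = 0.5
    for p in field_probabilities:
        probability = combine(probability, p)
    return probability


# Hypothetical evidence: name agrees strongly, city agrees, phone disagrees.
score = match_probability([0.9, 0.8, 0.3])
print(round(score, 4))
```

Values above 0.5 push the score towards "same entity" and values below 0.5 pull it away, so a single disagreeing field lowers the result without overriding strong agreement elsewhere.
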
This project is licensed under the LGPLv3.
Copyright (c) 2013 Yann Barraud