scikit-learn: machine learning in Python — scikit-learn 0.15.2 documentation
GitHub - datamade/dedupe: A python library for accurate and scaleable data deduplication and entity-resolution.
dedupe - :id: A python library for accurate and scaleable fuzzy matching, record deduplication and entity-resolution.