zingg-0.3.2-SNAPSHOT-spark-3.1.2
What's Changed
- Giving same cluster id to all records linked from multiple sources #108 by @navinrathore in #109
- zingg.sh path included in PATH env var in Dockerfile #111 by @navinrathore in #112
- locale set to C.UTF-8 in Dockerfile #116 by @navinrathore in #122
- Documentation broken links #118 by @navinrathore in #123
- z_source column is added at the end of the dataset in alignDupes by @navinrathore in #119
- Labeller - if it is not known whether it is a match, the similarity score should not be printed #114 by @navinrathore in #125
- new febrl models by @navinrathore in #121
- Blocking trees are saved in a parquet file #82 by @navinrathore in #120
- unionByName() with allowMissingColumns=true for training data by @navinrathore in #127
- handling null values of datatype Double #95 by @navinrathore in #126
Full Changelog: v0.3.1...v0.3.2