Releases: moj-analytical-services/splink
Releases · moj-analytical-services/splink
v3.9.4
What's Changed
- Add dataset table generation script to docs workflow by @ADBond in #1399
- ccl table fix by @RossKen in #1400
- Bump minimum duckdb version to 0.8.0 by @ADBond in #1405
- Postgres docs by @ADBond in #1404
- SL docs edits by @samnlindsay in #1402
- fix docs links to point to master by @ThomasHepworth in #1419
- [FEAT] Detect equi-join conditions in a blocking rule to count the number of comparisons without needing to perform the join by @RobinL in #1388
- fix else_level examples - no parameter needed by @ADBond in #1423
- remove survey in banner by @RossKen in #1432
- FIX: add parens to blocking rules by @NickCrews in #1422
- run actions on
_dev
branches by @ThomasHepworth in #1433 - [FEAT] Blocking Rule helper functions by @ThomasHepworth in #1370
- Update splink demos by @RossKen in #1407
- Contributing guide by @RossKen in #1394
- ref: Remove pre-check for path when loading file by @NickCrews in #1438
- add blocking rule library to existing functions by @ThomasHepworth in #1436
- Blocking Topic Guides by @RossKen in #1389
- Remove_pkg_resources by @NickCrews in #1425
- String comparisons doc text formatting by @samnlindsay in #1445
- V3.9.4 by @RossKen in #1458
Full Changelog: v3.9.3...v3.9.4
v3.9.3
What's Changed
- Fellegi sunter topic guide by @RossKen in #1318
- [MAINT] Backend agnostic comparison composition tests by @ThomasHepworth in #1341
- 1109 athena datediff by @RossKen in #1338
- Extend CacheDictWithLogging so that it also stores all tables materialises by Splink, not just the named ones (Issue 1059) by @RobinL in #1061
- Issue 1225 - Poor performance of estimate u in a link_only job by @RobinL in #1359
- Add a timer into debug mode by @ThomasHepworth in #1367
- lint for
print()
statements by @ThomasHepworth in #1374 - Expectation maximisation speedup option by @aymonwuolanne in #1369
- record linkage topic guides by @RossKen in #1297
- add icons to docs and generated tables by @RossKen in #1353
- [FEAT] Splink Labelling tool beta by @RobinL in #1208
- Docs navigation improvements by @RossKen in #1381
- Postgres bug fixes by @ADBond in #1335
- Txt replacement bash script by @ThomasHepworth in #1378
- Basic settings validator by @ThomasHepworth in #1252
- Add summary of each backend to docs by @RossKen in #1385
- [BUG] fix how nulls are registered in pyspark when loading a pandas df by @ThomasHepworth in #1373
- Tweak readme by @RobinL in #1393
- Splink dummy data by @ADBond in #1358
- Release v3.9.3 by @RossKen in #1398
New Contributors
- @aymonwuolanne made their first contribution in #1369
Full Changelog: v3.9.2...v3.9.3
v3.9.2
What's Changed
- Postgres Linker by @hanslemm in #1191
- Fix altair dependency - redo by @RossKen in #1308
- Add Google analytics to docs by @RossKen in #1313
- Add docs on udfs in sqlite and duckdb by @RobinL in #1317
- satisfy the linter by @ThomasHepworth in #1322
- Adjust import paths to remove backend prefixes by @ThomasHepworth in #1320
- Initial commit for email comparison level feature. by @sama-ds in #1277
- migrate duckdbless action to release by @ThomasHepworth in #1323
- fix symlinks action by @ThomasHepworth in #1324
- make datediff tests backend agnostic by @ThomasHepworth in #1294
- Postgres backend by @ADBond in #1251
- Fix calculation of link-only sample size for u-training by @ADBond in #1312
- Sqlite - fix default connect and levenshtein by @ADBond in #1336
- Altair 5: All Splink charts become alt.Chart() objects rather than custom VegaLiteNoValidate by @RobinL in #1315
- Update actions by @zslade in #1342
- Remove redundant headers of PR template by @RossKen in #1347
- add banner pointing to google form by @RossKen in #1349
- updating splink version by @aliceoleary0 in #1351
New Contributors
Full Changelog: v3.9.1...v3.9.2
v3.9.1
What's Changed
- Update releases.md by @zslade in #1273
- Readme formatting by @RobinL in #1274
- Use
tmp_path
in deterministic link test by @ADBond in #1275 - allow lowercase postcodes by @RossKen in #1263
- update linting bash script by @ThomasHepworth in #1290
- clean datediff code by @ThomasHepworth in #1291
save_settings_to_json
->save_model_to_json
by @RossKen in #1283- Add PR template by @RossKen in #1253
- Settings Topic Guide by @RossKen in #1292
- Comparison pseudo symlinks by @ThomasHepworth in #1279
- Update parameter_estimate_comparisons.json by @samnlindsay in #1301
Full Changelog: v3.9.0...v3.9.1
v3.9.0
What's Changed
- Docs upgrades by @RossKen in #1222
- Adjust table registration by @ThomasHepworth in #1219
- Add regex extract functionality to comparisons by @zslade in #1203
- Issue 1227 - Allow materialisation of df_representatives with no _ suffix by @RobinL in #1228
- 1189 tf topic guide by @RossKen in #1214
- Write splinkdf to csv parquet by @ThomasHepworth in #1194
- pretty print erroneous sql by @ThomasHepworth in #1238
- Cleaned up comparison levels documentation to be a multi-line code bl… by @mastratton3 in #1236
- Postcode comparison template by @zslade in #1230
- Forename Surname ctl by @RossKen in #1174
- 430 Term frequency adjustment chart by @samnlindsay in #1226
- 1111 add damerau levenshtein by @RossKen in #1181
- Duckdbless splink by @ThomasHepworth in #1244
term_frequency_adjustments_names
->term_frequency_adjustments
by @ThomasHepworth in #1254- 1175 deterministric clusters by @RossKen in #1213
- Update citations by @RossKen in #1255
- update benchmarking action to run on PR merge by @ThomasHepworth in #1262
- Backend-agnostic testing by @ADBond in #1205
- Adjust cl imports by @ThomasHepworth in #1248
- Add Topic guide for choosing comparisons & thresholds by @RossKen in #1198
- tweak duckdbless action by @ThomasHepworth in #1270
- fix duckdbless reqs url by @ThomasHepworth in #1271
- New release v3.9.0 by @zslade in #1272
New Contributors
- @mastratton3 made their first contribution in #1236
Full Changelog: v3.8.1...v3.9.0
v3.8.1
What's Changed
- Releases dev guide by @RossKen in #1202
- Fix link only cartesian calc by @ThomasHepworth in #1204
Full Changelog: v3.8.0...v3.8.1
v3.8.0
What's Changed
- Make the example notebooks run faster by @RobinL in #1160
- Add tags by @RossKen in #1165
- Benchmark timeseries commit workflow to run only in upstream repo by @ADBond in #1152
- Create
_register_input_tables
method in our main linker class by @ThomasHepworth in #1172 - Documentation examples by @RossKen in #1159
- Fix autoblack checkout step by @ADBond in #1169
- Add emojis rather than bullets by @RobinL in #1180
- Add option to pass seed into
estimate_u_using_random_sampling
by @RossKen in #1161 - Adjust the outputs of
truth_space_table_from_labels_with_predictions_sqls
to be lowercase by @ThomasHepworth in #1183 - Improve Logging by @NickCrews in #1084
- Improve logging by @ThomasHepworth in #1186
- Add docs for Feature Engineering by @RossKen in #1178
- Add UDFs dev guide by @RossKen in #1182
- [BUG] Fix source dataset issue when running link jobs by @ThomasHepworth in #1193
- 1107 add jaro similarity by @RossKen in #1167
- migrate ComparisonProperties by @ThomasHepworth in #1195
- revert to old comparison script structure by @RossKen in #1197
- 1030 option for auto typecasting datediff by @aliceoleary0 in #1162
- Athena updates by @ThomasHepworth in #1187
- Release 3.8.0 by @RossKen in #1201
New Contributors
- @aliceoleary0 made their first contribution in #1162
Full Changelog: v3.7.3...v3.8.0
v3.7.3
What's Changed
- Linting update by @ADBond in #1131
- Fix autoblack workflow for forks by @ADBond in #1133
- remove invalid comma by @wilko77 in #1143
- Improve readme what does splink do by @RobinL in #1129
- Improve copy writing on readme by @RobinL in #1144
- Improve readme images for clarity by @RobinL in #1145
- Attempt to make examples notebooks action faster by @RobinL in #1147
- ComparisonLevel composition v2 by @NickCrews in #1114
- Update run_demos_examples.yml by @RobinL in #1154
- 1151 fix term frequencies for cols reversed by @afua-moj in #1156
- Add previously breaking tf case to tests by @RossKen in #1157
- Version 3.7.3 by @RossKen in #1158
New Contributors
Full Changelog: v3.7.2...v3.7.3
v3.7.2
v3.7.1
What's Changed
- Fix a couple of typos by @ADBond in #1123
- Fix athena linker invalid reference by @davidschrooten in #1135
- Fix clustering in issue 1136 by @RobinL in #1137
New Contributors
- @davidschrooten made their first contribution in #1135
Full Changelog: v3.7.0...v3.7.1