Releases: data61/anonlink
0.12.5a2
Test binary wheel release for 0.12.5a1 to PyPI
Add alpha tag for 0.12.5a1 to test release to PyPI
0.12.4
0.12.3
0.12.2
0.12.0
Release 0.11.2
- Fixes an issue that caused the loading functions in `anonlink.serialization` to raise when loading from Minio objects.
Release 0.11.1
- Fixes an issue that prevented anonlink being installed from a .tar.gz archive. This caused installations from PyPI to fail.
Release 0.11.0
Major changes:
- The greedy solver has been ported to C++, bringing performance improvements. The pure Python version remains in the package as `anonlink.solving.greedy_solve_python`.
- Candidate pair generation now supports blocking. Some blocking functions are defined in `anonlink.blocking`, but custom ones may be defined.
- New utilities assist in the analysis of similarity scores. They can help an analyst find a good threshold or determine the quality of the linkage, and can be found in `anonlink.stats`. Examples are located in `docs/examples/similarity-plots`.
- Adds a probabilistic multiparty greedy solver. It generally yields more accurate results than the previous multiparty greedy solver, and it can infer that some pairs match even if they are below the similarity threshold.
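To illustrate the idea behind a greedy solver, here is a minimal two-party sketch in pure Python. It is not anonlink's implementation: the candidate format `(similarity, index_in_a, index_in_b)` and the function name `greedy_match` are assumptions made for this example only.

```python
from typing import Iterable, List, Tuple

# Hypothetical candidate format for this sketch:
# (similarity, index in dataset A, index in dataset B).
Candidate = Tuple[float, int, int]


def greedy_match(candidates: Iterable[Candidate],
                 threshold: float) -> List[Tuple[int, int]]:
    """Greedily accept the most similar pairs, using each record at most once."""
    matched_a = set()
    matched_b = set()
    pairs = []
    # Visit candidates from most to least similar.
    for sim, i, j in sorted(candidates, key=lambda c: c[0], reverse=True):
        if sim < threshold:
            break  # every remaining candidate is below the threshold
        if i in matched_a or j in matched_b:
            continue  # one of the two records is already linked
        matched_a.add(i)
        matched_b.add(j)
        pairs.append((i, j))
    return pairs


candidates = [(0.95, 0, 1), (0.90, 0, 0), (0.85, 1, 0), (0.40, 2, 2)]
print(greedy_match(candidates, threshold=0.8))  # [(0, 1), (1, 0)]
```

The pair `(0, 0)` is skipped because record 0 of dataset A is already linked by the higher-scoring candidate, and `(2, 2)` falls below the threshold.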
Minor changes:
- The `hamming_similarity` function in the `similarities` module is renamed to `simple_matching_coefficient`, which is the canonical name for this similarity measure. `hamming_similarity` is now a deprecated alias.
- `anonlink.similarities` is now imported whenever `anonlink` is imported. This means that `anonlink.similarities` no longer has to be imported separately.
- The new helper function `anonlink.solving.pairs_from_groups` turns the output of the greedy solver (a set of groups) into an iterable of pairs for bipartite problems.
- Dice similarity functions now support `bytes` as input. Previously the inputs had to be `bitarray`s.
- Mypy typing is enforced in the automated tests.
- Adds a heuristic for estimating the quality of the linkage, `anonlink.stats.nonmatch_index_score`.
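As a rough illustration of the two similarity measures mentioned above, the following pure-Python sketch computes the simple matching coefficient and the Dice coefficient directly on `bytes`. The function names `smc_bytes` and `dice_bytes` are hypothetical and are not part of anonlink; the sketch only shows the definitions of the measures.

```python
def _popcount(data: bytes) -> int:
    """Count the set bits in a byte string."""
    return sum(bin(byte).count("1") for byte in data)


def smc_bytes(a: bytes, b: bytes) -> float:
    """Simple matching coefficient: fraction of bit positions that agree."""
    if len(a) != len(b) or not a:
        raise ValueError("inputs must be non-empty and of equal length")
    differing = _popcount(bytes(x ^ y for x, y in zip(a, b)))
    return 1.0 - differing / (8 * len(a))


def dice_bytes(a: bytes, b: bytes) -> float:
    """Dice coefficient: 2 * |a AND b| / (|a| + |b|), counting set bits."""
    if len(a) != len(b):
        raise ValueError("inputs must be of equal length")
    total = _popcount(a) + _popcount(b)
    if total == 0:
        return 0.0  # convention chosen for this sketch when no bits are set
    common = _popcount(bytes(x & y for x, y in zip(a, b)))
    return 2.0 * common / total


a = bytes([0b11110000])
b = bytes([0b11000000])
print(smc_bytes(a, b))   # 0.75: six of the eight bit positions agree
print(dice_bytes(a, b))  # 2 * 2 / (4 + 2), roughly 0.667
```

The simple matching coefficient counts agreeing zeros as well as agreeing ones, whereas the Dice coefficient only rewards shared set bits.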
Release 0.10.0
Major changes:
- Adds ability to serialise similarities into an iterable of bytes, instead of into a stream.
- Similarly, files with serialised similarities can now be serialised into an iterable of bytes.
Minor changes:
- Some flaky tests were adjusted to allow occasional mismatches.
- Minor changes to type annotations.
- The greedy solver is now tested with Hypothesis.
- Use of the old API generates `DeprecationWarning`.
- Similarity serialisation functions that write to file return the number of bytes written.
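For readers unfamiliar with how deprecation warnings surface in Python, the sketch below shows a deprecated wrapper and how a caller can capture the warning. The functions `new_api` and `old_api` are hypothetical stand-ins and do not correspond to any anonlink function.

```python
import warnings


def new_api(x: int) -> int:
    """Hypothetical replacement function."""
    return x * 2


def old_api(x: int) -> int:
    """Hypothetical deprecated wrapper that forwards to new_api."""
    warnings.warn(
        "old_api is deprecated; use new_api instead",
        DeprecationWarning,
        stacklevel=2,  # attribute the warning to the caller's line
    )
    return new_api(x)


# DeprecationWarning is hidden by default in many contexts, so record
# every warning explicitly while calling the old function.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = old_api(21)

print(result)                       # 42
print(caught[0].category.__name__)  # DeprecationWarning
```

Running a test suite with `python -W error::DeprecationWarning` turns such warnings into exceptions, which helps locate remaining uses of a deprecated API.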