Releases: data61/anonlink-entity-service
v1.11.2
v1.11.1
- Include multiparty linkage tutorial/example.
- Tightened up how we use a database connection from the flask app.
- Deployment and logging documentation updates.
Documentation now available on readthedocs - https://anonlink-entity-service.readthedocs.io/en/stable/
v1.11.0
- Adds support for multiparty record linkage.
- Logging is now configurable from a file.
Other improvements
- Another tutorial for directly using the REST api was added.
- K8s deployment updated to use
3.15.0
Postgres chart.
Postgres configuration now uses aglobal
namespace
so sub-charts can all use the same configuration as documented
here. - Jenkins testing now fails if the benchmark exits incorrectly or if the benchmark
results contain failed results. - Jenkins will now execute the tutorials notebooks and fail if any cells error.
Images are available on quay.io:
docker pull quay.io/n1analytics/entity-app:v1.11.0
v1.10.0
- Updates Anonlink and switches to using Anonlink's default format for serialization
of similarity scores. - Sorts similarity scores before solving, improving accuracy.
- Uses Anonlink's new API for similarity score computation and solving. Note this increases accuracy at the expense of impacting speed.
- Add support for using an external Postgres database.
- Added optional support for Redis discovery via the sentinel protocol.
- Kubernetes deployment no longer includes a default postgres password.
Ensure that you set your ownpostgresqlPassword
. - The Kubernetes deployment documentation has been extended.
Prebuilt docker images:
quay.io/n1analytics/entity-app:v1.10.0
quay.io/n1analytics/entity-benchmark:v0.1.0
quay.io/n1analytics/entity-nginx:v1.4.0
v1.9.4
v1.9.3
Minor release with fixes for kubernetes deployments.
- Redis can now be used in highly available mode. Includes upstream fix where the redis sentinels crash.
- The custom kubernetes certificate management templates have been removed.
- Minor updates to the kubernetes resources. No longer using beta apis.
v1.9.2
v1.9.1
- Adds support for (almost) arbitrary sized encodings. A minimum and maximum size (in bytes) can be set at deployment time, although note
anonlink
currently requires the size to be a multiple of 8. - Adds support for opentracing with Jaeger.
- improvements to the benchmarking container
- internal refactoring of tasks
v1.9.0
- minio and redis services are now optional for kubernetes deployment.
- Introduction of a high memory worker and associated task queue.
- Fix issue where we could start tasks twice.
- Structlog now used for celery workers.
- CI now tests a kubernetes deployment.
- Many Jenkins CI updates and fixes including k8s deployment testing.
- Updates to Jupyter notebooks and docs.
- Updates to Python and Helm chart dependencies and docker base images.
- Addition of a benchmarking service
Version 1.8.1
The first release featuring a new API, the 1.8 series simplifies the analysts workflow by introducing the concept of projects
which reuse the same CLKs between runs
. These breaking changes to the REST API allow an analyst to reuse uploaded CLKs. Instead of a linkage project only having one result, we introduce a new sub-resource runs
. A project holds the schema and CLKs from all data providers; and multiple runs can be created with different parameters. A run has a status
and a result
endpoint. Runs can be queued before the CLK data has been uploaded.
We also introduced changes to the result types:
- The result type
permutation
, which was producing permutations and an encrypted mask, was removed. - And the result type
permutation_unecrypyted_mask
was renamed topermutations
.
Brief summary of API changes:
- essentially the
mapping
endpoint has been renamed toprojects
- To carry out a linkage computation you must post to a project's
runs
endpoint: `/api/v1/project/<PROJECT_ID>/runs - Results are now accessed under the
runs
endpoint:/api/v1/project/<PROJECT_ID>/runs/<RUN_ID>/result
- result type
permutation_unecrypyted_mask
was renamed topermutations
- result type
permutation
was removed
For all the updated API details check the Open API document (docs/api.rst
)
Other improvements
- Improve system stability while handling large intermediate results.
- Intermediate results are now stored in files instead of in Redis. This permits us to stream them instead of loading everything into memory.
- The documentation is now served at the root.
- The flower monitoring tool for celery is now included with the docker-compose deployment.
Note this will be disabled for production deployment with kubernetes by default. - The docker containers have been migrated to alpine linux to be much leaner.
- Substantial internal refactoring - especially of views.
- Move to pytest for end to end tests.