[WIP] EstNLTK analyzer #818

Draft
wants to merge 5 commits into
base: main

Conversation

osma (Member) commented Nov 12, 2024

This PR adds a new analyzer to support lemmatization using EstNLTK, a natural language analysis toolkit for the Estonian language.

Note that the indirect dependencies of EstNLTK are quite large: around 500MB of libraries.
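
As an illustration only (this is not the code from this PR), lemmatization with EstNLTK could be wrapped roughly as follows. The function names `estnltk_lemmas` and `tokenize_words`, the `min_length` parameter, and the token filter are hypothetical. The EstNLTK calls (`Text`, `tag_layer`, `morph_analysis.lemma`) are assumed to behave as in EstNLTK 1.6+ and should be checked against the installed version. The import is done lazily so that the large dependency is only loaded when the analyzer is actually used.

```python
from typing import Callable, List


def estnltk_lemmas(text: str) -> List[str]:
    """Return one lemma per word using EstNLTK (assumed 1.6+ API)."""
    # Lazy import: EstNLTK is a heavy optional dependency.
    from estnltk import Text

    doc = Text(text.strip())
    doc.tag_layer(["morph_analysis"])
    # A word can have several candidate analyses; keep the first lemma.
    return [candidates[0] for candidates in doc.morph_analysis.lemma]


def tokenize_words(
    text: str,
    lemmatize: Callable[[str], List[str]] = estnltk_lemmas,
    min_length: int = 3,
) -> List[str]:
    """Lemmatize text and drop short or non-alphabetic tokens.

    The filter here is a hypothetical stand-in for whatever token
    validation the host application applies.
    """
    return [
        lemma
        for lemma in lemmatize(text)
        if len(lemma) >= min_length and any(ch.isalpha() for ch in lemma)
    ]
```

Passing the lemmatizer as a parameter keeps the filtering logic testable without installing EstNLTK, for example `tokenize_words("Tere maailm", lemmatize=lambda t: t.lower().split())`.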

osma self-assigned this Nov 12, 2024

sonarcloud bot commented Nov 12, 2024

codecov bot commented Nov 12, 2024

Codecov Report

Attention: Patch coverage is 92.85714% with 2 lines in your changes missing coverage. Please review.

Project coverage is 99.58%. Comparing base (4809561) to head (d2a0051).

Files with missing lines | Patch % | Lines
annif/analyzer/__init__.py | 60.00% | 2 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #818      +/-   ##
==========================================
- Coverage   99.60%   99.58%   -0.03%     
==========================================
  Files          93       95       +2     
  Lines        7134     7162      +28     
==========================================
+ Hits         7106     7132      +26     
- Misses         28       30       +2     
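
The percentages in the report can be re-derived from the line counts in the diff above. The following is a small sketch of that arithmetic, assuming that patch coverage is the added hits divided by the added lines. The variable names are illustrative, and the counts are copied from the report.

```python
# Line counts taken from the coverage diff (base = main, head = #818).
base_lines, base_hits = 7134, 7106
head_lines, head_hits = 7162, 7132

# Lines and hits introduced by the patch.
patch_lines = head_lines - base_lines  # 28
patch_hits = head_hits - base_hits     # 26
patch_missing = patch_lines - patch_hits

base_coverage = 100 * base_hits / base_lines
head_coverage = 100 * head_hits / head_lines
patch_coverage = 100 * patch_hits / patch_lines

print(f"patch coverage: {patch_coverage:.5f}% ({patch_missing} lines missing)")
print(f"head coverage:  {head_coverage:.2f}%")
print(f"change:         {head_coverage - base_coverage:+.2f}%")
```

The two uncovered lines account for the whole drop in project coverage, since the rest of the code base is unchanged.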

