Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Preprints caught #43

Open
virginiascarlett opened this issue Jul 31, 2024 · 3 comments
Open

Preprints caught #43

virginiascarlett opened this issue Jul 31, 2024 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@virginiascarlett
Copy link
Collaborator

Add to https://dis.int.janelia.org/dois_preprint browser page: how many preprints did you catch with fuzzy matching that weren't in crossref metadata?

@virginiascarlett virginiascarlett added the enhancement New feature or request label Jul 31, 2024
@virginiascarlett
Copy link
Collaborator Author

On second thought, this doesn't have to be in the GUI. If you could just tell me a ball park number, that's good enough.

@virginiascarlett
Copy link
Collaborator Author

The pie chart looks good. I was expecting the fuzzy match to capture more preprints, hmm

@robsv
Copy link
Contributor

robsv commented Aug 2, 2024

I plan on getting some diagnostic code into the preprint assigner to see if I should tweak the threshold.
For the match, I'm currently using token_sort_ratio with a threshold of 90% for both title and author name(s). I might switch to using separate title/name thresholds.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants