Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a filter type which is context aware, e.g. localized voting #11

Open
gregdan3 opened this issue Sep 2, 2024 · 0 comments
Open
Labels
enhancement New feature or request

Comments

@gregdan3
Copy link
Owner

gregdan3 commented Sep 2, 2024

There is a small set of words which are currently not able to be scored in their own context, because they appear as words in other languages too often.

For example, the quotative particle "to" clashes with both "to" and "too" in English, which means I can't even include it in the dictionary.

It would be more apt if the score of these words were dependent on the words near them. For example, in "i talk to him", the neighbors of "to" match alphabetically and not at all, respectively; this would be grounds to mark "to" at zero.

However, there are some complexities: "to" in particular is not helped so much by this, because it's often at the end of the sentence, lacking one following neighbor. This could be helped by pulling the nearest three neighbors if there are any, and then considering any missing neighbors to just be zeroes.

@gregdan3 gregdan3 added the enhancement New feature or request label Sep 2, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant