-
-
Notifications
You must be signed in to change notification settings - Fork 228
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
2020-12-15: Japanese glosario page is not alphabetical order #254
Comments
The order of the entries is determined by a I can see two options for solving it:
Perhaps @fmichonneau or @gvwilson will have another idea? |
It looks like option 1 is going to be the way to go. |
I think option 1 is certainly easier to implement in the short-term. I can take a stab at writing Python code to do this, though I may need someone to verify the output in those languages. If I do this, unless someone has an objection, I'll probably try to remove the |
Hi @baileythegreen @fmichonneau , Thank you for your attention and suggestions. A new sort logic will definitely help for non-alphabet languages. I am happy to check Japanese output. Please let me know if there is anything I can help. |
@masamiy It'll probably take me a couple of days to get to it because I have some deadlines coming up, but I'll tag you when I do, unless @fmichonneau beats me to it. |
Take your time :) |
@masamiy I think the issue is a mixture of Romaji, Katakana, and Kanji in the terms defined. It's sorting them correctly (as expected for this). I see two solutions:
Either way furigana (kanji readings) would need to be supported to sort by them and added for each entry (for option No. 2 this would be a need a new slot I think). You cannot parse furigana from Kanji automatically (although some databases already exist). I think it is easier to specify the intended reading for each entry. |
@TomKellyGenetics @masamiy @baileythegreen This has taken a while to address, but please check the output on the new Glosario site to raise any sorting issues that still need addressing! |
https://carpentries.github.io/glosario/ja/ lists Japanese entries based on the first character of the entry. It means that the entries are not categorised by Japanese alphabet (nor English alphabet), but characters. The last entry, 'function', should be top of the current list as it is read as 'kansuu' if terms are categorise by Japanese alphabet.
As there are 46+ characters in Japanese alphabet, I feel we need to have some indexing strategy.
The text was updated successfully, but these errors were encountered: