Read this Medium article for a full discussion.
🎉 🎉 🎉 The Amharic RoBERTa model has been uploaded to Hugging Face: Amharic RoBERTa Model 🎉 🎉 🎉
🎉 🎉 The Amharic FLAIR embedding model is integrated into the FLAIR library as am-forward
🎉 🎉 The model will be available in the next FLAIR release. Details
🎉 🎉 The Amharic Segmenter is released and can be installed with: pip install amseg
🎉 🎉 The Flair-based Amharic NER classifier model is now released: am-flair-ner 🎉 🎉
🎉 🎉 The Flair-based Amharic sentiment classifier model is now released: am-flair-sent 🎉 🎉
🎉 🎉 The Flair-based Amharic POS tagger is now released: am-flair-pos 🎉 🎉
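
Once a FLAIR release that includes am-forward is installed, the embedding should load like any other built-in Flair language model. The following is a minimal sketch, assuming the `flair` package is installed and the model can be downloaded on first use. `embed_tokens` is a hypothetical helper name, not part of FLAIR.

```python
def embed_tokens(text, model_name="am-forward"):
    """Return (token, vector) pairs for an Amharic sentence.

    Assumption: `model_name` is resolvable by the installed flair version.
    """
    # Imported inside the function so this module still loads
    # on machines where flair is not installed.
    from flair.data import Sentence
    from flair.embeddings import FlairEmbeddings

    embedding = FlairEmbeddings(model_name)  # downloads the model on first use
    sentence = Sentence(text)
    embedding.embed(sentence)                # attaches a vector to every token
    return [(token.text, token.embedding) for token in sentence]
```

Calling `embed_tokens` with a short Amharic sentence returns one contextual vector per token. The vector size depends on the released model, so it is not stated here.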
- The NLP tasks for which we built models using the benchmark datasets are described under: Tasks
- NER
- Sentiment
- POS tagging
- The different datasets and resources are available under: Datasets
- Named entity recognition dataset
- POS dataset
- Sentiment dataset
- For Amharic word segmentation and tokenization, check this project: Segmentation
To cite the different Amharic NLP models and resources, use the following paper:
@Article{fi13110275,
AUTHOR = {Yimam, Seid Muhie and Ayele, Abinew Ali and Venkatesh, Gopalakrishnan and Gashaw, Ibrahim and Biemann, Chris},
TITLE = {Introducing Various Semantic Models for Amharic: Experimentation and Evaluation with Multiple Tasks and Datasets},
JOURNAL = {Future Internet},
VOLUME = {13},
YEAR = {2021},
NUMBER = {11},
ARTICLE-NUMBER = {275},
URL = {https://www.mdpi.com/1999-5903/13/11/275},
ISSN = {1999-5903},
DOI = {10.3390/fi13110275}
}
To cite the impacts of homophone normalization, use the following paper:
@InProceedings{ayele2021,
AUTHOR = {Belay, Tadesse Destaw and Ayele, Abinew Ali and Gelaye, Getie and Yimam, Seid Muhie and Biemann, Chris},
TITLE = {Impacts of Homophone Normalization on Semantic Models for Amharic},
booktitle = {Proceedings of the Third International Conference on ICT for Development for Africa (ICT4DA 2021)},
address = {Bahir Dar, Ethiopia}
}
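
Homophone normalization, the subject of the second paper, maps Amharic characters that are written differently but pronounced the same onto a single form. The sketch below illustrates the idea only. It assumes four commonly cited homophone series and an arbitrary choice of canonical character for each, and the mapping used in the paper may differ. `HOMOPHONE_SERIES`, `build_table`, and `normalize_homophones` are hypothetical names.

```python
# Illustrative homophone normalization for Amharic text.
# Assumption: each pair gives the first-order code point of a variant
# series and of the series it is folded into; the choice of canonical
# form is ours and is not taken from the paper.
HOMOPHONE_SERIES = [
    (0x1210, 0x1200),  # ሐ -> ሀ
    (0x1280, 0x1200),  # ኀ -> ሀ
    (0x1220, 0x1230),  # ሠ -> ሰ
    (0x12D0, 0x12A0),  # ዐ -> አ
    (0x1340, 0x1338),  # ፀ -> ጸ
]


def build_table(series=HOMOPHONE_SERIES, orders=7):
    """Expand each series to its seven basic vowel orders."""
    table = {}
    for variant, canonical in series:
        for offset in range(orders):
            table[variant + offset] = canonical + offset
    return table


_TABLE = build_table()


def normalize_homophones(text: str) -> str:
    """Replace variant homophone characters; leave everything else as-is."""
    return text.translate(_TABLE)


print(normalize_homophones("ሠላም"))  # prints ሰላም
```

The table is built from Unicode offsets rather than typed out character by character, which works because the seven basic orders of each series occupy consecutive code points in the Ethiopic block.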