An Arabic text processing library intended for use in NLP applications
Maha is a text processing library specially developed to deal with Arabic text. The beta version can be used to clean and parse text, files, and folders with or without streaming capability.
If you need help or want to discuss topics related to Maha, feel free to reach out to our Discord server. If you would like to submit a bug report or feature request, please open an issue.
Simply run the following to install Maha:
pip install mahad # pronounced maha d
For source installation, check the documentation.
Check out the overview section in the documentation to get started with Maha.
Documentation is hosted at ReadTheDocs.
Maha welcomes and encourages everyone to contribute. Contributions are always appreciated. Feel free to take a look at our contribution guidelines in the documentation.
Maha is BSD-licensed.