This project involves preprocessing scanned handwritten Amharic text images and segmenting them into individual sentences. The provided image preprocessing script processes the input image to detect and extract each sentence.
- Image preprocessing for scanned handwritten text
- Sentence segmentation
- Support for Amharic script
- Python 3.x
- OpenCV
- NumPy
- Matplotlib (optional, for visualization)
-
Clone this repository:
git clone https://github.com/Metasebiya-21/amharic-image-preprocessing.git
-
Navigate into the project directory:
cd amharic-image-preprocessing
-
Install the required Python packages:
pip install -r requirements.txt
-
Place your scanned handwritten text image in the
images
directory. Ensure the image is in a format supported by OpenCV (e.g., PNG, JPEG). -
Run the preprocessing script:
python preprocess.py --image images/your_image.png
Replace
your_image.png
with the name of your image file. -
The script will output the segmented sentences as individual images in the
output
directory.
Below is an example of the preprocessing steps:
The output will be individual images for each sentence, saved in the output
directory.
Contributions are welcome! Please feel free to open issues or submit pull requests.
This project is licensed under the MIT License. See the LICENSE file for details.
- OpenCV - Open Source Computer Vision Library.
- NumPy - Library for scientific computing with Python.
- Matplotlib - Plotting library for Python (optional, for visualization).
For any inquiries or support, please contact [[email protected]].