OCR (Optical Character Recognition) Project

This project is a simple Optical Character Recognition (OCR) system that extracts text from images using the Tesseract OCR engine and OpenCV for image preprocessing.

Description

This program provides a basic OCR functionality by extracting text from images. It loads an image, preprocesses it using OpenCV (grayscale conversion, and optional preprocessing steps), and then performs OCR using the Tesseract OCR engine. The extracted text is then printed to the console.

Getting Started

Dependencies

Tesseract OCR
OpenCV
Python 3.x

Installing

Install Tesseract OCR. You can download it from the Tesseract GitHub repository.
Install OpenCV using pip:

pip install opencv-python

Install the pytesseract library:

pip install pytesseract

Executing program

Clone the repository or download the Python script.
Ensure you have an image file (e.g., JPEG, PNG) containing text.
Run the script:

python ocr_script.py

Paste in the path to your image file when prompted by the terminal.

Authors

Andrew Reyes
Github: @areyes42

Version History

0.1
- Initial Release

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

OCR (Optical Character Recognition) Project

Description

Getting Started

Dependencies

Installing

Executing program

Authors

Version History

License

Files

README.md

Latest commit

History

README.md

File metadata and controls

OCR (Optical Character Recognition) Project

Description

Getting Started

Dependencies

Installing

Executing program

Authors

Version History

License