Skip to content

This project is a simple Optical Character Recognition (OCR) system that extracts text from images using the Tesseract OCR engine and OpenCV for image preprocessing.

License

Notifications You must be signed in to change notification settings

Areyes42/Optical-Character-Recognition

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

OCR (Optical Character Recognition) Project

This project is a simple Optical Character Recognition (OCR) system that extracts text from images using the Tesseract OCR engine and OpenCV for image preprocessing.

Description

This program provides a basic OCR functionality by extracting text from images. It loads an image, preprocesses it using OpenCV (grayscale conversion, and optional preprocessing steps), and then performs OCR using the Tesseract OCR engine. The extracted text is then printed to the console.

Getting Started

Dependencies

  • Tesseract OCR
  • OpenCV
  • Python 3.x

Installing

  1. Install Tesseract OCR. You can download it from the Tesseract GitHub repository.
  2. Install OpenCV using pip:
pip install opencv-python
  1. Install the pytesseract library:
pip install pytesseract

Executing program

  1. Clone the repository or download the Python script.
  2. Ensure you have an image file (e.g., JPEG, PNG) containing text.
  3. Run the script:
python ocr_script.py
  1. Paste in the path to your image file when prompted by the terminal.

Authors

Version History

  • 0.1
    • Initial Release

License

This project is licensed under the MIT License - see the LICENSE.md file for details

About

This project is a simple Optical Character Recognition (OCR) system that extracts text from images using the Tesseract OCR engine and OpenCV for image preprocessing.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages