- Need to find a way to get PyTesseract to install automatically through a bash file and not have to go through the installation loop
- Keras_ocr is also a good alternative, will need to check on it later on
- Need to find a good way to understand what threshold is good, might need to go a little more DL/CNN route
- "--psm 6" feels right for now, can try the other ones, namely 11, 12 to see what else I can improve on the reading
Need to add black and white filter and see output differential- Need to add thresholding to the filter applied
- Need to add output cleaning using re
- Need to improve quality of text being read
- Alternatives for object detection: yoloX, mmDetection, yoloV(4/7/8), openCV, moondream, timm
- Make a function to get the logo from the image
- Can use logoHunter
- vision_encoder.py - line 100
- vision_encoder.py - line 40
- get_color_palette() this gets the color palette from the image and returns the top 5 most
- get_overlay_box() this turns the image to black and white and then ocr's the image
- object_detection() this uses mmDetection to detect objects in the image and returns the image with boxes around the objects