Captcha Solver

Disclaimer

This project demonstrates CAPTCHA solving techniques for research/educational purposes only. Please be aware that using this software to bypass CAPTCHAs on websites may violate their Terms of Service and/or have legal consequences.

Description

A very basic proof-of-concept Google reCAPTCHA solver that uses the LLaVA-v1.6-7b model to extract the object name from the challenge and to detect that object in each square. The solver relies solely on vision; it does not read the HTML or any other page data. It takes screenshots and clicks the images at the detected locations. It also detects the grid size and whether new images appear after a click. In my limited testing it solved the captcha within 2 minutes at most, and often much faster.
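To make the "click the images at the given location" step concrete, here is a minimal sketch of how click targets could be derived once the grid's bounding box and size are known. The function name `cell_centers` and the example coordinates are hypothetical and not taken from this repository; the real scripts may compute positions differently.

```python
from typing import List, Tuple


def cell_centers(left: int, top: int, width: int, height: int,
                 grid_size: int) -> List[Tuple[int, int]]:
    """Return the screen coordinates of each cell centre, row by row."""
    if grid_size <= 0:
        raise ValueError("grid_size must be positive")
    cell_w = width / grid_size
    cell_h = height / grid_size
    centers = []
    for row in range(grid_size):
        for col in range(grid_size):
            # Offset by half a cell so the click lands in the middle.
            x = left + (col + 0.5) * cell_w
            y = top + (row + 0.5) * cell_h
            centers.append((round(x), round(y)))
    return centers


if __name__ == "__main__":
    # Hypothetical 3x3 grid: top-left corner at (100, 200), 300 px square.
    for x, y in cell_centers(100, 200, 300, 300, 3):
        print(x, y)  # a real solver could call pyautogui.click(x, y) here
```

Keeping the geometry in a pure function like this means it can be checked without a display; only the final click needs PyAutoGUI.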

Here is a short video demonstrating the solver:

demo_video.mp4

Limitations

Requires a GPU with at least 16 GB of VRAM
Currently only works on Ubuntu, because:

I detect the captcha window for exactly this OS (and the button border only looks like part1_bottom_2.png on Ubuntu)
LLaVA currently only supports Linux, and running it via Ollama is not accurate enough

If images disappear, all images have to be re-classified at the end
Only works for this specific reCAPTCHA layout; if it changes, the reference images also have to be updated
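One way the re-classification cost could be reduced is to compare two screenshots of the grid and only look again at cells that changed. The sketch below illustrates that idea under stated assumptions: images are plain nested lists of grayscale values, and the name `changed_cells` and the default threshold are hypothetical. It is not the repository's implementation, which is not shown here.

```python
from typing import List, Sequence

Image = Sequence[Sequence[int]]  # rows of grayscale pixel values (0-255)


def changed_cells(before: Image, after: Image, grid_size: int,
                  threshold: float = 10.0) -> List[int]:
    """Return row-major indices of grid cells whose mean absolute
    pixel difference between the two images exceeds `threshold`."""
    height, width = len(before), len(before[0])
    if len(after) != height or len(after[0]) != width:
        raise ValueError("screenshots must have the same size")
    changed = []
    for row in range(grid_size):
        y0 = row * height // grid_size
        y1 = (row + 1) * height // grid_size
        for col in range(grid_size):
            x0 = col * width // grid_size
            x1 = (col + 1) * width // grid_size
            total = sum(abs(before[y][x] - after[y][x])
                        for y in range(y0, y1) for x in range(x0, x1))
            area = (y1 - y0) * (x1 - x0)
            if area and total / area > threshold:
                changed.append(row * grid_size + col)
    return changed
```

With real screenshots the same comparison would normally be done on NumPy arrays loaded through OpenCV or Pillow; plain lists are used here only to keep the sketch dependency-free.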

Installation

Follow the installation instructions in LLaVA's repo
Install gnome-screenshot: sudo apt install gnome-screenshot
Install the Python dependencies: pip install protobuf PyAutoGUI opencv-python pillow
Run the script main.py to solve a captcha; once it's done, it will close (the LLaVA model should be downloaded automatically on first start)
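Before running main.py it can help to confirm that the dependencies are actually visible to Python. The following is a small optional sketch, not part of the repository: `missing_requirements` is a hypothetical helper, and the import names are the usual ones for the pip packages listed above. It does not check the LLaVA installation or the GPU.

```python
import importlib.util
import shutil
from typing import Iterable, List

# Import names for the pip packages listed above (the import name differs
# from the package name for most of them).
DEFAULT_MODULES = ("google.protobuf", "pyautogui", "cv2", "PIL")
DEFAULT_BINARIES = ("gnome-screenshot",)


def missing_requirements(modules: Iterable[str] = DEFAULT_MODULES,
                         binaries: Iterable[str] = DEFAULT_BINARIES) -> List[str]:
    """Return the names of modules and executables that cannot be found."""
    missing = []
    for name in modules:
        try:
            found = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # Raised, for example, when a parent package such as
            # "google" is not installed at all.
            found = False
        if not found:
            missing.append(name)
    for name in binaries:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(name) is None:
            missing.append(name)
    return missing


if __name__ == "__main__":
    problems = missing_requirements()
    if problems:
        print("Missing: " + ", ".join(problems))
    else:
        print("All requirements found.")
```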

Contributions

Contributions are welcome! If you have any issues or improvements, feel free to change the code or let me know by submitting a new issue.


jamshidkhaksaar/captcha-solver

Folders and files

Latest commit

History

Repository files navigation

Captcha Solver

Disclaimer

Description

Limitations

Installation

Contributions

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages