captcha
I am very new to Python and image processing, so I want to ask whether captchas like the ones below can be solved by preprocessing them (e.g. with OpenCV) and then using Google's Tesseract (or another OCR/ML process) to read them. To be exact, I want to know how to make these images more readable by Tesseract.

Reply
Maybe you can first remove the long colored lines that are obscuring the characters and then turn the image into a high-contrast black-and-white image? Then separate the characters using the large white space between them? That would surely make things easier for OCR to do its work.
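
A rough sketch of those steps, using only NumPy. It assumes the characters are dark and close to gray while the lines are strongly colored, which I can't confirm for your images. The function names and the threshold values (`sat_thresh`, `dark_thresh`, `min_gap`) are made up for illustration and would need tuning.

```python
import numpy as np

def binarize_without_colored_lines(rgb, sat_thresh=60, dark_thresh=128):
    """Return a boolean mask that is True where dark, unsaturated ink is.

    Assumption: glyphs are dark gray/black, obscuring lines are saturated colors.
    """
    rgb = rgb.astype(np.int16)
    spread = rgb.max(axis=2) - rgb.min(axis=2)  # crude saturation: 0 for pure gray
    brightness = rgb.mean(axis=2)
    return (spread < sat_thresh) & (brightness < dark_thresh)

def split_on_blank_columns(ink, min_gap=2):
    """Return (start, end) column ranges, end exclusive, separated by
    at least min_gap columns that contain no ink."""
    has_ink = ink.any(axis=0)
    segments, start, gap = [], None, 0
    for x, filled in enumerate(has_ink):
        if filled:
            if start is None:
                start = x
            gap = 0
        elif start is not None:
            gap += 1
            if gap >= min_gap:
                segments.append((start, x - gap + 1))
                start, gap = None, 0
    if start is not None:
        segments.append((start, len(has_ink) - gap))
    return segments

# Synthetic stand-in for a captcha: two black blocks and one red line.
img = np.full((20, 30, 3), 255, dtype=np.uint8)
img[5:15, 5:10] = 0
img[5:15, 20:25] = 0
img[10, :] = (255, 0, 0)

ink = binarize_without_colored_lines(img)
segments = split_on_blank_columns(ink)
```

Each `(start, end)` pair can then be used to crop one character, e.g. `ink[:, start:end]`, before handing it to the OCR step. If the characters touch or overlap, splitting on blank columns won't be enough.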
Reply
If you start by collecting all the different characters you can find and assigning labels to them, you can train a CNN in PyTorch to solve the problem, something like the models used for ImageNet.
Reply
Captchas are generally poorly implemented: for example, the POST request may not even send the captcha, or you can bypass it some other way. Try that first.

Next, I recommend finding out which captcha code is used (I mean the service; Breached, for example, also used bb2 captcha) or getting hold of the captchas, then using OCR (you can use existing code made by someone else or write your own, which I would prefer if you can do it) and training a model yourself.
Reply
(August 11, 2022, 10:43 PM)dthang Wrote: If you start by collecting all the different characters you can find and assigning labels to them, you can train a CNN in PyTorch to solve the problem, something like the models used for ImageNet.


Thanks bro, it's a great idea
Reply
(August 12, 2022, 09:59 AM)Mysterious Wrote: Captchas are generally poorly implemented: for example, the POST request may not even send the captcha, or you can bypass it some other way. Try that first.

Next, I recommend finding out which captcha code is used (I mean the service; Breached, for example, also used bb2 captcha) or getting hold of the captchas, then using OCR (you can use existing code made by someone else or write your own, which I would prefer if you can do it) and training a model yourself.


Right, the website actually doesn't seem to be written very properly, so I'm gonna look into that method. Image processing also looks like another monster to learn lol. Thanks for the suggestion.
Reply
If you use OpenCV, you can apply a thresholding filter to the image to make the letters stand out. Those thin lines can probably be ignored, as they are just too small.
Reply
surely they are
Reply
There are a large number of websites that have a captcha but don't actually require it, or that store the captcha in a session so you can reuse it over and over again.
Reply
(August 16, 2022, 04:24 AM)plaga789 Wrote:
(August 11, 2022, 10:43 PM)dthang Wrote: If you start by collecting all the different characters you can find and assigning labels to them, you can train a CNN in PyTorch to solve the problem, something like the models used for ImageNet.


Thanks bro, it's a great idea

It's a great idea, thank you
Reply

