helpful-purple
helpful-purple2y ago

Catch and solve captchas

Hey, I'm facing some captchas, reCaptcha V2 in this case. After I solve the captcha, then it'll 'mark' me as 'safe' and I can continue scraping. But I'm wondering how should I approach getting the captcha programmatically, solve it and send back the required response. This way I can run on a server and 'whitelist' it's IP or do the same for proxies(it keeps throwing captchas on proxies too!). Just not sure how to do all this in code.
7 Replies
Lukas Krivka
Lukas Krivka2y ago
Unles the captcha is mandatory, it is better to simply throw away the session https://docs.apify.com/academy/anti-scraping
Anti-scraping protections | Academy | Apify Documentation
Understand the various anti-scraping measures different sites use to prevent bots from accessing them, and how to appear more human to fix these issues.
helpful-purple
helpful-purpleOP2y ago
I see this heavily promoted when it comes to captchas, but from what I gather, I can neither see the source code of how it grabs captchas(I think it's the user's responsibility to do so and send it) or else. It's just an integration to a 3rd party. The interesting part to me is how to actually get that captcha, visualize it(to myself) and solve it and send it back to G. And once I know how to do this, I'll be able to send the token(?) to a 3rd party as well. As for throwing away the session, yes, I could do that, but solving the captcha will 'whitelist' the session and it'll be more reliable than throwing out thousands of proxies. Also, it's interesting to me to know how it's actually done. Thank you guys for both of your inputs!
national-gold
national-gold2y ago
I have built an automatic recaptcha v2 solver. I may be able to assist you if you DM me
foreign-sapphire
foreign-sapphire2y ago
@ThePhantom checkout nopecha for your captcha solving needs. It does a good job.
helpful-purple
helpful-purpleOP2y ago
I'd like to solve the captcha internally, not send to 3rd parties. @microworlds sent you a friend request since I can't DM you directly
national-gold
national-gold2y ago
Accepted and replied 👍

Did you find this page helpful?