genetic-orange•14mo ago
Crawler stopped abruptly and exited with a success message
I'm running a crawler on Apify. Looking at the logs, the 2 messages stand out:
FYI:
CPU
Average:8.95%
Maximum:29.11%
Memory
Average:179.2 MB
Maximum:335.1 MB
Request Queue tab:
NAME: Unnamed
TOTAL: 921
PENDING: 101
HANDLED: 820
READS: 995
WRITES: 3894
DELETES: 0
HEAD ITEM READS: 100
STORAGE SIZE: 196.7 kB
Any clues on how I can even begin to troubleshoot this?
6 Replies
hey, could you share the run's ID?
genetic-orangeOP•14mo ago
Thanks for replying, apologies for the delay. id:
p1uNn3ltHeW2zii8o
Thanks for the ID. The crawler shouldn’t have crashed like that, and we’ll look into why it happened. You can resurrect the run to fix the problem and let it continue, but you’re getting blocked so much that I wouldn’t recommend it. You’re using an
HttpCrawler
, but it’s running into Cloudflare Turnstile, which blocks it. To be able to bypass that, you would need to use a browser with fingerprints.genetic-orangeOP•14mo ago
Thanks for looking into it. So to bypass it, using the
CheerioCrawler
would be sufficient?robust-apricot•14mo ago
Yeah, I think so
genetic-orangeOP•13mo ago
I've got a bunch of 403 but that might not per se means that Cloudflare Turnstile blocked it. How can I distinguish if it's a Cloudflare 403 or not?