adverse-sapphire
adverse-sapphire12mo ago

Any suggestions for improving the speed of the crawling run?

Hello there! Beside reducing the scope of what is being crawled, for example of number of pages, etc, what can we do in order to accelerate the run? Any suggestions are welcomed, I'm simply curious.
3 Replies
Hall
Hall12mo ago
View post on community site
This post has been pushed to the community knowledgebase. Any replies in this thread will be synced to the community site.
Apify Community
lemurio
lemurio12mo ago
hey, aside from obvious improvements like allocating more memory or increasing maximum concurrency, if you're using a browser for scraping, try to find ways to replace parts of the process with simple HTTP calls, without relying on the browser navigation. You can scrape the website's internal or public JSON API or the site's pure HTML this way. Use the browser only when necessary.
ambitious-aqua
ambitious-aqua12mo ago
Basically increasing parallelism and concurrency is the first factor. But in some cases its worth it to optimize request and parsing logic. @gablabelle

Did you find this page helpful?