optimistic-gold
optimistic-gold4y ago

Parallel crawling

Ho to Parallel crawling in puppeteer crawler.
6 Replies
optimistic-gold
optimistic-goldOP4y ago
thank you
evident-indigo
evident-indigo4y ago
I recommend using desiredConcurrency to boost your starting concurrency. https://crawlee.dev/docs/guides/scaling-crawlers#desiredconcurrency
Scaling our crawlers | Crawlee
To infinity and beyond! ...within limits
Casper
Casper4y ago
Based on my experience crawlee is quite efficient at figuring out how much it can scale to without any configuration.
evident-indigo
evident-indigo4y ago
@Casper That is true, but the initial concurrency is quite low, which is why it’s good to use desiredConcurrency. It improves performance a lot, especially for short crawls with a lot of requests
Casper
Casper4y ago
I agree 👍

Did you find this page helpful?