harsh-harlequin, 3y ago

Parallel crawling

How do I crawl pages in parallel with PuppeteerCrawler?
6 Replies
harsh-harlequin (OP), 3y ago
thank you
unwilling-turquoise, 3y ago
I recommend using desiredConcurrency to boost your starting concurrency. https://crawlee.dev/docs/guides/scaling-crawlers#desiredconcurrency
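For illustration, here is a minimal sketch of how that could look, assuming Crawlee v3 with `crawlee` and `puppeteer` installed. The helper `concurrencyOptions`, the numbers 10 and 50, and the example.com start URL are placeholders of mine, not values from the docs, so tune them for your own machine and target site.

```javascript
// Hypothetical helper (not part of Crawlee): builds the concurrency-related
// options that get spread into the crawler constructor.
function concurrencyOptions(desired, max) {
    return {
        minConcurrency: 1,
        maxConcurrency: max,
        autoscaledPoolOptions: {
            // Start with `desired` parallel pages instead of ramping up
            // from the low default; never ask for more than `max`.
            desiredConcurrency: Math.min(desired, max),
        },
    };
}

async function main() {
    // Requires `npm install crawlee puppeteer`.
    const { PuppeteerCrawler } = await import('crawlee');

    const crawler = new PuppeteerCrawler({
        ...concurrencyOptions(10, 50), // assumed values, adjust as needed
        async requestHandler({ request, page, log }) {
            // Each request is handled in its own browser page.
            log.info(`${request.url}: ${await page.title()}`);
        },
    });

    // Placeholder start URL.
    await crawler.run(['https://example.com']);
}
```

Call `main()` to start the crawl. The autoscaled pool still adjusts concurrency up and down between the minimum and maximum while it runs; `desiredConcurrency` only changes where it starts.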
afraid-scarlet, 3y ago
In my experience, Crawlee is quite efficient at figuring out how far it can scale without any configuration.
unwilling-turquoise, 3y ago
@Casper That is true, but the initial concurrency is quite low, which is why it’s good to set desiredConcurrency. It improves performance a lot, especially for short crawls with a lot of requests.
afraid-scarlet, 3y ago
I agree 👍
