NeoNomade, 16mo ago

custom logic for status codes

Is it possible to run custom logic based on status codes in the Puppeteer Crawler? If yes, how?
3 Replies
lemurio, 16mo ago
You can check the status code in the response object from the PuppeteerCrawlingContext and do some logic based on that: https://crawlee.dev/api/puppeteer-crawler/interface/PuppeteerCrawlingContext#response
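A minimal sketch of that kind of branching, written so it runs without Crawlee installed. The `ResponseLike` interface and the `actionForStatus` helper are hypothetical names used only for illustration; they assume the context's `response` exposes a `status()` method (as Puppeteer's HTTPResponse does) and that it may be undefined. The choice of which codes to skip or retry is also just an example.

```typescript
// Minimal structural type: only the part of the response object used here.
// Assumption: the real object is Puppeteer's HTTPResponse, which has status().
interface ResponseLike {
    status(): number;
}

type StatusAction = 'process' | 'skip' | 'retry';

// Hypothetical helper (not part of Crawlee): maps a status code to an action.
function actionForStatus(response: ResponseLike | undefined): StatusAction {
    // Treat a missing response as something worth retrying.
    if (!response) return 'retry';

    const status = response.status();
    if (status === 404) return 'skip'; // page is gone, nothing to scrape
    if (status === 403 || status === 429 || status >= 500) return 'retry';
    return 'process';
}
```

Inside a requestHandler, one could call `actionForStatus(response)` with the `response` taken from the crawling context, return early on 'skip', and throw an error on 'retry' so the request is attempted again.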
NeoNomade (OP), 16mo ago
For example, if the status code is 403, the request gets retried before I have a chance to check this response object.
mere-teal, 16mo ago
In the session pool options, you can specify which status codes should be treated as blocked requests. I think 403 is there by default, so passing an empty array will give you full control here. See https://crawlee.dev/api/core/interface/SessionPoolOptions#blockedStatusCodes
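As a rough sketch of that suggestion, the options could look like the object below. It is kept as a plain object so the snippet stands alone; the variable name `crawlerOptions` is made up for illustration, and the assumption is that it would be passed to the PuppeteerCrawler constructor together with a requestHandler.

```typescript
// Plain options object. Assumption: in a real project this is passed to
// `new PuppeteerCrawler(...)` imported from 'crawlee'.
const crawlerOptions = {
    sessionPoolOptions: {
        // Empty list: no status code is automatically treated as "blocked",
        // so a 403 response should reach the request handler.
        blockedStatusCodes: [] as number[],
    },
};
```

With the list emptied, deciding what to do on a 403 (retry, skip, rotate the session) becomes the handler's job.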
