dangerous-fuchsiaD
Apify & Crawlee4y ago
15 replies
dangerous-fuchsia

Canada411 site failing after 4 hours

I am using a CheerioCrawler actor to process input files of 500,000 records against this dynamically populated url: https://www.canada411.ca/search/?stype=re&what=

The actor has been mysteriously failing after 4 to 4.5 hours, and we have not observed such behavior before. I have included below the log toward the end of the failed run (#KcMSz5QQp8qIQnbYF).

Any insight on this error message would be greatly appreciated. Thank you!

2022-11-09T21:39:19.012Z ERROR CheerioCrawler: An exception occurred during handling of failed request. This places the crawler and its underlying storages into an unknown state and crawling will be terminated. This may have happened due to an internal error of Apify's API or due to a misconfigured crawler.
2022-11-09T21:39:19.015Z   Error: Handling request failure of https://www.canada411.ca/search/?stype=re&what=4505855104 (undefined) timed out after 320 seconds.
2022-11-09T21:39:19.017Z       at Timeout._onTimeout (/usr/src/app/node_modules/@apify/timeout/index.js:62:68)
2022-11-09T21:39:19.018Z       at listOnTimeout (node:internal/timers:559:17)
2022-11-09T21:39:19.020Z       at processTimers (node:internal/timers:502:7)
2022-11-09T21:39:19.022Z ERROR CheerioCrawler:AutoscaledPool: runTaskFunction failed.
2022-11-09T21:39:19.023Z   Error: Handling request failure of https://www.canada411.ca/search/?stype=re&what=4505855104 (undefined) timed out after 320 seconds.
2022-11-09T21:39:19.025Z       at Timeout._onTimeout (/usr/src/app/node_modules/@apify/timeout/index.js:62:68)
2022-11-09T21:39:19.027Z       at listOnTimeout (node:internal/timers:559:17)
2022-11-09T21:39:19.029Z       at processTimers (node:internal/timers:502:7)
...
2022-11-09T21:39:23.868Z ERROR Actor finished with an error (exit code 91)
Reverse phone lookup for finding someone quickly. Enter a 7-digit number in our reverse phone number lookup for general listings or a 10-digit one for a specific listing.
Was this page helpful?