-
You can first define sessionPoolOptions in your crawler, then handle the 403 as middleware in your routes.
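To make the pause-and-retry idea concrete, here is a minimal, dependency-free sketch of the behavior asked about: on a 403 the URL goes back into the queue, the whole loop waits, and crawling then continues. It deliberately does not use Crawlee's API. `crawlWithPause`, `fetchStatus`, `pauseMs`, and `maxRetries` are hypothetical names made up for illustration, and treating every status below 400 as success is an assumption.

```typescript
// Hypothetical model of "pause on 403, then retry"; none of these names come from Crawlee.
type FetchStatus = (url: string) => Promise<number>;

interface CrawlResult {
  succeeded: string[];
  failed: string[];
  pauses: number;
}

const sleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

async function crawlWithPause(
  urls: string[],
  fetchStatus: FetchStatus, // assumed: resolves to the HTTP status code for a URL
  pauseMs: number = 2 * 60 * 1000, // the 2 minutes from the question
  maxRetries: number = 3, // assumed retry limit per URL
): Promise<CrawlResult> {
  // Each queue entry remembers how many times it has already been retried.
  const queue = urls.map((url) => ({ url, retries: 0 }));
  const result: CrawlResult = { succeeded: [], failed: [], pauses: 0 };

  while (queue.length > 0) {
    const entry = queue.shift()!;
    const status = await fetchStatus(entry.url);

    if (status === 403 && entry.retries < maxRetries) {
      // Blocked: put the URL back at the end of the queue, then pause the whole loop.
      queue.push({ url: entry.url, retries: entry.retries + 1 });
      result.pauses += 1;
      await sleep(pauseMs);
    } else if (status < 400) {
      result.succeeded.push(entry.url);
    } else {
      // Retries exhausted, or an error status other than a retryable 403.
      result.failed.push(entry.url);
    }
  }
  return result;
}
```

The sketch processes one URL at a time, so the pause stops everything. With a concurrent crawler you would also need to stop the requests that are already in flight, which this model does not cover.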
-
It's also unclear to me how to handle rate limits and similar cases.
-
Hey, I have a question about PlaywrightCrawler.
Is it possible to pause the crawl when I start getting 403 responses due to a rate limit violation, wait for 2 minutes, and then resume crawling?
Also, will the URLs that failed to be crawled because of this blocking go back into the queue?
Thanks!