We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
For example, when crawling blog.apify.com, some pages are visited multiple-times since they have different URL params:
blog.apify.com
https://blog.apify.com/contact-information-scraper-7104cb0df25e?source=post_recirc---------1------------------ https://blog.apify.com/contact-information-scraper-7104cb0df25e?source=collection_home---4------8-----------------------
We could add an input option to list URL parameters that should be ignored when deciding if URL is unique.
The text was updated successfully, but these errors were encountered:
No branches or pull requests
For example, when crawling
blog.apify.com
, some pages are visited multiple-times since they have different URL params:We could add an input option to list URL parameters that should be ignored when deciding if URL is unique.
The text was updated successfully, but these errors were encountered: