Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pages ending with a fragment are always marked as uncrawled #278

Closed
jacomyma opened this issue Jul 19, 2018 · 2 comments
Closed

Pages ending with a fragment are always marked as uncrawled #278

jacomyma opened this issue Jul 19, 2018 · 2 comments

Comments

@jacomyma
Copy link
Member

By "fragment" we mean internal anchors: index.html versus index.html#section1.
This is just an observation. It might be the desired behavior.

Pro: it makes sense since they should not be crawled, being just the same content as other already crawled pages.

Con: it leaves an impression that the crawl is unfinished, even though it actually is (ie. we do not want to crawl these pages anyway)

@boogheta
Copy link
Member

Decision CPH191021-1130 : we keep them, even though uncrawled, but when we will have Javascript crawling we will crawl them with these kind of crawls

@boogheta
Copy link
Member

attaching #288

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants