Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Keep track of where the date has been found #124

Open
adbar opened this issue Jan 10, 2024 · 2 comments
Open

Keep track of where the date has been found #124

adbar opened this issue Jan 10, 2024 · 2 comments
Labels
enhancement New feature or request
Milestone

Comments

@adbar
Copy link
Owner

adbar commented Jan 10, 2024

So far only the logs provide info on this. It would be nicer to be able to pinpoint the type (header, element, or text) or even the exact location of the result.

To do so the location info has to be propagated back along with the result.

@adbar adbar added the enhancement New feature or request label Jan 10, 2024
@adbar adbar added this to the v2.0 milestone Jan 10, 2024
@PetroffSky
Copy link

Hello!
Problem: your wonderful scanner finds dates, but sometimes they are not the ones you need, for example:
on the page: https://sushi-prk.ru/kak-sdelat-chtoby-pogony-ne-gnulis/ there is no date of creation of the article and the scanner selects the date from the footer of the page:
https://sushi-prk.ru/kak-sdelat-chtoby-pogony-ne-gnulis/#:~:text=2024
How to avoid this wrong date?

@adbar
Copy link
Owner Author

adbar commented Jun 26, 2024

This enhancement is not implemented yet. If you set the extensive_search option to False you'll restrict the search to less error-prone patterns.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants