Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rework the language detection mechanism #146

Open
merlinschumacher opened this issue Dec 2, 2024 · 1 comment
Open

Rework the language detection mechanism #146

merlinschumacher opened this issue Dec 2, 2024 · 1 comment

Comments

@merlinschumacher
Copy link
Owner

The current language detection works fine, if we have structured data or at least can reliably determine the structure of the page. For pages where we have to guess where the metadata is placed in the description we need a better way.
It probably would be better to split the detection between simple but very unreliable keywords like "en" and longer detailed ones like "englische Originalversion" for structured and non-structured metadata.

@merlinschumacher
Copy link
Owner Author

merlinschumacher commented Jan 20, 2025

85b93d5 partially addreses this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant