-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ZWJ missing from name ? #42
Comments
Yes. Known issue with Wikidata. Need to check all attributes with running a bot. Data entry issue. Its better with osm i think. |
Ohh. OSM also have same issue!. Mostly the things came via wikidata. There is no normalization scrip working in wikidata like ml.wikipedia |
Most of the labels of the panchayat were fetched into Wikidata from Malayalam wiki long before by bots or by users. So most of them will have ZWJ characters on the labels. In WD we can have one label for every entity as well as several names can be added to Aliases. So we can use the names without ZWJ characters in the label filed and with ZWJ characters as aliases. I have created a Google sheet that lists the panchayat labels from Wikidata which doesn't match the label in Wikipedia. https://docs.google.com/spreadsheets/d/1U8cCNhUx7u_nKOvuLbuwvTjpOW-2GAnS/edit#gid=1579957521 |
Awesome work! Just checked it out. Found that the placenames in website has ZWJ missing in it. This gives incorrect place names. I checked OSM node, the chil used there is ല + ് + ZWJ : https://www.openstreetmap.org/relation/11312298
Maybe all chil in OSM names should be migrated to atomic chil ? Doesn't the current use of ZWJ in names of OSM affect searching ? or just fix the script that made this webpage
The text was updated successfully, but these errors were encountered: