-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a feature to work with Hispanic and Portuguese names (latinos) #103
Comments
The current parser logic basically takes the first name it gets and sticks it in the first name list, then sticks all other names into the middle name list until it reaches the last name. It seems feasible that the logic could be changed to switch from middle names to last names at some other point as it proceeds from beginning to end, but if there can be an arbitrary number of both middle and last names how would the parser know when to switch to last names? |
Also if middle names are not really a separate concept like in English, maybe the surnames attribute is what you want: https://nameparser.readthedocs.io/en/latest/modules.html#nameparser.parser.HumanName.surnames It's basically all names except the first name or any titles or suffixes, so middle and last concated together. |
In the "Cristiano Ronaldo dos Santos Aveiro" example, what would be a useful way to separate out the name parts in Portuguese? You referred to "second" and "surname". I know in Portuguese people can have like 10 names, but I don't know what's a useful way to group them. If more names were added would you want "third" name and then "surname" for the just final name part? Would a numerical index be useful? Also might want to look at #72 for some related conversation. I think that's when I added the surnames attribute. |
yes, same in Spanish. Maybe a good way would be to have a field called "Dr. Miguel Ángel González-Fierro Palacios":
|
Is this ready to use |
The library doesn't work for latino names, in Spanish and Portuguese we have 2 surnames, ie:
Spanish: Rafael Nadal Parera
Portuguese: Cristiano Ronaldo dos Santos Aveiro (in this case Cristiano is the first name, Ronaldo de second name and the rest surnames)
Also, we can have several names, they are not called first and middle name, but fist, second, third names...
I guess it not an easy thing to implement, but there are around 700M people on that group :-)
The text was updated successfully, but these errors were encountered: