
a merge name function #72

Open
NicolasBelloy opened this issue Jun 25, 2015 · 1 comment

Comments

@NicolasBelloy

Is it possible to merge names depending on how they are formatted in the imported file?
e.g. Doe, J = Doe, John

@erickpeirson
Collaborator

Yes, we could do something like that. Right now, full-names and initial-names are stored separately. In v0.7 (coming very soon), it's something like:

>>> paper.authors_init
[('MCMURRICH', 'J P'),...]
>>> paper.authors_full
[('MCMURRICH', 'JAMES PLAYFAIR'),...]

Since WoS is inconsistent in whether its records include author full names, some papers have authors_full and others don't. The authors property will return the value of authors_full if it's available; otherwise, it returns authors_init.

>>> paper.authors
[('MCMURRICH', 'JAMES PLAYFAIR'),...]
>>> del paper.authors_full
>>> paper.authors    # authors_full is no longer available.
[('MCMURRICH', 'J P'),...]
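
For illustration only, the fallback behaviour could be sketched roughly as below. The `Paper` class here is a hypothetical stand-in written for this comment, not the library's actual implementation; only the attribute names `authors`, `authors_full`, and `authors_init` come from the session above.

```python
class Paper:
    """Hypothetical stand-in for a paper record (an assumption, not the real class)."""

    def __init__(self, authors_init, authors_full=None):
        self.authors_init = authors_init
        # Only set authors_full when the source record actually provides it.
        if authors_full is not None:
            self.authors_full = authors_full

    @property
    def authors(self):
        # Prefer full names; fall back to initials when authors_full is absent.
        return getattr(self, 'authors_full', self.authors_init)


paper = Paper([('MCMURRICH', 'J P')], [('MCMURRICH', 'JAMES PLAYFAIR')])
```

Deleting `paper.authors_full` in this sketch makes `paper.authors` return the initial-name tuples, mirroring the session shown above.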

So there are quite a few things that we could do to match full-name and initial-name representations, depending on the use-case. Here are some places we could do the matching:

  • When we index a Corpus by authors;
  • In network-building methods that evaluate author names (e.g. coauthors);
  • ???
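
Whichever place we choose, the comparison itself might look something like the sketch below. This is one possible approach under stated assumptions, not existing library code: the function name `same_author` is hypothetical, names are assumed to be `(surname, forenames)` tuples as in the examples above, and a single-letter forename is assumed to be an initial.

```python
def same_author(a, b):
    """Hypothetical helper: decide whether two (surname, forenames) tuples
    could refer to the same author, treating one-letter forenames as initials."""
    (sur_a, fore_a), (sur_b, fore_b) = a, b
    if sur_a.upper() != sur_b.upper():
        return False

    # Split forenames into parts, ignoring case and periods ("J. P." -> ["J", "P"]).
    parts_a = fore_a.upper().replace('.', ' ').split()
    parts_b = fore_b.upper().replace('.', ' ').split()
    if len(parts_a) != len(parts_b):
        return False

    for pa, pb in zip(parts_a, parts_b):
        if len(pa) == 1 or len(pb) == 1:
            # At least one side is an initial: compare first letters only.
            if pa[0] != pb[0]:
                return False
        elif pa != pb:
            # Both sides are full forenames: require an exact match.
            return False
    return True
```

With this sketch, `same_author(('Doe', 'J'), ('Doe', 'John'))` is true, while `('Doe', 'John')` and `('Doe', 'James')` are kept apart. Note that initials are inherently ambiguous ("Doe, J" could equally be a Jane), so a rule like this can merge distinct people; how aggressive to be probably depends on the use-case.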

Could you say more about the use-cases that you have in mind? That might help us home in on a good solution. Thanks for your help!
