Quantitative evaluation of gender bias in astronomical publications from citation counts

This repository contains the data and the random forest algorithm from the paper. Arxiv version of the paper is available (Caplar_Tacchella_Birrer_Quantitative-evaluation-gender.pdf). Note that the Arxiv version differs from the published version in Nature Astronomy in style and the amount of content, as Nature Astronomy asked for more succinct version of the findings.

paper id,
name as it appears in the publication,
full name, deduced from the whole database,
last name,
sex,
year of first publication,
number of citations,
number of references,
number of authors,
institution,
year of publication,
journal,
field (1-6, see Table 1 from the paper),
number of floats in the manuscript,
number of equations in the manuscript,
number of math inline in the manuscript,
number of words in the manuscript,
id of first paper by the same author

Random_Forest = folder with random forest algorithm. Inside this folder you can find:

Gender_Random_Forest.ipynb = ipython routine which does the main part of the analysis
Gender_Random_Forest_Visualization.nb = Wolfram Mathematica notebook to visualize the results
maleset, femaleset, Male_Train, Male_Test, Female = auxiliary files from the analysis and visualization parts of the algorithm

Help

For problems with using the code or installation use GitHub issues page or send us an email.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ReadMe.md

ReadMe.md

Quantitative evaluation of gender bias in astronomical publications from citation counts

Contents

Help

Files

ReadMe.md

Latest commit

History

ReadMe.md

File metadata and controls

Quantitative evaluation of gender bias in astronomical publications from citation counts

Contents

Help