post: couperin data update
jbartlewski committed Sep 17, 2024
1 parent 314beb6 commit 59b6e50
Showing 4 changed files with 249 additions and 0 deletions.
123 changes: 123 additions & 0 deletions Rmd/2024-09-17-couperin.Rmd
@@ -0,0 +1,123 @@
---
layout: post
author: Julia Bartlewski
author_lnk: https://github.com/jbartlewski
title: Couperin.org provides additional APC data for 13 institutions
date: 2024-09-17 10:00:00
summary:
categories: [general, openAPC]
comments: true
---


```{r, echo = FALSE}
knitr::opts_knit$set(base.url = "/")
knitr::opts_chunk$set(
  comment = "#>",
  collapse = TRUE,
  warning = FALSE,
  message = FALSE,
  echo = FALSE,
  fig.width = 9,
  fig.height = 6
)
options(scipen = 1, digits = 2)
knitr::knit_hooks$set(inline = function(x) {
  prettyNum(x, big.mark = ",")
})
```

The [Couperin consortium](https://couperin.org) collects data on APC expenditures from universities and other research institutions in France.

APC expenditures for 13 different institutions have now been made available to OpenAPC, with one institution being included for the first time.

Contact persons for Couperin are [Valerie Larroque](mailto:[email protected]) and [Sandrine Malotaux](mailto:[email protected]).

## Cost data

```{r, cache.lazy = TRUE}
#' Download the APC spreadsheet from GitHub (requires curl to be installed)
download_apc <- function(path = NULL, dir = "tmp", file = "apc_de.csv") {
  if (is.null(path)) {
    path <- "https://raw.githubusercontent.com/OpenAPC/openapc-de/master/data/apc_de.csv"
  }
  # Create the target directory; do not warn if it already exists
  dir.create(dir, showWarnings = FALSE)
  dest <- file.path(dir, file)
  download.file(url = path, destfile = dest, method = "curl")
  read.csv(dest, header = TRUE, sep = ",")
}
my.apc <- download_apc(c("https://raw.githubusercontent.com/OpenAPC/openapc-de/v4.129.1-0-0/data/apc_de.csv"))
my.apc <- my.apc[226020:226809,]
my.apc$publisher <- gsub("American Society for Biochemistry & Molecular Biology \\(ASBMB\\)", "ASBMB", my.apc$publisher)
my.apc$publisher <- gsub("Institute of Electrical & Electronics Engineers \\(IEEE\\)", "IEEE", my.apc$publisher)
my.apc$publisher <- gsub("American Association for the Advancement of Science \\(AAAS\\)", "AAAS", my.apc$publisher)
my.apc <- my.apc[my.apc$institution != "",]
my.apc <- droplevels(my.apc)
```

The new data set provided by Couperin covers publication fees for `r format(nrow(my.apc), big.mark =",")` articles. Total expenditure amounts to `r sum(my.apc$euro)`€ and the average fee is `r sum(my.apc$euro)/nrow(my.apc)`€.

The data originates from the following institutions:

- AgroParisTech
- Université des Antilles
- Université de Caen Normandie
- Université de Technologie de Compiègne
- ENS Paris - Ecole normale supérieure de Paris
- INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
- Université de Lorraine
- MNHN - Muséum National d'Histoire Naturelle
- Université de Picardie Jules Verne
- Université de Poitiers
- Université Savoie Mont Blanc
- Université de Toulon
- Université Sorbonne Nouvelle - Paris III - USN (**new**)



## Overview

A detailed analysis of the contributed data sets provides the following overview:

### Breakdown by publisher

```{r}
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
my.apc$publisher <- factor(my.apc$publisher, levels = d_frame$Publisher)
df.summary <- cbind(tapply(my.apc$euro, my.apc$publisher, length),
                    tapply(my.apc$euro, my.apc$publisher, sum),
                    tapply(my.apc$euro, my.apc$publisher, mean))
colnames(df.summary) <- c("Articles", "Fees paid in EURO", "Mean Fee paid")
knitr::kable(as.data.frame(df.summary), digits = 2)
```

### Fees paid per publisher (in EURO)

```{r tree_couperin_2024_09_17_full}
tt <- aggregate(my.apc$euro, by = list(my.apc$publisher), sum)
colnames(tt) <- c("Publisher", "Euro")
treemap::treemap(tt, index = c("Publisher"), vSize = "Euro", palette = "Paired")
```

### Average costs per publisher (in EURO)

```{r box_couperin_2024_09_17_publisher_full, echo = FALSE, message = FALSE}
library(ggplot2)
# Keep only publishers with more than 3 articles
d_frame <- data.frame(table(my.apc$publisher, dnn = "Publisher"))
d_frame <- d_frame[with(d_frame, order(-Freq, Publisher)), ]
publishers <- as.character(d_frame$Publisher[d_frame$Freq > 3])
my.apc_reduced <- my.apc[my.apc$publisher %in% publishers, ]
q <- ggplot(my.apc_reduced, aes(publisher, euro)) + geom_boxplot() + geom_point()
# theme_bw() is a complete theme, so it is applied once, before the axis labels
q <- q + theme_bw(base_size = 18) + coord_flip()
q + xlab("Publisher (> 3 articles)") + ylab("APC")
```
126 changes: 126 additions & 0 deletions _posts/2024-09-17-couperin.md
@@ -0,0 +1,126 @@
---
layout: post
author: Julia Bartlewski
author_lnk: https://github.com/jbartlewski
title: Couperin.org provides additional APC data for 13 institutions
date: 2024-09-17 10:00:00
summary:
categories: [general, openAPC]
comments: true
---




The [Couperin consortium](https://couperin.org) collects data on APC expenditures from universities and other research institutions in France.

APC expenditures for 13 different institutions have now been made available to OpenAPC, with one institution being included for the first time.

Contact persons for Couperin are [Valerie Larroque](mailto:[email protected]) and [Sandrine Malotaux](mailto:[email protected]).

## Cost data



The new data set provided by Couperin covers publication fees for 790 articles. Total expenditure amounts to 1,600,110€ and the average fee is 2,025€.
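
The average fee is the total expenditure divided by the number of articles. The following is a minimal sketch of that arithmetic in R, using only the two figures stated in the text; the variable names are illustrative assumptions and do not appear in the post's own code.

```r
# Figures as stated in the post; variable names are illustrative only.
total_eur <- 1600110   # total expenditure in EUR
n_articles <- 790      # number of articles covered

# Average fee per article
mean_fee <- total_eur / n_articles
round(mean_fee)        # 2025
```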

The data originates from the following institutions:

- AgroParisTech
- Université des Antilles
- Université de Caen Normandie
- Université de Technologie de Compiègne
- ENS Paris - Ecole normale supérieure de Paris
- INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
- Université de Lorraine
- MNHN - Muséum National d'Histoire Naturelle
- Université de Picardie Jules Verne
- Université de Poitiers
- Université Savoie Mont Blanc
- Université de Toulon
- Université Sorbonne Nouvelle - Paris III - USN (**new**)



## Overview

A detailed analysis of the contributed data sets provides the following overview:

### Breakdown by publisher



|Publisher | Articles| Fees paid in EURO| Mean Fee paid|
|:-----------------------------------------------------------|--------:|-----------------:|-------------:|
|Elsevier BV | 153| 233205| 1524|
|Springer Nature | 152| 382414| 2516|
|MDPI AG | 140| 263985| 1886|
|Frontiers Media SA | 87| 205395| 2361|
|Public Library of Science (PLoS) | 34| 67206| 1977|
|Wiley-Blackwell | 34| 82335| 2422|
|Oxford University Press (OUP) | 29| 79079| 2727|
|American Society for Microbiology | 20| 48876| 2444|
|Copernicus GmbH | 13| 24642| 1896|
|Informa UK Limited | 13| 32915| 2532|
|American Dairy Science Association | 12| 27298| 2275|
|IEEE | 12| 16778| 1398|
|AAAS | 6| 15908| 2651|
|Schweizerbart | 6| 8893| 1482|
|American Geophysical Union (AGU) | 5| 13166| 2633|
|Scientific Societies | 5| 11821| 2364|
|Cambridge University Press (CUP) | 4| 7842| 1961|
|Canadian Center of Science and Education | 4| 1724| 431|
|Pensoft Publishers | 4| 4351| 1088|
|Hindawi Publishing Corporation | 3| 3640| 1213|
|International Journal of Science and Research | 3| 465| 155|
|Royal Society of Chemistry (RSC) | 3| 5399| 1800|
|The Royal Society | 3| 4915| 1638|
|CSIRO Publishing | 2| 2145| 1072|
|EDP Sciences | 2| 1810| 905|
|Firenze University Press | 2| 1600| 800|
|IOP Publishing | 2| 5412| 2706|
|IOS Press | 2| 2400| 1200|
|Optica Publishing Group | 2| 4120| 2060|
|Advances in Weed Science | 1| 456| 456|
|Aging and Disease | 1| 1444| 1444|
|AIP Publishing | 1| 597| 597|
|American Chemical Society (ACS) | 1| 1534| 1534|
|American Physical Society (APS) | 1| 2411| 2411|
|American Society of Hematology | 1| 769| 769|
|Association for Research in Vision and Ophthalmology (ARVO) | 1| 1699| 1699|
|Berghahn Books | 1| 1127| 1127|
|BMJ | 1| 2800| 2800|
|Brill | 1| 2400| 2400|
|Corpus Publishers | 1| 397| 397|
|eLife Sciences Publications, Ltd | 1| 2901| 2901|
|F1000 Research, Ltd. | 1| 1265| 1265|
|International Journal Of Advanced Research | 1| 109| 109|
|Iris Publishers | 1| 2114| 2114|
|IWA Publishing | 1| 1975| 1975|
|John Libbey Eurotext | 1| 333| 333|
|Korean Society for Stem Cell Research | 1| 470| 470|
|LIDSEN Publishing Inc | 1| 120| 120|
|Medknow | 1| 194| 194|
|MyJove Corporation | 1| 3691| 3691|
|North Atlantic University Union (NAUN) | 1| 600| 600|
|PeerJ | 1| 1113| 1113|
|Proceedings of the National Academy of Sciences | 1| 1355| 1355|
|Resilience Alliance, Inc. | 1| 1118| 1118|
|SAGE Publications | 1| 1311| 1311|
|Science Publishing Group | 1| 925| 925|
|Scientific Research Publishing, Inc. | 1| 750| 750|
|Slovenian Association Informatika | 1| 500| 500|
|Society for Neuroscience | 1| 2222| 2222|
|Ubiquity Press, Ltd. | 1| 695| 695|
|University of California Press | 1| 603| 603|
|Wageningen Academic Publishers | 1| 375| 375|
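
Each row of the table holds the article count, the summed fees and the mean fee for one publisher. The following is a rough sketch of that aggregation on a small toy data set. The data frame `toy` and its values are invented for illustration; only the column names `publisher` and `euro` are taken from the OpenAPC data.

```r
# Toy data: publishers and amounts are invented for illustration only.
toy <- data.frame(
  publisher = c("A", "A", "B"),
  euro = c(1000, 2000, 1500)
)

# One row per publisher: number of articles, sum of fees, mean fee.
summary_tbl <- cbind(
  tapply(toy$euro, toy$publisher, length),
  tapply(toy$euro, toy$publisher, sum),
  tapply(toy$euro, toy$publisher, mean)
)
colnames(summary_tbl) <- c("Articles", "Fees paid in EURO", "Mean Fee paid")
summary_tbl
```

With real data the same three `tapply()` calls would run over the full `euro` and `publisher` columns, assuming those columns contain no missing values.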



### Fees paid per publisher (in EURO)

![plot of chunk tree_couperin_2024_09_17_full](/figure/tree_couperin_2024_09_17_full-1.png)

### Average costs per publisher (in EURO)

![plot of chunk box_couperin_2024_09_17_publisher_full](/figure/box_couperin_2024_09_17_publisher_full-1.png)
Binary file added figure/tree_couperin_2024_09_17_full-1.png
