post: lib4ri apc data 2023
jbartlewski committed Apr 9, 2024
1 parent 5726684 commit 19a28dd
Showing 4 changed files with 157 additions and 0 deletions.
99 changes: 99 additions & 0 deletions Rmd/2024-04-09-lib4ri.Rmd
@@ -0,0 +1,99 @@
---
layout: post
author: Julia Bartlewski
author_lnk: https://github.com/jbartlewski
title: Lib4RI provides APC data for 2023
date: 2024-04-09 08:00:00
summary:
categories: [general, openAPC]
comments: true
---


```{r, echo = FALSE}
knitr::opts_knit$set(base.url = "/")
knitr::opts_chunk$set(
comment = "#>",
collapse = TRUE,
warning = FALSE,
message = FALSE,
echo = FALSE,
fig.width = 9,
fig.height = 6
)
options(scipen = 1, digits = 2)
knitr::knit_hooks$set(inline = function(x) {
prettyNum(x, big.mark=",")
})
```

[Lib4RI](https://www.lib4ri.ch/) (Library for the Research Institutes within the ETH Domain) collects data on APC expenditures from four Swiss institutes: [Eawag](https://www.eawag.ch/en/) (Swiss Federal Institute of Aquatic Science and Technology), [Empa](https://www.empa.ch/) (Swiss Federal Laboratories for Materials Science and Technology), [PSI](https://www.psi.ch/en) (Paul Scherrer Institute) and [WSL](https://www.wsl.ch/en/index.html) (Swiss Federal Institute for Forest, Snow and Landscape Research).

APC expenditures for the 2023 period have now been made available to OpenAPC.

Contact person is [Jochen Bihn](mailto:[email protected]).

## Cost data

```{r, cache.lazy = TRUE}
#' Download the OpenAPC spreadsheet from GitHub (requires curl to be installed)
download_apc <- function(path = NULL, dir = "tmp", file = "apc_de.csv"){
if(is.null(path)) {
path <- "https://raw.githubusercontent.com/OpenAPC/openapc-de/master/data/apc_de.csv"
}
# Create the target directory only if it is missing, to avoid a warning on re-runs
if(!dir.exists(dir)) dir.create(dir)
dest <- file.path(dir, file)
download.file(url = path, destfile = dest, method = "curl")
read.csv(dest, header = TRUE, sep = ",")
}
my.apc <- download_apc()
my.apc <- my.apc[my.apc$institution %in% c("Empa - Swiss Federal Laboratories for Materials Science and Technology", "Eawag - Swiss Federal Institute of Aquatic Science and Technology", "WSL - Swiss Federal Institute for Forest, Snow and Landscape Research", "PSI - Paul Scherrer Institute"),]
my.apc <- my.apc[my.apc$period == 2023,]
my.apc$publisher <- gsub("American Association for the Advancement of Science \\(AAAS\\)", "AAAS", my.apc$publisher)
my.apc$publisher <- gsub("Institute of Electrical & Electronics Engineers \\(IEEE\\)", "IEEE", my.apc$publisher)
my.apc$publisher <- gsub("Regional Euro-Asian Biological Invasions Centre Oy \\(REABIC\\)", "REABIC", my.apc$publisher)
my.apc <- droplevels(my.apc)
```

The new data set covers publication fees for `r format(nrow(my.apc), big.mark =",")` articles published at Empa, Eawag, WSL and PSI in 2023. Total expenditure amounts to `r sum(my.apc$euro)`€ and the average fee is `r sum(my.apc$euro)/nrow(my.apc)`€. The following tables and plots provide more details.
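
The inline figures in the paragraph above pass through the custom `inline` hook defined in the setup chunk. The following is a minimal sketch of that formatting step, assuming the same `options(scipen = 1, digits = 2)` settings. The names `fmt`, `total` and `n_articles` are illustrative and not part of the original post. The two input values are taken from the rendered post rather than recomputed from the data.

```r
# Settings copied from the setup chunk of this post
old_opts <- options(scipen = 1, digits = 2)

# Same formatter as the inline hook: format(), then thousands separators
fmt <- function(x) prettyNum(x, big.mark = ",")

total <- 202942      # total expenditure in EUR, as reported in the rendered post
n_articles <- 100    # number of articles, as reported in the rendered post

total_txt <- fmt(total)
mean_txt <- fmt(total / n_articles)
print(c(total = total_txt, mean = mean_txt))

options(old_opts)    # restore the previous settings
```

With these settings the two strings should match the rendered text (`202,942` and `2,029`). Setting `scipen = 1` matters here: with `digits = 2` alone, a six-digit total would tend to be printed in scientific notation.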


```{r}
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
my.apc$publisher <- factor(my.apc$publisher, levels = d_frame$Publisher)
df.summary <-cbind(tapply(my.apc$euro, my.apc$publisher, length),
tapply(my.apc$euro, my.apc$publisher, sum),
tapply(my.apc$euro, my.apc$publisher, mean))
colnames(df.summary) <- c("Articles", "Fees paid in EURO", "Mean Fee paid")
knitr::kable(as.data.frame(df.summary), digits = 2)
```

#### Fees paid per publisher (in EURO)

```{r tree_lib4ri_2024_04_09_full}
tt <- aggregate(my.apc$euro, by = list(my.apc$publisher), sum)
colnames(tt) <- c("Publisher", "Euro")
treemap::treemap(tt, index = c("Publisher"), vSize = "Euro", palette = "Paired")
```


#### Average costs per publisher (in EURO)

```{r box_lib4ri_2024_04_09_publisher_full, echo = FALSE, message = FALSE}
library(ggplot2)
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
publishers = as.character(d_frame$Publisher[d_frame$Freq > 0])
my.apc_reduced = my.apc[my.apc$publisher %in% publishers,]
q <- ggplot(my.apc_reduced, aes(publisher, euro)) + geom_boxplot() + geom_point()
# The earlier ylab() and theme(legend.position) calls were overridden later, so they are dropped
q <- q + theme_bw(base_size = 18) + coord_flip()
q + xlab("Publisher") + ylab("APC")
```
58 changes: 58 additions & 0 deletions _posts/2024-04-09-lib4ri.md
@@ -0,0 +1,58 @@
---
layout: post
author: Julia Bartlewski
author_lnk: https://github.com/jbartlewski
title: Lib4RI provides APC data for 2023
date: 2024-04-09 08:00:00
summary:
categories: [general, openAPC]
comments: true
---




[Lib4RI](https://www.lib4ri.ch/) (Library for the Research Institutes within the ETH Domain) collects data on APC expenditures from four Swiss institutes: [Eawag](https://www.eawag.ch/en/) (Swiss Federal Institute of Aquatic Science and Technology), [Empa](https://www.empa.ch/) (Swiss Federal Laboratories for Materials Science and Technology), [PSI](https://www.psi.ch/en) (Paul Scherrer Institute) and [WSL](https://www.wsl.ch/en/index.html) (Swiss Federal Institute for Forest, Snow and Landscape Research).

APC expenditures for the 2023 period have now been made available to OpenAPC.

Contact person is [Jochen Bihn](mailto:[email protected]).

## Cost data



The new data set covers publication fees for 100 articles published at Empa, Eawag, WSL and PSI in 2023. Total expenditure amounts to 202,942€ and the average fee is 2,029€. The following tables and plots provide more details.




| | Articles| Fees paid in EURO| Mean Fee paid|
|:-------------------------------------------|--------:|-----------------:|-------------:|
|MDPI AG | 27| 50496| 1870|
|Frontiers Media SA | 25| 62109| 2484|
|Copernicus GmbH | 18| 28892| 1605|
|Springer Nature | 8| 19036| 2379|
|Wiley-Blackwell | 5| 11738| 2348|
|Public Library of Science (PLoS) | 4| 7695| 1924|
|Optica Publishing Group | 3| 6454| 2151|
|Cambridge University Press (CUP) | 2| 4236| 2118|
|American Physical Society (APS) | 1| 2702| 2702|
|Elsevier BV | 1| 786| 786|
|IEEE | 1| 2108| 2108|
|Österreichische Akademie der Wissenschaften | 1| 491| 491|
|Pensoft Publishers | 1| 1357| 1357|
|REABIC | 1| 1062| 1062|
|Royal Society of Chemistry (RSC) | 1| 2287| 2287|
|University of California Press | 1| 1494| 1494|
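
As a quick consistency check, the per-publisher rows above can be summed and compared with the totals quoted in the text. This is only a sketch: the vectors are typed in by hand from the table, in row order, and the names `articles` and `euro` are illustrative. Because the table shows rounded amounts, the summed fees may differ from the reported total by a euro or so.

```r
# Values copied from the table above, in row order (MDPI AG first)
articles <- c(27, 25, 18, 8, 5, 4, 3, 2, 1, 1, 1, 1, 1, 1, 1, 1)
euro <- c(50496, 62109, 28892, 19036, 11738, 7695, 6454, 4236,
          2702, 786, 2108, 491, 1357, 1062, 2287, 1494)

total_articles <- sum(articles)  # compare with the 100 articles quoted in the text
total_euro <- sum(euro)          # compare with the 202,942 EUR quoted in the text

# Difference caused by rounding in the displayed rows
rounding_gap <- total_euro - 202942

print(c(articles = total_articles, euro = total_euro, gap = rounding_gap))
```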



#### Fees paid per publisher (in EURO)

![plot of chunk tree_lib4ri_2024_04_09_full](/figure/tree_lib4ri_2024_04_09_full-1.png)


#### Average costs per publisher (in EURO)

![plot of chunk box_lib4ri_2024_04_09_publisher_full](/figure/box_lib4ri_2024_04_09_publisher_full-1.png)
Binary file added figure/box_lib4ri_2024_04_09_publisher_full-1.png
Binary file added figure/tree_lib4ri_2024_04_09_full-1.png
