Skip to content

Commit

Permalink
post: darmstadt oapk 2023
Browse files Browse the repository at this point in the history
  • Loading branch information
cbroschinski committed Feb 5, 2025
1 parent d9364f5 commit 7f7f57b
Show file tree
Hide file tree
Showing 5 changed files with 190 additions and 0 deletions.
117 changes: 117 additions & 0 deletions Rmd/2025-02-05-darmstadt.Rmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,117 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: TU Darmstadt 2023 OAPK data integrated
date: 2025-02-05 07:00:00
summary:
categories: [general, openAPC]
comments: true
---


```{r, echo =FALSE}
knitr::opts_knit$set(base.url = "/")
knitr::opts_chunk$set(
comment = "#>",
collapse = TRUE,
warning = FALSE,
message = FALSE,
echo = FALSE,
fig.width = 9,
fig.height = 6
)
options(scipen = 1, digits = 2)
knitr::knit_hooks$set(inline = function(x) {
prettyNum(x, big.mark=" ")
})
```

The Forschungszentrum Jülich collects publication cost data from German institutions as part of the DFG programme "[Open Access Publication Funding](https://www.fz-juelich.de/en/zb/open-science/open-access/monitoring-dfg-oa-publication-funding)".

When reporting their publication costs, the [Technische Universität Darmstadt](https://www.tu-darmstadt.de/index.en.jsp) had agreed to share the data with OpenAPC as well. This data has now been transferred and integrated.

Contact person is [Karen Schiele](mailto:[email protected]).

## Cost data

```{r, cache.lazy = TRUE}
#' Download APC spreadsheet from github which requires to Curl installed
download_apc <- function(path = NULL, dir = "tmp", file = "apc_de.csv"){
if(is.null(path)) {
path <- c("https://raw.githubusercontent.com/OpenAPC/openapc-de/master/data/apc_de.csv")
}
dir.create(dir)
download.file(url = path, destfile = paste(dir, file, sep = "/"), method = "curl")
read.csv(paste(dir, file, sep = "/"), header = T,sep =",")
}
my.apc <- download_apc()
my.apc <- my.apc[my.apc$institution == "TU Dresden",]
my.apc$publisher <- gsub("Institute of Electrical & Electronics Engineers \\(IEEE\\)", "IEEE", my.apc$publisher)
my.apc <- droplevels(my.apc)
my.apc_new <- download_apc(c("https://raw.githubusercontent.com/OpenAPC/openapc-de/refs/tags/v4.143.4-3-2/data/tudarmstadt/oapk_2023_tu_darmstadt_enriched.csv"))
my.apc_new <- my.apc_new[my.apc_new$institution == "TU Darmstadt" & !is.na(my.apc_new$institution),]
my.apc_new <- droplevels(my.apc_new)
```

The new data set covers publication fees for `r format(nrow(my.apc_new), big.mark =",")` articles, total expenditure amounts to `r sum(my.apc_new$euro)`€ and the average fee is `r sum(my.apc_new$euro)/nrow(my.apc_new)`€. Please note that BPCs are not included in this list, as they are aggregated in OpenAPC's [BPC](https://github.com/OpenAPC/openapc-de/blob/master/data/bpc.csv) data set.

The following table provides an overview of the reported APCs:


```{r}
d_frame = data.frame(table(my.apc_new$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
my.apc_new$publisher <- factor(my.apc_new$publisher, levels = d_frame$Publisher)
df.summary <-cbind(tapply(my.apc_new$euro, my.apc_new$publisher, length),
tapply(my.apc_new$euro, my.apc_new$publisher, sum),
tapply(my.apc_new$euro, my.apc_new$publisher, mean))
colnames(df.summary) <- c("Articles", "Fees paid in EURO", "Mean Fee paid")
knitr::kable(as.data.frame(df.summary), digits = 2)
```

## Overview

With the recent contribution included, the overall APC data for the TU Darmstadt now looks as follows:

### Fees paid per publisher (in EURO)

```{r tree_darmstadt_2025_02_05_full}
tt <- aggregate(my.apc$euro, by = list(my.apc$publisher), sum)
colnames(tt) <- c("Publisher", "Euro")
treemap::treemap(tt, index = c("Publisher"), vSize = "Euro", palette = "Paired")
```

### Average costs per year (in EURO)

```{r box_darmstadt_2025_02_05_year_full, echo=FALSE, message = FALSE}
require(ggplot2)
q <- ggplot(my.apc, aes(factor(period), euro)) + geom_boxplot() + geom_point()
q <- q + ylab("Fees paid (in EURO)") + theme(legend.position="top") + theme_bw(base_size = 18)
q + xlab("Funding period") + ylab("APC")
```

### Average costs per publisher (in EURO)

```{r box_darmstadt_2025_02_05_publisher_full, echo = FALSE, message = FALSE}
require(ggplot2)
require(tidyverse)
my.apc <- my.apc %>%
mutate(publisher = str_replace(publisher, ".+\\((\\w+)\\)", "\\1"))
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
publishers = as.character(d_frame$Publisher[d_frame$Freq > 10])
my.apc_reduced = my.apc[my.apc$publisher %in% publishers,]
q <- ggplot(my.apc_reduced, aes(publisher, euro)) + geom_boxplot() + geom_point()
q <- q + ylab("Fees paid (in EURO)") + theme(legend.position="top") + theme_bw(base_size = 18) + coord_flip()
q + xlab("Publisher (> 10 articles)") + ylab("APC")
```
73 changes: 73 additions & 0 deletions _posts/2025-02-05-darmstadt.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,73 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: TU Darmstadt 2023 OAPK data integrated
date: 2025-02-05 07:00:00
summary:
categories: [general, openAPC]
comments: true
---




The Forschungszentrum Jülich collects publication cost data from German institutions as part of the DFG programme "[Open Access Publication Funding](https://www.fz-juelich.de/en/zb/open-science/open-access/monitoring-dfg-oa-publication-funding)".

When reporting their publication costs, the [Technische Universität Darmstadt](https://www.tu-darmstadt.de/index.en.jsp) had agreed to share the data with OpenAPC as well. This data has now been transferred and integrated.

Contact person is [Karen Schiele](mailto:[email protected]).

## Cost data



The new data set covers publication fees for 129 articles, total expenditure amounts to 290 374€ and the average fee is 2 251€. Please note that BPCs are not included in this list, as they are aggregated in OpenAPC's [BPC](https://github.com/OpenAPC/openapc-de/blob/master/data/bpc.csv) data set.

The following table provides an overview of the reported APCs:




| | Articles| Fees paid in EURO| Mean Fee paid|
|:------------------------------------------------------|--------:|-----------------:|-------------:|
|MDPI AG | 45| 85453| 1899|
|Springer Nature | 16| 35049| 2191|
|Frontiers Media SA | 12| 27922| 2327|
|Wiley-Blackwell | 12| 33269| 2772|
|Elsevier BV | 10| 29143| 2914|
|Institute of Electrical & Electronics Engineers (IEEE) | 6| 13453| 2242|
|American Chemical Society (ACS) | 4| 12445| 3111|
|Copernicus GmbH | 4| 8135| 2034|
|Royal Society of Chemistry (RSC) | 3| 6318| 2106|
|American Astronomical Society | 2| 6663| 3332|
|American Physical Society (APS) | 2| 5787| 2893|
|Cambridge University Press (CUP) | 2| 4292| 2146|
|Oxford University Press (OUP) | 2| 7472| 3736|
|Fortune Journals | 1| 2868| 2868|
|Institution of Engineering and Technology (IET) | 1| 2380| 2380|
|Nomos Verlag | 1| 2368| 2368|
|OAPublishing Collective | 1| 700| 700|
|Optica Publishing Group | 1| 2225| 2225|
|Polish Academy of Sciences Chancellery | 1| 409| 409|
|Public Library of Science (PLoS) | 1| 1985| 1985|
|Scientific Research and Community Ltd | 1| 1265| 1265|
|Virtus Interpress | 1| 774| 774|



## Overview

With the recent contribution included, the overall APC data for the TU Darmstadt now looks as follows:

### Fees paid per publisher (in EURO)

![plot of chunk tree_darmstadt_2025_02_05_full](/figure/tree_darmstadt_2025_02_05_full-1.png)

### Average costs per year (in EURO)

![plot of chunk box_darmstadt_2025_02_05_year_full](/figure/box_darmstadt_2025_02_05_year_full-1.png)

### Average costs per publisher (in EURO)

![plot of chunk box_darmstadt_2025_02_05_publisher_full](/figure/box_darmstadt_2025_02_05_publisher_full-1.png)
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added figure/tree_darmstadt_2025_02_05_full-1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 7f7f57b

Please sign in to comment.