Skip to content

Commit

Permalink
post: htwg apc, bpc, deal 2020 - 2023
Browse files Browse the repository at this point in the history
  • Loading branch information
cbroschinski committed Nov 7, 2024
1 parent 84e377f commit deee5c4
Show file tree
Hide file tree
Showing 5 changed files with 156 additions and 0 deletions.
110 changes: 110 additions & 0 deletions Rmd/2024-11-06-htwg.Rmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,110 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: HTWG Konstanz updates APC, BPC and DEAL data
date: 2024-11-06 12:00:00
summary:
categories: [general, openAPC]
comments: true
---


```{r, echo = FALSE}
knitr::opts_knit$set(base.url = "/")
knitr::opts_chunk$set(
comment = "#>",
collapse = TRUE,
warning = FALSE,
message = FALSE,
echo = FALSE,
fig.width = 9,
fig.height = 6
)
options(scipen = 1, digits = 2)
knitr::knit_hooks$set(inline = function(x) {
prettyNum(x, big.mark=",")
})
```

The [HTWG Konstanz - University of Applied Sciences](https://www.htwg-konstanz.de/en/) has updated its expenditures, the latest contribution provides data on APCs, BPCs and Springer DEAL data ranging from 2020 to 2023.


## Cost data

```{r, cache.lazy = TRUE}
#' Download APC spreadsheet from github which requires to Curl installed
download_apc <- function(path = NULL, dir = "tmp", file = "apc_de.csv"){
if(is.null(path)) {
path <- c("https://raw.githubusercontent.com/OpenAPC/openapc-de/master/data/apc_de.csv")
}
dir.create(dir)
download.file(url = path, destfile = paste(dir, file, sep = "/"), method = "curl")
read.csv(paste(dir, file, sep = "/"), header = T,sep =",")
}
my.apc <- download_apc()
my.apc <- my.apc[my.apc$institution == "HTWG Konstanz",]
my.apc <- droplevels(my.apc)
my.apc_new <- download_apc(c("https://raw.githubusercontent.com/OpenAPC/openapc-de/refs/heads/master/data/htwg/OpenAPC_HTWG_bis2021_enriched.csv"))
my.apc_new <- droplevels(my.apc_new)
```

The new data set on APCs provided by HTWG Konstanz covers publication fees for `r format(nrow(my.apc_new), big.mark =",")` articles, total expenditure amounts to `r sum(my.apc_new$euro)`€ and the average fee is `r sum(my.apc_new$euro)/nrow(my.apc_new)`€. Please note that neither the articles published in **hybrid** journals under the DEAL agreement nor BPCs are included in this list, as they are aggregated in OpenAPC's [transformative agreements](https://github.com/OpenAPC/openapc-de/tree/master/data/transformative_agreements) and [BPC](https://github.com/OpenAPC/openapc-de/blob/master/data/bpc.csv) datasets, respectively.

```{r}
d_frame = data.frame(table(my.apc_new$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
my.apc_new$publisher <- factor(my.apc_new$publisher, levels = d_frame$Publisher)
df.summary <-cbind(tapply(my.apc_new$euro, my.apc_new$publisher, length),
tapply(my.apc_new$euro, my.apc_new$publisher, sum),
tapply(my.apc_new$euro, my.apc_new$publisher, mean))
colnames(df.summary) <- c("Articles", "Fees paid in EURO", "Mean Fee paid")
knitr::kable(as.data.frame(df.summary), digits = 2)
```

## Overview

With the recent contribution included, the overall APC data for the HTWG now looks as follows:

### Fees paid per publisher (in EURO)

```{r tree_htwg_2024_11_06_full}
tt <- aggregate(my.apc$euro, by = list(my.apc$publisher), sum)
colnames(tt) <- c("Publisher", "Euro")
treemap::treemap(tt, index = c("Publisher"), vSize = "Euro", palette = "Paired")
```

### Average costs per year (in EURO)

```{r box_htwg_2024_11_06_year_full, echo=FALSE, message = FALSE}
require(ggplot2)
q <- ggplot(my.apc, aes(factor(period), euro)) + geom_boxplot() + geom_point()
q <- q + ylab("Fees paid (in EURO)") + theme(legend.position="top") + theme_bw(base_size = 18)
q + xlab("Funding period") + ylab("APC")
```

### Average costs per publisher (in EURO)

```{r box_htwg_2024_11_06_publisher_full, echo = FALSE, message = FALSE}
require(ggplot2)
require(tidyverse)
my.apc <- my.apc %>%
mutate(publisher = str_replace(publisher, ".+\\((\\w+)\\)", "\\1"))
d_frame = data.frame(table(my.apc$publisher, dnn="Publisher"))
d_frame = d_frame[with(d_frame, order(-Freq, Publisher)), ]
publishers = as.character(d_frame$Publisher[d_frame$Freq > 0])
my.apc_reduced = my.apc[my.apc$publisher %in% publishers,]
q <- ggplot(my.apc_reduced, aes(publisher, euro)) + geom_boxplot() + geom_point()
q <- q + ylab("Fees paid (in EURO)") + theme(legend.position="top") + theme_bw(base_size = 18) + coord_flip()
q + xlab("Publisher") + ylab("APC")
```
46 changes: 46 additions & 0 deletions _posts/2024-11-06-htwg.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,46 @@
---
layout: post
author: Christoph Broschinski
author_lnk: https://github.com/cbroschinski
title: HTWG Konstanz updates APC, BPC and DEAL data
date: 2024-11-06 12:00:00
summary:
categories: [general, openAPC]
comments: true
---




The [HTWG Konstanz - University of Applied Sciences](https://www.htwg-konstanz.de/en/) has updated its expenditures, the latest contribution provides data on APCs, BPCs and Springer DEAL data ranging from 2020 to 2023.


## Cost data



The new data set on APCs provided by HTWG Konstanz covers publication fees for 7 articles, total expenditure amounts to 10,523€ and the average fee is 1,503€. Please note that neither the articles published in **hybrid** journals under the DEAL agreement nor BPCs are included in this list, as they are aggregated in OpenAPC's [transformative agreements](https://github.com/OpenAPC/openapc-de/tree/master/data/transformative_agreements) and [BPC](https://github.com/OpenAPC/openapc-de/blob/master/data/bpc.csv) datasets, respectively.



| | Articles| Fees paid in EURO| Mean Fee paid|
|:-------|--------:|-----------------:|-------------:|
|MDPI AG | 7| 10523| 1503|



## Overview

With the recent contribution included, the overall APC data for the HTWG now looks as follows:

### Fees paid per publisher (in EURO)

![plot of chunk tree_htwg_2024_11_06_full](/figure/tree_htwg_2024_11_06_full-1.png)

### Average costs per year (in EURO)

![plot of chunk box_htwg_2024_11_06_year_full](/figure/box_htwg_2024_11_06_year_full-1.png)

### Average costs per publisher (in EURO)

![plot of chunk box_htwg_2024_11_06_publisher_full](/figure/box_htwg_2024_11_06_publisher_full-1.png)
Binary file added figure/box_htwg_2024_11_06_publisher_full-1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added figure/box_htwg_2024_11_06_year_full-1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added figure/tree_htwg_2024_11_06_full-1.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit deee5c4

Please sign in to comment.