This repository is dedicated to benchmarking GLMs using the Benchopt framework.
This is a benchmark based on the Benchopt framework. You can learn more about it here.
A generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a link function. In a generalized linear model, the outcome
where
Use benchopt run -h
for more details about the available options, or visit https://benchopt.github.io/api.html.
This repository is dedicated to benchmarking GLMs using the Benchopt framework.
This is a benchmark based on the Benchopt framework. You can learn more about it here.
A generalized linear model (GLM) is a flexible generalization of ordinary linear regression. The GLM generalizes linear regression by allowing the linear model to be related to the response variable via a link function. In a generalized linear model, the outcome
where
As already mentioned, let
In the case of linear regression,
The following datasets are used:
- The bodyfat LIBSVM dataset
- The diabetes
sklearn
dataset - The California housing
sklearn
dataset - A simulated dataset
In the case of logistic regression
The following datasets are used :
- The
sklearn
breast cancer dataset - A simulated dataset
In the case of poisson regression,
For Poisson regression, the following datasets were used :
- The freMTPL insurance dataset
- A simulated dataset with different levels of sparsity for the design matrix
$\boldsymbol{X}$
This benchmark can be run using the following commands:
$ pip install -U benchopt
$ git clone https://github.com/wassimmazouz/benchmark_glm
$ cd benchmark_glm
$ benchopt run .
Options can be passed to benchopt run
, to restrict the benchmarks to some solvers or datasets, e.g.:
$ benchopt run . -s sklearn -d bcancer --max-runs 10 --n-repetitions 10
Use benchopt run -h
for more details about these options, or visit https://benchopt.github.io/api.html.