Feature/108 implement sdt models #113

GidonFrischkorn · 2024-02-21T15:02:35Z

Summary

Add SDT models for old/new (signal/noise) recognition as a bmmmodel class.
Add distribution functions for density and random generation for SDT models
Initiate an experimental method for aggregating data to speed up model fitting
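
As a rough illustration of what aggregating means here, the sketch below collapses trial-level binary responses into counts per participant and stimulus type, which is the shape a binomial likelihood with `trials()` expects. It uses base R only; the data frame and the column names (`id`, `stimulus`, `response`, `n_old`, `n_trials`) are made up for the example and are not the names used in the package.

```r
# Hypothetical trial-level data: one row per trial, response 1 = "old"
trials <- data.frame(
  id       = rep(1:2, each = 6),
  stimulus = rep(c(0, 1), times = 6),   # 0 = noise/new, 1 = signal/old
  response = c(0, 1, 0, 1, 1, 1, 0, 0, 0, 1, 1, 1)
)

# Number of "old" responses and number of trials per id x stimulus cell
n_old    <- aggregate(response ~ id + stimulus, data = trials, FUN = sum)
n_trials <- aggregate(response ~ id + stimulus, data = trials, FUN = length)

# Combine both summaries into one aggregated data frame
agg <- merge(n_old, n_trials, by = c("id", "stimulus"))
names(agg) <- c("id", "stimulus", "n_old", "n_trials")
agg
```

The aggregated data has one row per cell instead of one row per trial, so the likelihood is evaluated far fewer times.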

Tests

[x] Confirm that all tests passed
[x] Confirm that devtools::check() produces no errors

Release notes

GidonFrischkorn · 2024-02-21T15:04:13Z

I just wanted to share my progress so far. The old/new SDT models should be implemented. For now, I added all noise distributions that can be easily accommodated (Normal, EVD, Cauchy, and Logistic). However, we might want to discuss whether that matches the theoretical discussions around SDT models.

The main things that still need to be done with respect to the old/new SDT models are:

add adequate tests for the SDT models
write a vignette explaining how to specify and use the SDT models.
add an example data set to try out the SDT models

venpopov · 2024-02-21T15:20:54Z

ok, I'll try to review what you have so far before we meet next week

GidonFrischkorn · 2024-02-21T15:29:10Z

ok, I'll try to review what you have so far before we meet next week

There is no hurry. I just thought you might be interested, as the old/new models are mostly done. If you have other things to do, I can also show you what I have done next week when we meet.

venpopov · 2024-02-21T16:20:16Z

yes, I just want to consider more broadly how the various SDT models, such as old/new, 2-AFC, m-AFC, and confidence ratings, might fit together and what we would need to have as separate models

GidonFrischkorn · 2024-02-22T09:23:46Z

I found this article that should provide a good reference for how to generalize the old/new SDT models to m-AFC SDT models: https://www.sciencedirect.com/science/article/pii/S0022249612000260

venpopov · 2024-02-22T14:26:20Z

I found this article that should provide a good reference for how to generalize the old/new SDT models to m-AFC SDT models: https://www.sciencedirect.com/science/article/pii/S0022249612000260

good find! Given his interest and extensive research and writing on SDT, SDT via regression and Bayesian implementations, and measurement in general (https://www.tc.columbia.edu/faculty/ld208/), maybe at some point we can contact him to chat about the package and get his thoughts on how the SDT is implemented, and maybe get him involved if he wants to?

He's also written a bunch of programs for other software for SDT models, so I bet his experience and insight will be useful:

GidonFrischkorn · 2024-02-22T15:40:34Z

I thought the same. For the moment, I think we can implement plenty of things ourselves, but it would still be interesting to discuss optimizations of the SDT implementations with him.

He has also done some interesting work linking SDT to psychometric models such as IRT that I would find interesting to explore. But then again, there is already plenty of work on my desk. So maybe this is rather something for the future...

venpopov · 2024-02-28T08:04:40Z

R/distributions.R

+#' @param crit A vector of numerical values for the decision criterion.
+#' @param stimulus A character vector ("signal"/"noise", "old"/"new") or
+#'   integer vector (1/0) coding which stimuli were shown alongside the observed responses,
+#'   or for which to generate data for.  If no information is provided for `rSDT` then we will


the function only returns hits by default. I think this is probably for the best, and would rather remove the description than change the function. Or return a list of two vectors, one for hits and one for FAs

I added some documentation to clarify this point. My thought was to make the distribution functions consistent with the implementation as bmmmodel.

Maybe you can have a look if this resolves your concerns.

venpopov · 2024-02-28T08:07:11Z

R/distributions.R

+
+#' @rdname SDTdist
+#' @export
+rSDT <- function(n, size, dprime = 1, crit = dprime/2, stimulus = 1, dist_noise = "normal") {


could be good to have an option to return proportions instead of counts

Hmm... do you mean that we return the proportion of hits/FA instead of the count? Why would users need these? If they want, they can directly compute them from returnedCounts/size, but they would lose the information on how many trials the proportion was calculated on.

Maybe I am misunderstanding something, but my thought was to make these functions consistent with the model implementation.
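
To make the counts-versus-proportions point concrete, here is a minimal sketch in base R. It does not call `rSDT`; `rbinom` with a hit probability of `pnorm(dprime - crit)` is used as a stand-in for an equal-variance normal SDT model, and all variable names are illustrative.

```r
set.seed(1)

size   <- 40    # number of signal trials per simulated participant
dprime <- 1
crit   <- 0.5

# Stand-in for the random generation function: hit counts out of `size` trials
hits <- rbinom(5, size = size, prob = pnorm(dprime - crit))

# Proportions are one division away from the counts ...
hit_rate <- hits / size

# ... but counts cannot be recovered from proportions without knowing `size`
data.frame(hits = hits, size = size, hit_rate = hit_rate)
```

Returning counts therefore keeps strictly more information, and users who want proportions can derive them in one line.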

venpopov · 2024-02-28T08:23:36Z

R/bmm_model_SDT.R

+      nTrials <- model$resp_vars$nTrials
+      brms_formula <- brms::bf(paste0(response," | ", "trials(",nTrials,")", " ~ dprime*",stimulus," - crit" ), nl = TRUE)
+   } else {
+      brms_formula <- brms::bf(paste0(response," ~ dprime*",stimulus," - crit"),nl = TRUE)


this works for symmetric distributions, but for asymmetric ones it gives the wrong part of the cdf. Will explain in person

As far as I have checked with the parameter recoveries I ran, this implementation does recover the parameters well using the cloglog link for Gumbel noise. But we should test this thoroughly to make sure that I did not miss anything.
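
One possible reading of the symmetry concern can be checked numerically. With evidence equal to `dprime * stimulus` plus noise with CDF `F`, the probability of an "old" response is the upper tail `1 - F(crit - dprime * stimulus)`. For symmetric noise this equals `F(dprime * stimulus - crit)`, but for skewed noise it does not. The sketch below uses base R; `pgumbel_max` and `inv_cloglog` are helper functions defined here for illustration, and which Gumbel variant the package intends for EVD noise is an assumption left open.

```r
dprime   <- 1.5
crit     <- 0.5
stimulus <- c(0, 1)                     # 0 = noise, 1 = signal
eta      <- dprime * stimulus - crit    # linear predictor from the formula

# Symmetric (normal) noise: upper tail at -eta equals the CDF at eta
p_upper_normal <- 1 - pnorm(-eta)
p_cdf_normal   <- pnorm(eta)

# Skewed noise: standard Gumbel (maximum) CDF, defined here by hand
pgumbel_max <- function(x) exp(-exp(-x))
p_upper_gumbel <- 1 - pgumbel_max(-eta)
p_cdf_gumbel   <- pgumbel_max(eta)

# Inverse of the cloglog link
inv_cloglog <- function(eta) 1 - exp(-exp(eta))

rbind(p_upper_normal, p_cdf_normal, p_upper_gumbel, p_cdf_gumbel,
      cloglog = inv_cloglog(eta))
```

In this sketch the inverse cloglog link reproduces the upper tail under Gumbel (maximum) noise, not the Gumbel CDF evaluated at `eta`, so the link and the assumed noise distribution have to be matched deliberately for asymmetric cases.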

- make "normal" default noise distribution
- remove unnecessary functions
- Improve documentation
- move `stimulus` and `nTrials` to other_vars

GidonFrischkorn · 2024-03-01T08:36:07Z

Things that still need to be added to finish the Old/New Recognition SDT models:

vignette introducing the model and providing an example of how to fit the models
unit tests for the added functions
add an example data set to fit the models
evaluate parameter recovery more systematically

GidonFrischkorn · 2024-03-01T08:37:16Z

I will try to take care of most of the to-dos in the next week.

venpopov · 2024-03-01T09:36:56Z

Perfect, and I can finish the review and test it with the code you sent in the meantime

- update the already implemented SDT models to the current bmm version
- add an info `custom_bmf2bf` to all model objects to handle the `bmmformula` to `brmsformula` translation more efficiently.

GidonFrischkorn · 2024-11-20T10:50:46Z

I just found this package implementing EV and UEV SDT models in Stan with a newly proposed link function that also allows fitting hierarchical models for the thresholds: https://github.com/boryspaulewicz/bhsdtr2/tree/master

Maybe we should have a look at how they coded this in Stan and see if we can adapt this for our package. Generally, the principle for predicting the model parameters looks pretty similar, with the difference that they do not interact with brms at all.

GidonFrischkorn added 5 commits February 21, 2024 08:28

Initial implementation of SDT for 2-AFC

a4c9ef2

Add distribution functions for SDT

8006cfe

clean up documentation

06c8dca

Merge branch 'develop' into feature/108-implement-sdt-models

3b9c0bc

Merge branch 'develop' into feature/108-implement-sdt-models

e1274e1

GidonFrischkorn added the PR - minor Pull-request should update minor version label Feb 21, 2024

GidonFrischkorn added this to the 1.0.0 milestone Feb 21, 2024

GidonFrischkorn linked an issue Feb 21, 2024 that may be closed by this pull request

implement SDT models #108

Open

GidonFrischkorn marked this pull request as draft February 21, 2024 15:02

Update Documentation of SDT models

556b1db

Minor bugfixes.

9bf4fe9

venpopov added 2 commits February 28, 2024 08:31

Merge branch 'develop' into feature/108-implement-sdt-models

82cbf34

regenerate documentation after merge from develop

2ca0ca1