MATTER

MATTER is a framework that aims to enable consistent comparisons of defect prediction models.

MATTER is developed and tested under: R-4.2.1
RStudio 2022.07.1+554 "Spotted Wakerobin" Release (7872775ebddc40635780ca1ed238934c3345c5de, 2022-07-22) for Windows

Three main features of MATTER

SQA-effort alignment. Compare multiple defect prediction models under the same available code inspection effort or context switch effort.
Baseline model ONE. A baseline model, "ONE", for defect prediction.
Novel indicators. Implementations of traditional (recall, MCC, G1, G2, AUC, PF, etc.) and novel (ROI, eIFA) performance indicators.
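
As an illustration of the traditional indicators listed above, the following is a minimal Python sketch of recall, PF (false positive rate), and MCC computed from binary labels. It uses the standard textbook definitions and is not taken from MATTER's R code; the function names are hypothetical, and the novel indicators (ROI, eIFA) are not covered here.

```python
import math


def confusion_counts(actual, predicted):
    """Count TP, FP, TN, FN from two equal-length sequences of 0/1 labels."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == 1 and p == 1)
    fp = sum(1 for a, p in pairs if a == 0 and p == 1)
    tn = sum(1 for a, p in pairs if a == 0 and p == 0)
    fn = sum(1 for a, p in pairs if a == 1 and p == 0)
    return tp, fp, tn, fn


def recall(tp, fp, tn, fn):
    """Fraction of actually defective instances that were predicted defective."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def pf(tp, fp, tn, fn):
    """Probability of false alarm: fraction of clean instances flagged as defective."""
    return fp / (fp + tn) if (fp + tn) else 0.0


def mcc(tp, fp, tn, fn):
    """Matthews correlation coefficient; returns 0.0 when the denominator is zero."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0
```

Here `actual` would correspond to the actualBugLabel column and `predicted` to the predictLabel column of a prediction result file.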

Usage of MATTER

To configure the datasets, the evaluated models, and the path of the original prediction results for your evaluation experiments, modify:
./config.r

Note: the prediction result files of the compared models need to contain at least the four columns below so that MATTER can evaluate them under a specific SQA-effort:

sloc	predictedValue	predictLabel	actualBugLabel
numeric	numeric	binary	binary
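
Before running MATTER, it can help to check that a result file has the four required columns. The sketch below is a hypothetical Python helper, not part of MATTER. It assumes the file is delimited text with a header row and that the binary columns are encoded as 0/1; adjust both assumptions to match your own result files.

```python
import csv
import io

# Column names required by MATTER, as listed above.
REQUIRED_COLUMNS = ("sloc", "predictedValue", "predictLabel", "actualBugLabel")


def load_prediction_rows(handle, delimiter=","):
    """Read a prediction result file and validate the four required columns.

    Assumes a header row and 0/1 encoding for the two label columns.
    """
    reader = csv.DictReader(handle, delimiter=delimiter)
    header = reader.fieldnames or []
    missing = [name for name in REQUIRED_COLUMNS if name not in header]
    if missing:
        raise ValueError("missing required columns: " + ", ".join(missing))

    rows = []
    for line_no, raw in enumerate(reader, start=2):  # line 1 is the header
        row = {
            "sloc": float(raw["sloc"]),
            "predictedValue": float(raw["predictedValue"]),
            "predictLabel": int(raw["predictLabel"]),
            "actualBugLabel": int(raw["actualBugLabel"]),
        }
        for name in ("predictLabel", "actualBugLabel"):
            if row[name] not in (0, 1):
                raise ValueError(f"line {line_no}: {name} must be 0 or 1")
        rows.append(row)
    return rows


# Usage with an in-memory sample; extra columns are allowed and ignored.
sample = io.StringIO(
    "file,sloc,predictedValue,predictLabel,actualBugLabel\n"
    "A.java,120,0.8,1,1\n"
    "B.java,30,0.1,0,0\n"
)
rows = load_prediction_rows(sample)
```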

To get the prediction results of the baseline model ONE on specific datasets, run:
./run_one.r
To compute performance indicators of compared models under aligned SQA-effort thresholds, run:
./r_script/computeIndicatorFromDetailResult.r
- threshold == -1: compute indicators using the model's default classification cutoff
- 0 <= threshold <= 1: compute indicators under an aligned threshold * 100% PII (percentage of instances inspected)
- 1 < threshold <= 100: compute indicators under an aligned threshold% PCI (percentage of code inspected)
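
The three threshold ranges above can be summarized as a small dispatch function. This is a Python sketch for illustration only; the function name and the returned mode strings are hypothetical and do not appear in MATTER's R scripts.

```python
def interpret_threshold(threshold):
    """Map a threshold value to (mode, fraction) following the three rules above.

    Returns ("default_cutoff", None) for -1, ("PII", fraction) for values in
    [0, 1], and ("PCI", fraction) for values in (1, 100].
    """
    if threshold == -1:
        return ("default_cutoff", None)
    if 0 <= threshold <= 1:
        # The value is already a fraction of instances inspected.
        return ("PII", float(threshold))
    if 1 < threshold <= 100:
        # The value is a percentage of code inspected; convert it to a fraction.
        return ("PCI", threshold / 100.0)
    raise ValueError(f"unsupported threshold: {threshold!r}")
```

Note that under these rules a threshold of exactly 1 means 100% PII, not 1% PCI.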

Implementations of models

The model implementations are in ./baseline-models/

Model	Code folder	Language	DOI
Bellwether	`./baseline-models/Bellwether`	Python2.x	10.1109/TSE.2018.2821670
CLA	`./baseline-models/CLA`	Java	10.1109/ASE.2015.56
Amasaki15-NB	`./baseline-models/Crosspare`	Java	10.1109/TSE.2017.2724538
CamargoCruz09-NB	`./baseline-models/Crosspare`	Java	10.1109/TSE.2017.2724538
Peters15-NB	`./baseline-models/Crosspare`	Java	10.1109/TSE.2017.2724538
CamargoCruz09-DT	`./baseline-models/Crosspare`	Java	10.1109/TSE.2017.2724538
Turhan09-DT	`./baseline-models/Crosspare`	Java	10.1109/TSE.2017.2724538
Watanabe08-DT	`./baseline-models/Crosspare`	Java	10.1109/TSE.2017.2724538
Menzies11-RF	`./baseline-models/Crosspare`	Java	10.1109/TSE.2017.2724538
EASC_NE	`./baseline-models/Crosspare`	Java	10.1109/TSE.2019.2939303
EASC_E	`./baseline-models/Crosspare`	Java	10.1109/TSE.2019.2939303
SC	`./baseline-models/SC`	R	10.1145/2884781.2884839
ManualDown	`./baseline-models/other`	Python3.x	10.1145/3183339
ManualUp	`./baseline-models/other`	Python3.x	10.1145/3183339
FCM	`./baseline-models/other`	Python3.x	10.1016/j.infsof.2020.106287

Datasets

The datasets are in ./dataset/nominal/dataset.7z

Dataset	#projects	#releases
AEEEM	5	5
ALLJURECZKO	31	62
IND-JLMIV+R-1Y_change59	38	59
MA-SZZ-2020	5	50
ReLink	3	3

Scripts for RQs and discussions in paper

./run_one.r Gets the prediction results of ONE on the datasets for different values of ONE's parameters (cutoff and excluded_code_size_percentage). The results are saved in ./ONE-result

./r_script/computeIndicatorFromDetailResult.r Computes the prediction performance of models for RQ1, RQ2, RQ3, and RQ4. threshold=0.2 compares models under 20% PII; threshold=20 compares models under 20% PCI.
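
To make the PII and PCI comparisons concrete, here is a minimal Python sketch of re-labeling predictions under an aligned effort budget. It rests on assumptions that may differ from MATTER's R implementation: instances are ranked by predictedValue in descending order, PII inspects the top fraction of instances, and PCI inspects ranked instances until the next one would exceed the SLOC budget. The function name is hypothetical.

```python
import math


def labels_under_effort(rows, mode, fraction):
    """Return 0/1 labels marking which instances are inspected under the budget.

    rows: list of dicts with "sloc" and "predictedValue" keys.
    mode: "PII" (fraction of instances) or "PCI" (fraction of total SLOC).
    """
    # Rank instance indices from most to least suspicious (assumed ordering).
    order = sorted(range(len(rows)),
                   key=lambda i: rows[i]["predictedValue"], reverse=True)
    labels = [0] * len(rows)

    if mode == "PII":
        # Small epsilon guards against floating-point products like 28.999999.
        budget = math.floor(fraction * len(rows) + 1e-9)
        for i in order[:budget]:
            labels[i] = 1
    elif mode == "PCI":
        budget = fraction * sum(row["sloc"] for row in rows)
        spent = 0.0
        for i in order:
            # Stop at the first instance that does not fit in the remaining budget.
            if spent + rows[i]["sloc"] > budget:
                break
            spent += rows[i]["sloc"]
            labels[i] = 1
    else:
        raise ValueError(f"unknown mode: {mode!r}")
    return labels


# Toy data for illustration only (total SLOC = 500).
toy_rows = [
    {"sloc": 100, "predictedValue": 0.9},
    {"sloc": 50, "predictedValue": 0.2},
    {"sloc": 300, "predictedValue": 0.7},
    {"sloc": 25, "predictedValue": 0.6},
    {"sloc": 25, "predictedValue": 0.1},
]
pii_labels = labels_under_effort(toy_rows, "PII", 0.4)  # top 2 of 5 instances
pci_labels = labels_under_effort(toy_rows, "PCI", 0.5)  # at most 250 SLOC
```

Because every model is re-labeled with the same budget, indicators computed from these labels compare models at equal inspection effort instead of at each model's own default cutoff.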

./r_script/KSETE_performance.R Runs KSETE and adds the mean KSETE result under one-to-one CPDP to the evaluated state-of-the-art models

./r_script/new_rq_threshold.r Draws the heatmap for RQ1

./r_script/indicator_relationship_under_threshold.r RQ2

./r_script/stability_of_EASC_NE_simulate.r RQ3, Fig. 10. Gets the simulated prediction results of EASC_NE

./r_script/splitResultByDataset.R Splits the results of RQ3 and RQ4 by dataset

./r_script/ScottKnottESD.r Gets the Scott-Knott ESD test results of the model comparisons in RQ3 and RQ4

./r_script/rq1_table.r Gets a formatted table of the median, mean, and standard deviation of the models' performance indicators for RQ3 and RQ4
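
The kind of summary this script produces can be sketched in a few lines of Python. This is an illustrative stand-in, not the R script itself; the function name, model names, and values are hypothetical, and the sample standard deviation is an assumption about which variant is reported.

```python
import statistics


def summarize(values_by_model):
    """Return median, mean, and sample standard deviation for each model."""
    table = {}
    for model, values in values_by_model.items():
        table[model] = {
            "median": statistics.median(values),
            "mean": statistics.mean(values),
            # stdev needs at least two values; fall back to 0.0 otherwise.
            "sd": statistics.stdev(values) if len(values) > 1 else 0.0,
        }
    return table


# Made-up indicator values for two hypothetical models.
summary = summarize({
    "model_a": [0.2, 0.4, 0.6],
    "model_b": [0.5],
})
for model, stats in summary.items():
    print(f"{model}\t{stats['median']:.3f}\t{stats['mean']:.3f}\t{stats['sd']:.3f}")
```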

./r_script/new_excluded_code_size_percentage.r Discussion 1

./r_script/dicussion2.r Discussion 2
