Introduction to statistics

This is my short introduction to statistics. It is published under Creative Commons license CC BY-NC. You can use this work non-commercially, but credit must be given, and there is no allowance for commercial use.

I designed and built this course for One Acre Fund to quickly train their data analysts and data scientists in basic statistical concepts that are vital to trial design and analysis. Due to the atypical background of One Acre Fund analysts, this course is designed to be as math-lite as possible. The aim of this course is therefore to build a kind of statistical intuition and statistical critical thinking.

At the end of this course, the reader should be able to oversee, design, and analyse a variety of trials important in impact-evaluation and product innovation.

Lesson 1 - Distributions, power and sample size

Highlights:

Interpreting and plotting distributions
Null hypothesis testing, p-values and significance
Non-parametric hypothesis testing
Power and sample size calculations
Monte-Carlo methods for non-parametric power calculations

Lesson 2 - intra-cluster correlation and effective trial design

Highlights:

Formulating effective hypotheses
Randomization
Intra-cluster correlation and sample size
P-value thresholds (aka alpha levels)
Using minimum detectable effects in sample size calculations

Lesson 3 - Linear regression and mixed effect models

Highlights:

Analysing cluster randomized trials
- Summarizing clusters
- Cluster-robust methods with mixed-effect models
Diagnosing and interpreting regression methods

Lesson 4 - Logistic regression

Highlights:

Analysing binary outcome variables
- Fixed effect models
- Mixed effect logistic modelling

Lesson 5 - Advanced statistical concepts

Highlights:

Introducing the multiple comparison problem (and corrections)
Introducing positive predictive value and other advanced thoughts on power
Refreshing some useful R functions

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
AMP-1-distributions.html		AMP-1-distributions.html
AMP-2-RCT-principles.html		AMP-2-RCT-principles.html
AMP-3-regressions_NHT.html		AMP-3-regressions_NHT.html
AMP-FAQ.html		AMP-FAQ.html
AMP4_RCT_analysis_p2.html		AMP4_RCT_analysis_p2.html
AMP5-summary.html		AMP5-summary.html
Contents.html		Contents.html
RCT_template1.Rmd		RCT_template1.Rmd
README.md		README.md
Simple_Rmd.Rmd		Simple_Rmd.Rmd
lesson-3-data.csv		lesson-3-data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Introduction to statistics

Lesson 1 - Distributions, power and sample size

Lesson 2 - intra-cluster correlation and effective trial design

Lesson 3 - Linear regression and mixed effect models

Lesson 4 - Logistic regression

Lesson 5 - Advanced statistical concepts

About

Releases

Packages

Languages

Michael-Bar/Introduction-to-statistics

Folders and files

Latest commit

History

Repository files navigation

Introduction to statistics

Lesson 1 - Distributions, power and sample size

Lesson 2 - intra-cluster correlation and effective trial design

Lesson 3 - Linear regression and mixed effect models

Lesson 4 - Logistic regression

Lesson 5 - Advanced statistical concepts

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages