Please note that this page is updated every teaching/academic year with new resources
Instructor: Mabel Carabali.
Schedule: Tuesdays and Thursdays (10:30 AM -12:25 PM).
- First day of class: August 28th, 2025.
- Last day of class: December 2nd, 2025.
Teaching Assistants:
- Julia Brillinger
- Gabrielle Jacob
TA HOURS
- Monday: 10 - 11 AM in Rm 1122
- Wednesday: 10 - 11 AM in Rm 1122
This is a first semester doctoral level course in epidemiologic theory and the estimation of epidemiologic effect measures and their confidence intervals in a variety of different study designs. This course places an emphasis on causal inference concepts and its distinction from, and yet simultaneously dependency, on statistical models. The course will also emphasize the limitations of mainstream null hypothesis statistical significance (NHST) paradigm and encourage the recognition of the value of an estimation (Bayesian) paradigm for optimal inferences.
The overarching goal is to establish and strengthen the foundation of epidemiological knowledge, by providing the basis of epidemiology, addressing fundamental and advanced aspects of descriptive epidemiology, causal inference, and data analysis tied to the identification of best practices given the numerous study designs.
The objective of EPIB 704 is to provide students with a deeper theoretical understanding of the foundations of epidemiology some and empirical tools for data handling, data analysis, critical thinking and interpretation of the epidemiological inference. This course is also an entry point for advanced epidemiology (e.g., advanced knowledge of traditional biases), specially an introduction to modern methods. Hence, the emphasis on keep it up with the literature. EPIB 704 will equip student with strategies to:
-
identify common threats of the validity or biases,
-
identify and interpret their effects in the measurement of associations, and
-
identify opportunities to investigate, correct, and address these biases.
The course will have a “hands-on” approach whereby the various key statistical concepts covered will be illustrated by computer coding, to allow standard and new calculus for modern epidemiologic methods. Counting has historically been the essential background to epidemiologic research and remains so in the
Attention will also be given to exploratory data analysis (tabular and graphical), data interpretation, critical examination of assumptions and reproducible research. Students will focus mainly on cumulative incidence measures for categorical outcomes, with attention to model checking and screening for confounders and effect measure modifiers. Regression models covered will include linear, logistic, Poisson, survival, and if time allows, meta-analytical models. Experimental and quasi-experimental designs will be discussed. Other topics that the course will touch on include attributable fractions, selection bias, measurement error, model building and sensitivity analyses, bootstrapping, matched data, and missing data.
- What if? by Miguel Hernán and James Robins, available here: https://www.hsph.harvard.edu/miguel-hernan/causal-inference-book/ (Chapters 1-11)
- Modern Epidemiology by Timothy L. Lash, Tyler J. VanderWeele, Sebastien Haneuse, Kenneth J. Rothman, available at the McGill Library here (General reference appropriate for most program courses) https://mcgill.on.worldcat.org/oclc/1236198056
- Regression and Other Stories by Andrew Gelman, Jennifer Hill, and Aki Vehtari, available here: https://users.aalto.fi/~ave/ROS.pdf
- If you want a deeper dive into Bayesian statistics, we suggest as an optional resource the excellent:
- Statistical rethinking: a Bayesian course with examples in R and Stan by Richard McElreath available at the McGill Library here: https://mcgill.worldcat.org/title/statistical-rethinking-a-bayesian-course-with-examples-in-r-and-stan/oclc/1145123627&referer=brief_results (Chapters 1-10).
- Associated website: https://xcelab.net/rm/statistical-rethinking/ and YouTube lectures here.
- Free electronic versions of all these books are available online but, depending on your financial situation, it may be worthwhile for you to invest in your own copies.
- Other readings will be assigned each week from published journal articles.
We have pre-selected a list of topics to go through the 13 weeks-long course. Each lecture is aimed at addressing specific epidemiological concepts, addressing both theoretical and analytic/practical aspects. At the beginning of each lecture/topic I will present the learning objectives and expectation. Here you will find a list of the (tentative) Lectures. The recommended readings and links to other resources will be posted on MyCourses by the start of the academic semester.
| Date | Topic | Slides |
|---|---|---|
| 28-Aug-25 | Lecture 01 - Introduction EPIB 704 | html; pdf |
| 02-Sep-25 | Lecture 02 - Statistical inference - Frequentist / Bayesian | html; pdf |
| 04-Sep-25 | Lecture 03 - Causal inference with DAGs | html; pdf |
| 09-Sep-25 | Lecture 04 - Measures of Occurrence | html; pdf |
| 11-Sep-25 | Lecture 05 - Measures of Association (I) | html; pdf |
| 16-Sep-25 | Lecture 06 - Measures of Association (II) | |
| 18-Sep-25 | Lecture 07 - Risk & Hazards - Introduction to the concept of Survival Analysis | html; pdf |
| 23-Sep-25 | Lecture 08 - Confounding (I) | html;pdf |
| 25-Sep-25 | Lecture 09 - Confounding (II) | html; pdf |
| 30-Sep-25 | Lecture 10 - Linear regression (with an introduction to Bayesian models) | html;pdf |
| 02-Oct-25 | Lecture 11- Logistic regression 1 - introduction | html; pdf |
| 07-Oct-25 | Lecture 12 - Logistic regression part 2 - worked Bayesian example | |
| 09-Oct-25 | Lecture 13 - Logistic vs binomial regressions and other alternatives to model binary outcomes | html;pdf |
| 21-Oct-25 | Lecture 14 - Matching and conditional Logistic Regression | html ;pdf |
| Lecture 17 - Poisson Regression | ||
| 04-Nov-25 | Lecture 18 - Introduction to Cluster/Hierarchical Models (guest lecture) | |
| 06-Nov-25 | Lecture 19 - Introduction to target trial emulation studies (guest lecture) | |
| 11-Nov-25 | Lecture 20 - Correlated Data - IPD-MA analysis (guest lecture) | |
| 13-Nov-25 |
Lecture 21 - Effect Measure Modification & Interaction (I) | html;pdf |
| 18-Nov-25 |
Lecture 22 - Effect Measure Modification & Interaction (II) | html;pdf |
| 20-Nov-25 |
Lecture 23 - Propensity score | html;pdf |
| 25-Nov-25 |
Lecture 24 - Measurement error - Core concepts | html;pdf |
| 27-Nov-25 |
Lecture 25 - Selection bias - core concepts | |
| 02-Dec-25 |
Lecture 26 - Introduction to Quasi-Experimental Designs | html;pdf |
| 02-Dec-25 | Lecture 27 - Model building/selection | html;pdf |
Data for assignments can be found in the folder: EPIB-704 /EPIB704_HW_data_2025/ or in the overall data folder /EPIB-704/tree/main/data folder. We have three options for you to import the assignments datasets into R:
-
Option 1: Install our epib.704 package:
- a) First, install the remotes package:
install.packages("remotes") - b) Then, install the epib.704.data package:
remotes::install_github("mcarabali1/epib.704.data") - c) Select and load the respective dataset by using
epib.704.data::datasetname
- a) First, install the remotes package:
-
Option 2: Install them directly in R using their URL:
- a) Assign the df URL to an R object:
urlfile <- "https://raw.githubusercontent.com/mcarabali1/EPIB-704/main/data/covidkenya.csv" - b) Import the df into R:
mydata <- read.csv(url(urlfile))
- a) Assign the df URL to an R object:
-
Option 3: Download the files directly from the repository onto your computer and import them into R.
