Introduction

alternative accessible content

Link to video transcript in a Word file

Most health outcomes are multifactorial, meaning that there are multiple factors that influence whether a given outcome will occur, and these other risk factors can introduce confounding that distorts our primary analysis. While stratification is very useful for adjusting for confound by one or two confounders at a time, it is not an efficient way to adjust for multiple confounding factors in a single analysis. In this module we will first extend our discussion of simple linear regression to introduce you to multiple linear regression in which we evaluate multiple independent variables when looking at a continuously distributed dependent variable (health outcome.) We will then introduce multiple logistic regression analysis as a tool for evaluating associations between exposures and dichotomous outcomes. We will draw on examples from the public health literature to discuss the interpretation of results from multiple linear and logistic regression models. You will also gain skills in using R to conduct and interpret these analyses using a public health data set.

Essential Questions

  1. How can we simultaneously take into account multiple factors that might affect a health outcome?
  2. How can we calculate an adjusted measure of effect, controlling for other factors, through statistical analysis?
  3. What analysis should be done if the outcome is dichotomous instead of continuous? How I perform multivariable regression analyses in R?

Learning Objectives

After completing this section, you will be able to: