Provide a broad overview of bio-statistical methods and concepts used in the public health sciences, emphasizing interpretation and concepts. Develop ability to read the scientific literature to critically evaluate study designs and methods of data analysis. Introduce basic concepts of statistical inference, including hypothesis testing, p-values, and confidence intervals. Topics include comparisons of means and proportions; the normal distribution; regression and correlation; confounding; concepts of study design, including randomization, sample size, and power considerations; logistic regression; and an overview of some methods in survival analysis. Draw examples of the use and abuse of statistical methods from the current biomedical literature.
After completion of this course, students will be able to understand and give examples of different types of data arising in public health studies; interpret differences in data distributions via visual displays; calculate standard normal scores and resulting probabilities; calculate and interpret confidence intervals for population means and proportions; interpret and explain a p-value; perform a two-sample t-test and interpret the results; calculate a 95% confidence interval for the difference in population means; use Stata to perform two sample comparisons of means and create confidence intervals for the population mean differences; understand and interpret results from Analysis of Variance (ANOVA), a technique used to compare means amongst more than two independent populations; choose an appropriate method for comparing proportions between two groups; construct a 95% confidence interval for the difference in population proportions; use Stata to compare proportions amongst two independent populations; understand and interpret relative risks and odds ratios when comparing two populations; understand why survival (timed to event) data requires its own type of analysis techniques; construct a Kaplan-Meier estimate of the survival function that describes the "survival experience" of a cohort of subjects; interpret the result of a log-rank test in the context of comparing the "survival experience" of multiple cohorts; and interpret output from the statistical software package Stata related to the various estimation and hypothesis testing procedures covered in the course.