Department of Biostatistics
Introduction
The Department of Biostatistics in the Gillings School of Global Public Health is recognized as a worldwide leader in research and practice. The B.S.P.H. in biostatistics provides highly qualified undergraduates with a curriculum consisting of a strong mathematical foundation; advanced coursework in statistical applications, theory, and computing; and an understanding of the public health sciences.
The undergraduate major in biostatistics prepares students to apply quantitative methods to design studies, implement methods, analyze data, and interpret results across a range of disciplines.
The degree provides an excellent foundation for continued studies (primarily graduate school in biostatistics, statistics, data science, or medical school). The job market for B.S.P.H. biostatistics graduates is strong, with previous students employed in a variety of fields including public health, pharmaceuticals, and medicine. This undergraduate program is believed to be the first undergraduate degree in biostatistics in the country.
Advising
The academic coordinator and director of undergraduate studies advise biostatistics undergraduates. Undergraduate students are encouraged to meet regularly with these advisors and review their Tar Heel Tracker each semester. Departmental academic advising is particularly important for those students who are considering proceeding to graduate school, pursuing a second major, or pursuing the dual B.S.P.H.–M.S. degree. Further information may be obtained through the departmental website.
Facilities
The Department of Biostatistics has a student library, a student study room, and computer facilities for its students.
Graduate School and Career Opportunities
In recent years, the majority of undergraduate biostatistics students from this program have chosen to attend medical school or graduate school in biostatistics (or other closely related fields) following graduation. More than 65 percent of recent graduates have entered medical school or graduate programs (primarily biostatistics) immediately following graduation. Recent graduates are attending top-ranked medical schools at Johns Hopkins University, Vanderbilt University, University of Virginia, Duke University, and UNC–Chapel Hill. Biostatistics graduate programs where students have matriculated immediately following completion of this undergraduate program include the University of Washington–Seattle, Harvard, UNC–Chapel Hill, Johns Hopkins University, and the University of Michigan.
Previous graduates who chose to seek employment have taken positions in the pharmaceutical industry, contract research organizations (CROs), and medical settings. Examples of employers of recent graduates include Duke Clinical Research Institute, Blue Cross Blue Shield, IQVIA, Rho, PPD, National Institutes of Environmental Health Science, and Research Triangle Institute. Traditionally, career opportunities have been outstanding for skilled biostatisticians.
Professors
Kevin Anstrom, Jianwen Cai, David J. Couper, Michael G. Hudgens, Joseph G. Ibrahim, Anastasia Ivanova, Gary G. Koch, Michael R. Kosorok, Yun Li, Danyu Lin, Feng-Chang Lin, Yufeng Liu, James Stephen Marron, Jane Monaco, Andrew B. Nobel, John S. Preisser, Bahjat Qaqish, Todd A. Schwartz, Richard Smith, Daniela T. Sotres-Alvarez, Xianming Tan, Kinh Truong, Haibo Zhou, Hongtu Zhu, Fei Zou.
Associate Professors
Robert Agans, Jamie L. Crandell, Tanya Garcia, Annie Green Howard, Yuchao Jiang, Quefeng Li, Michael Love, Naim Rashid, Di Wu, Baiming Zou.
Assistant Professors
Ethan Alt, Didong Li, Xihao Li, Yusha Li, Kara McCormack, Bonnie Shook-Sa.
Adjunct Professors
Haoda Fu, Eric Laber, Sean Simpson, Wei Sun, William Valdar, Clarice Weinberg, Donglin Zeng, Richard Zink.
Adjunct Associate Professors
Shanshan Zhao, Xiaojing Zheng.
Adjunct Assistant Professors
Charles Pepe-Ranney, Matthew Psioda, Zhengwu Zhang.
Adjunct Instructors
Jane Eslinger, Kinsey Helton, Marcus Herman-Giddens, Jeff Laux, Vincent Toups.
Professors Emeriti
Shrikant Bangdiwala, Lloyd E. Chambless, Clarence E. Davis, James E. Grizzle, Ronald W. Helms, William D. Kalsbeek, Lawrence L. Kupper, Lisa M. LaVange, Keith E. Muller, Dana E. Quade, Michael J. Symons.
BIOS–Biostatistics
Undergraduate-level
Academic credit for approved internship experience.
Directed readings or laboratory study. May be taken more than once. Two to six laboratory hours a week.
Advanced Undergraduate and Graduate-level
Access to SAS, Excel required. Permission of instructor for nonmajors. Introductory course in probability, data analysis, and statistical inference designed for B.S.P.H. biostatistics students. Topics include sampling, descriptive statistics, probability, confidence intervals, tests of hypotheses, chi-square distribution, 2-way tables, power, sample size, ANOVA, non-parametric tests, correlation, regression, survival analysis.
Required preparation, previous or concurrent course in applied statistics. Permission of instructor for nonmajors. Introduction to use of computers to process and analyze data, concepts and techniques of research data management, and use of statistical programming packages and interpretation. Focus is on use of SAS for data management and reporting.
Students will gain proficiency with R, data wrangling, data quality control and cleaning, data visualization, exploratory data analysis, with an overall emphasis on the principles of good data science, particularly reproducible research. The course will also develop familiarity with several software tools for data science best practices, such as Git, Docker, Jupyter, Make, and Nextflow.
Arrangements to be made with the faculty in each case. A course for students of public health who wish to make a study of some special problem in the statistics of the life sciences and public health. Honors version available.
Required preparation, knowledge of basic descriptive statistics. Major topics include elementary probability theory, probability distributions, estimation, tests of hypotheses, chi-squared procedures, regression, and correlation.
Topics will include gaining proficiency with R and Python, data wrangling, data quality control and cleaning, data visualization, exploratory data analysis, and introductory applied optimization, with an overall emphasis on the principles of good data science, particularly reproducible research. Some emphasis will be given to large data settings such as genomics or claims data. The course will also develop familiarity with software tools for data science best practices, such as Git, Docker, Jupyter, and Nextflow.
This course will be an introductory course to machine learning. The goal is to equip students with knowledge of existing tools for data analysis and to prepare students for more advanced courses in machine learning. Students in the SPH Master of Public Health with a Public Health Data Science concentration receive priority for enrollment.
Course is designed to meet the needs of health care professionals to appraise the design and analysis of medical and health care studies and who intend to pursue academic research careers. Covers basics of statistical inference, analysis of variance, multiple regression, categorical data analysis. Previously offered as PUBH 741. Permission of instructor.
Continuation of BIOS 641. Main emphasis is on logistic regression; other topics include exploratory data analysis and survival analysis. Previously offered as PUBH 742.
Required preparation, basic familiarity with statistical software (preferably SAS able to do multiple linear regression) and introductory biostatistics, such as BIOS 600. Continuation of BIOS 600. Analysis of experimental and observational data, including multiple regression and analysis of variance and covariance. Previously offered as BIOS 545. Permission of the instructor for nonmajors.
Required preparation, two semesters of calculus (such as MATH 231, 232). Fundamentals of probability; discrete and continuous distributions; functions of random variables; descriptive statistics; fundamentals of statistical inference, including estimation and hypothesis testing.
Required preparation, three semesters of calculus (such as MATH 231, 232, 233). Introduction to probability; discrete and continuous random variables; expectation theory; bivariate and multivariate distribution theory; regression and correlation; linear functions of random variables; theory of sampling; introduction to estimation and hypothesis testing.
Distribution of functions of random variables; Helmert transformation theory; central limit theorem and other asymptotic theory; estimation theory; maximum likelihood methods; hypothesis testing; power; Neyman-Pearson Theorem, likelihood ratio, score, and Wald tests; noncentral distributions.
Principles of study design, descriptive statistics, sampling from finite and infinite populations, inferences about location and scale. Both distribution-free and parametric approaches are considered. Gaussian, binomial, and Poisson models, one-way and two-way contingency tables.
Required preparation, BIOS 662. Matrix-based treatment of regression, one-way and two-way ANOVA, and ANCOVA, emphasizing the general linear model and hypothesis, as well as diagnostics and model building. Reviews matrix algebra. Includes statistical power for linear models and binary response regression methods.
Fundamental principles and methods of sampling populations, with emphasis on simple, random, stratified, and cluster sampling. Sample weights, nonsampling error, and analysis of data from complex designs are covered. Practical experience through participation in the design, execution, and analysis of a sampling project.
Introduction to the analysis of categorized data: rates, ratios, and proportions; relative risk and odds ratio; Cochran-Mantel-Haenszel procedure; survivorship and life table methods; linear models for categorical data. Applications in demography, epidemiology, and medicine.
Matrix-based longitudinal data analysis emphasizing applications and interpretation. Linear and generalized linear, marginal and mixed regression models. Fixed effects and random effects. Maximum likelihood, REML, GEE. Regression diagnostics. Sample size. Simulation of longitudinal data.
Statistical concepts in basic public health study designs: cross-sectional, case-control, prospective, and experimental (including clinical trials). Validity, measurement of response, sample size determination, matching and random allocation methods.
Provides a foundation and training for working with data from clinical trials or research studies. Topics: issues in study design, collecting quality data, using SAS and SQL to transform data, typical reports, data closure and export, and working with big data.
Source and interpretation of demographic data; rates and ratios, standardization, complete and abridged life tables; estimation and projection of fertility, mortality, migration, and population composition.
Selected topics in calculus, real analysis including Taylor's series, Riemann, Stieltjes and Lebesgue integration, and complex variables. Introduction to measure theory.
This course introduces intermediate concepts and theories in statistical inferences, including multivariate transformation, convergence of random vectors, sufficient and complete statistics, methods of estimation, and advanced problems such as information inequality, unbiased estimators, Bayes estimators, asymptotically efficient estimation, nonparametric estimation, and simultaneous confidence intervals.
Introduction to concepts and techniques used in the analysis of time to event data, including censoring, hazard rates, estimation of survival curves, regression techniques, applications to clinical trials.
Field/topical/research seminar. Instructors use this course to offer instruction in particular topics or approaches.
Field visits to, and evaluation of, major nonacademic biostatistical programs in the Research Triangle area. Field fee: $25.
Directed research. Written and oral reports required.
Directed research. Written and oral reports required.
Department of Biostatistics
3103 McGavran-Greenberg Hall, CB# 7420
(919) 966-7256
Chair
Michael G. Hudgens