Department of Statistics and Operations Research
Introduction
The major in statistics and analytics (STAN) is an excellent program for students interested in statistical data science, operations research, and actuarial science, as well as in fields such as business, economics, public policy and health, psychology, and biomedicine, where the decision and statistical sciences play an increasingly important role.
Particular areas in which graduates can obtain employment or continue with graduate study include:
Statistics
Students in this area study the mathematical theories of probability and statistics and their application to mathematical models that contain an element of uncertainty or randomness. Opportunities for employment are manifold in businesses and government agencies, and include a broad range of areas from the natural sciences, social sciences, and technology. Concrete examples include pharmacology, genomics, medicine, environmental sciences, social network analysis, and information technology.
Operations Research
In this area, students study mathematical, statistical, and computational techniques related to decision making. Operations research is crucial in business, government, and other management areas where decisions are made by solving large, complex problems (for example, crew scheduling for airlines, and the design of online recommendation systems). In addition to their major courses, students interested in this field are encouraged to take courses in business and economics.
Actuarial Science
Actuaries work primarily in businesses that involve financial risk, including the insurance industry. Students interested in this field take advanced courses in statistics, stochastic processes, and the mathematical theory of risk.
Advising
All majors and minors have a primary academic advisor in Steele Building. Students are strongly encouraged to meet regularly with their advisor and review their Tar Heel Tracker each semester. STAN majors and minors are also encouraged to meet with departmental advisors to discuss course planning before registration each semester. The director of undergraduate studies works with prospective majors and minors by appointment. Additional information on courses, undergraduate research opportunities, the honors program, careers, and graduate schools may be obtained from the department’s website or by contacting the director of undergraduate studies.
Courses for Students from Other Departments
The Department of Statistics and Operations Research offers a variety of courses of potential value to students majoring in other disciplines. Introductory courses include STOR 113 and STOR 215, which are foundation courses in decision models, and the basic statistical courses, STOR 120 and STOR 155. At the intermediate level, STOR 305 provides an introduction to business decision models, while STOR 320 is an introductory course to data science. Substantial coverage of applied statistical methods is provided in STOR 455 and STOR 556. At more advanced mathematical levels, an introduction to probability theory is provided by STOR 435 (or STOR 535), an introduction to proof techniques and discrete mathematics is given in STOR 315, and the basic theory of statistical inference is given by STOR 555. More advanced deterministic and stochastic models of operations research are provided in STOR 415 and STOR 445. Machine learning is covered in STOR 565 and STOR 566.
Graduate School and Career Opportunities
Regardless of the electives chosen, the statistics and analytics degree program provides excellent preparation for graduate study. Graduates with concentrations in operations research or statistics often continue work at the graduate level in those fields or related areas such as industrial engineering, biostatistics, and environmental science, or enter business school to pursue a master’s in business administration (M.B.A.) degree.
A five-year B.S.–M.S. degree program in statistics, operations research and data science is also an option. Interested students should consult the director of graduate studies for the operations research program.
Graduates of the statistics and analytics program will find numerous opportunities for well-paid, challenging jobs.
Professors
Nilay Argon, Shankar Bhamidi, Amarjit Budhiraja, Jan Hannig, Vidyadhar G. Kulkarni, Yufeng Liu, James Stephen Marron, Andrew Nobel, Mariana Olvera-Cravioto, Gabor Pataki, Vladas Pipiras, Richard L. Smith, Serhan Ziya.
Associate Professors
Sayan Banerjee, Nicolas Fraiman, Chuanshu Ji, Quoc Tran-Dinh, Kai Zhang.
Assistant Professors
Guanting Chen, Xiangying Huang, Daniel Kessler, Yao Li, Patrick Lopatto, Michael O'Neill, Zhengwu Zhang.
Teaching Associate Professor
Jeffrey McLean.
Teaching Assistant Professors
Oluremi Abayomi, Charles Dunn, Mario Giacomazzo, William Lassiter.
Joint Professors
Joseph Ibrahim, Michael Kosorok, Jayashankar Swaminathan.
Professors Emeriti
George S. Fishman, Douglas G. Kelly, J. Scott Provan, David S. Rubin, Gordon D. Simons, Shaler Stidham Jr., Jon W. Tolle.
STOR–Statistics and Operations Research
Undergraduate-level Courses
In this course, we will investigate the structure of these decision problems, show how they can be solved (at least in principle), and solve some simple problems.
Networks, mathematical structures that are composed of nodes and a set of lines joining the nodes, are used to model a wide variety of familiar systems.
This seminar aims to show that contrary to common belief, statistics can be exciting and fun. The seminar will consist of three modules: statistics in our lives, randomness, and principles of statistical reasoning.
The aim of this class is to study the role of uncertainty in our daily lives, to explore the cognitive biases that impair us, and to understand how one uses quantitative models to make decisions under uncertainty in a wide array of fields including medicine, law, finance, and the sciences.
This seminar will use recently assembled historical material to tell the exciting story of the origins and development of operations research during and after World War II.
We will study some basic statistical decision-making procedures and the errors and losses they lead to. We will analyze the effects of randomness on decision making using computer experimentation and physical experiments with real random mechanisms like dice, cards, and so on.
Studies the Environmental Protection Agency's Criteria Document, mandated by the Clean Air Act; this document reviews current scientific evidence concerning airborne particulate matter. Students learn some of the statistical methods used to assess the connections between air pollution and mortality, and prepare reports on studies covered in the Criteria Document.
The theory of probability, which can be used to model the uncertainty and chance that exist in the real world, often leads to surprising conclusions and seeming paradoxes. We survey and study these, along with other paradoxes and puzzling situations arising in logic, mathematics, and human behavior.
This course is designed to emphasize the motivation, philosophy, and cultivation of statistical reasoning in the interdisciplinary areas of statistical science and bioinformatics.
Introduces basic concepts in finance and economics, useful tools for collecting and summarizing financial data, and simple probability models for quantification of market uncertainty.
This seminar looks at a variety of ways in which modern computational tools allow easy and informative viewing of data. Students will also study the kinds of choices that have to be made in data presentation and viewing.
Introduces students to the world of genetics and DNA and to the use of computers to organize and understand the complex systems associated with the structure and dynamics of DNA and heredity.
Special Topics Course. Contents will vary each semester.
An introduction to multivariable quantitative models in economics. Mathematical techniques for formulating and solving optimization and equilibrium problems will be developed, including elementary models under uncertainty.
Students will use mathematical and statistical methods to address societal problems, make personal decisions, and reason critically about the world. Authentic contexts may include voting, health and risk, digital humanities, finance, and human behavior. This course does not count as credit towards the psychology or neuroscience majors.
The course teaches critical concepts and skills in computer programming and statistical inference, in conjunction with hands-on analysis of real-world datasets, including economic data, document collections, geographical data, and social networks. It delves into social issues surrounding data analysis such as privacy and design.
Elementary introduction to statistical reasoning, including sampling, elementary probability, statistical inference, and data analysis. STOR 151 may not be taken for credit by students who have credit for ECON 400 or PSYC 210.
Data analysis; correlation and regression; sampling and experimental design; basic probability (random variables, expected values, normal and binomial distributions); hypothesis testing and confidence intervals for means, proportions, and regression parameters; use of spreadsheet software.
Examines selected topics from statistics and operations research. Course description is available from the department office.
Introduction to basic concepts and techniques of discrete mathematics with applications to business and social and physical sciences. Topics include logic, sets, functions, combinatorics, discrete probability, graphs, and networks.
This course introduces students to some of the key mathematical tools underlying algorithmic data science. The primary focus of the course is matrix algebra and multivariable calculus. The mathematical topics covered in the course will be motivated and connected by concrete applications in data science, with an emphasis on machine learning and optimization.
Examines selected topics from statistics and operations research. Course description is available from the department office.
Experience includes preparations, demonstrations, assistance, and attendance at weekly meetings and lab sections. This course will enable you to deepen your understanding of topics in statistics and data science by learning the material with an eye toward explaining it to other less-experienced students, and; develop pedagogical skills, such as developing a rapport with learners, engaging in clear oral and written communication, and taking the perspective of less experienced students.
The use of mathematics to describe and analyze large-scale decision problems. Situations involving the allocation of resources, making decisions in a competitive environment, and dealing with uncertainty are modeled and solved using suitable software packages. Students cannot enroll in STOR 305 if they have already taken STOR 415.
The course gives a solid introduction to rigorous mathematical thinking and problem solving, all of which are fundamental in data science. It covers proofs, mathematical induction, counting, and the basics of graph theory.
Development of basic skill set for data analysis from obtaining data to data carpentry, exploration, modeling, and communication. Topics covered include regression, clustering, classification, algorithmic thinking, and non-standard data objects (networks and text data). Students may not receive credit for both STOR 320 and STOR 520.
Fundamental principles and methods of sampling populations, with emphasis on simple, random, stratified, and cluster sampling. Sample weights, nonsampling error, and analysis of data from complex designs are covered. Practical experience through participation in the design, execution, and analysis of a sampling project.
Examines selected topics from statistics and operations research. Course description is available from the department office.
Advanced Undergraduate and Graduate-level Courses
Linear, integer, nonlinear, and dynamic programming, classical optimization problems, network theory.
Introduction to mathematical theory of probability covering random variables; moments; binomial, Poisson, normal and related distributions; generating functions; sums and sequences of random variables; and statistical applications. Students may not receive credit for both STOR 435 and STOR 535.
Introduction to Markov chains, Poisson process, continuous-time Markov chains, renewal theory. Applications to queueing systems, inventory, and reliability, with emphasis on systems modeling, design, and control.
Review of basic inference; two-sample comparisons; correlation; introduction to matrices; simple and multiple regression (including significance tests, diagnostics, variable selection); analysis of variance; use of statistical software.
Probability models for long-term insurance and pension systems that involve future contingent payments and failure-time random variables. Introduction to survival distributions and measures of interest and annuities-certain.
Short term probability models for potential losses and their applications to both traditional insurance systems and conventional business decisions. Introduction to stochastic process models of solvency requirements.
This course will introduce students to the healthcare industry and provide hands-on experience with key actuarial and analytical concepts that apply across the actuarial field. Using real world situations, the course will focus on how mathematics and the principles of risk management are used to help insurance companies and employers make better decisions regarding employee benefit insurance products and programs.
Examines selected topics from statistics and operations research. Course description is available from the department office.
Requires permission of the department. Statistics and analytics majors only. An opportunity to obtain credit for an internship related to statistics, operations research, or actuarial science. Pass/Fail only. Does not count toward the statistics and analytics major or minor.
Permission of the director of undergraduate studies. This course is intended mainly for students working on honors projects. May be repeated for credit.
This is an upper-level course focusing on optimization aspects of common and practical problems and topics in statistical learning, machine learning, neural networks, and modern AI. It covers several topics such as optimization perspective of linear regression, nonlinear regression, matrix factorization, stochastic gradient descent, regularization techniques, neural networks, deep learning techniques, and minimax models.
An introduction to algorithms and modeling techniques that use knowledge gained from prior experience to make intelligent decisions in real time. Topics include Markov decision processes, dynamic programming, multiplicative weights update, exploration vs. exploitation, multi-armed bandits, and two player games.
This course provides hands-on experience working with data sets provided in class and downloaded from certain public websites. Lectures cover basic topics such as R programming, visualization, data wrangling and cleaning, exploratory data analysis, web scraping, data merging, predictive modeling, and elements of machine learning. Programming analyses in more advanced areas of data science. Students may not receive credit for both STOR 320 and STOR 520.
This course is an advanced undergraduate course in probability with the aim to give students the technical and computational tools for advanced courses in data analysis and machine learning. It covers random variables, moments, binomial, Poisson, normal and related distributions, generating functions, sums and sequences of random variables, statistical applications, Markov chains, multivariate normal and prediction analytics. Students may not receive credit for both STOR 435 and STOR 535.
This course will survey the history of sports analytics across multiple areas and challenge students in team-based projects to practice sports analytics. Students will learn how applied statistics and mathematics help decision makers gain competitive advantages for on-field performance and off-field business decisions.
Functions of random samples and their probability distributions, introductory theory of point and interval estimation and hypothesis testing, elementary decision theory.
This course covers the fundamental theory and methods for time series data, as well as related statistical software and real-world data applications. Topics include the autocorrelation function, estimation and elimination of trend and seasonality, estimation and forecasting procedures in ARMA models and nonstationary time series models.
The course covers advanced data analysis methods beyond those in STOR 455 and how to apply them in a modern computer package, specifically R or R-Studio which are the primary statistical packages for this kind of analysis. Specific topics include (a) Generalized Linear Models; (b) Random Effects; (c) Bayesian Statistics; (d) Nonparametric Methods (kernels, splines and related techniques).
Introduction to theory and methods of machine learning including classification; Bayes risk/rule, linear discriminant analysis, logistic regression, nearest neighbors, and support vector machines; clustering algorithms; overfitting, estimation error, cross validation.
Deep neural networks (DNNs) have been widely used for tackling numerous machine learning problems that were once believed to be challenging. With their remarkable ability of fitting training data, DNNs have achieved revolutionary successes in many fields such as computer vision, natural language progressing, and robotics. This is an introduction course to deep learning.
This upper-level-undergraduate and beginning-graduate-level course introduces the concepts of modeling, programming, and statistical analysis as they arise in stochastic computer simulations. Topics include modeling static and discrete-event simulations of stochastic systems, random number generation, random variate generation, simulation programming, and statistical analysis of simulation input and output.
Examines selected topics from statistics and operations research. Course description is available from the department office.
STOR 612 consists of three major parts: linear programming, quadratic programming, and unconstrained optimization. Topics: Modeling, theory and algorithms for linear programming; modeling, theory and algorithms for quadratic programming; convex sets and functions; first-order and second-order methods such as stochastic gradient methods, accelerated gradient methods and quasi-Newton methods for unconstrained optimization.
STOR 614 consists of three major parts: Integer programming, conic programming, and nonlinear optimization. Topics: modeling, theory and algorithms for integer programming; second-order cone and semidefinite programming; theory and algorithms for constrained optimization; dynamic programming; networks.
Required preparation, advanced calculus. Lebesgue and abstract measure and integration, convergence theorems, differentiation. Radon-Nikodym theorem, product measures. Fubini theorems. Lp spaces.
Foundations of probability. Basic classical theorems. Modes of probabilistic convergence. Central limit problem. Generating functions, characteristic functions. Conditional probability and expectation.
The aim of this 3-credit graduate course is to introduce stochastic modeling that is commonly used in various fields such as operations research, data science, engineering, business, and life sciences. Although it is the first course in a sequence of three courses, it can also serve as a standalone introductory course in stochastic modeling and analysis. The course covers the following topics: discrete-time Markov chains, Poisson processes, and continuous-time Markov chains.
This 3-credit course is the second graduate-level course on stochastic modeling that expands upon the material taught in STOR 641. The course covers the following topics: renewal and regenerative processes, queueing models, and Markov decision processes.
Required preparation, two semesters of advanced calculus. Probability spaces. Random variables, distributions, expectation. Conditioning. Generating functions. Limit theorems: LLN, CLT, Slutsky, delta-method, big-O in probability. Inequalities. Distribution theory: normal, chi-squared, beta, gamma, Cauchy, other multivariate distributions. Distribution theory for linear models.
Point estimation. Hypothesis testing and confidence sets. Contingency tables, nonparametric goodness-of-fit. Linear model optimality theory: BLUE, MVU, MLE. Multivariate tests. Introduction to decision theory and Bayesian inference.
Permission of the instructor. Basics of linear models: matrix formulation, least squares, tests. Computing environments: SAS, MATLAB, S+. Visualization: histograms, scatterplots, smoothing, QQ plots. Transformations: log, Box-Cox, etc. Diagnostics and model selection.
ANOVA (including nested and crossed models, multiple comparisons). GLM basics: exponential families, link functions, likelihood, quasi-likelihood, conditional likelihood. Numerical analysis: numerical linear algebra, optimization; GLM diagnostics. Simulation: transformation, rejection, Gibbs sampler.
Introduces students to modeling, programming, and statistical analysis applicable to computer simulations. Emphasizes statistical analysis of simulation output for decision-making. Focuses on discrete-event simulations and discusses other simulation methodologies such as Monte Carlo and agent-based simulations. Students model, program, and run simulations using specialized software. Familiarity with computer programming recommended.
The purpose of this course is to provide a strong foundation in computational skills needed for reproducible research in data science and statistics. Topics will include computational tools and programming skills to facilitate reproducibility, as well as procedures and methods for reproducible conclusions.
Examines selected topics from statistics and operations research. Course description is available from the department office.
Permission of the department. Majors only. Individual reading, study, or project supervised by a faculty member.
Permission of the department. Majors only. Individual reading, study, or project supervised by a faculty member.
This course is designed to give Statistics & Analytics (STAN) majors an opportunity to integrate and apply the knowledge and skills acquired throughout the STAN degree. At the beginning of the semester, the instructor will present to the class a broad description of several problems originating from external industry partners, and covering a wide range of statistics, modeling, optimization, and data science topics. Students will work on these problems throughout the remainder of the semester.
Department of Statistics and Operations Research
318 Hanes Hall, CB# 3260
(919) 843-6024
Chair
Jan Hannig