Data Science (DATA)

Courses

DATA 1301. INTRODUCTION TO DATA SCIENCE. 3 Hours.

This course provides an introduction to the field of data science with a high level overview of basic concepts, data types, and techniques while introducing data-informed decision making.

DATA 3311. MATHEMATICS FOR DATA SCIENCE. 3 Hours.

This course covers techniques from linear algebra and probability with an emphasis on how they are used in data science. Working with real data sets will be emphasized, along with basics of Matlab or R programming. Prerequisite: MATH 1426.

DATA 3401. PYTHON FOR DATA SCIENCE 1. 4 Hours.

This is the first of a two course sequence offering the foundations of Python programming in the context of data science. It introduces the full syntax of the Python language as it overviews structured, functional, and object oriented programming methodologies. It also provides a basic conceptual understanding of computing and introduces Unix command-line tools, software employed in data science such as git and Jupyter, and Python libraries such as numpy, matplotlib, and Pandas. Prerequisite: MATH 1426 or concurrent enrollment in MATH 1426.

DATA 3402. PYTHON FOR DATA SCIENCE 2. 4 Hours.

This is the second of a two course sequence offering the foundations of Python programming in the context of data science. It reinforces concepts presented in DATA 3401 with greater depth with a focus on application to various problems in data science, while exploring the python library ecosystem. Prerequisite: DATA 3401, or consent of instructor.

DATA 3421. DATA MINING, MANAGEMENT, AND CURATION. 4 Hours.

This lecture and lab course will provide training in working with databases, including data mining techniques and principles and best practices in data management, storage, and curation. Prerequisite: DATA 3401 or consent of instructor.

DATA 3441. STATISTICAL METHODS FOR DATA SCIENCE 1. 4 Hours.

This lecture and lab course will provide an introduction to the fundamental building blocks of advanced data analysis, with emphasis on advanced linear algebra, optimization, statistical inference, and Monte Carlo methods. Working with real data sets will be emphasized, along with basics of R programming. Prerequisite: DATA 3401, DATA 3402 or consent of instructor.

DATA 3442. STATISTICAL METHODS FOR DATA SCIENCE 2. 4 Hours.

This lecture and lab course will provide an introduction to the principles and general methods for the analysis of categorical data. This type of data occurs extensively in both observational and experimental studies, as well as industrial applications. While some theoretical statistical detail is given, the primary focus will be on methods of data analysis. Topics include generalized regression models, logistic regression models, Poisson regression models, and multinomial regression models. Problems will be motivated from a scientific perspective. Prerequisite: DATA 3441.

DATA 3461. MACHINE LEARNING. 4 Hours.

This course introduces and surveys Machine Learning techniques and their application to various problems in data science. Prerequisite: DATA 3401, DATA 3402 or consent of instructor.

DATA 4380. DATA PROBLEMS. 3 Hours.

This course is intended for Junior-level Data Science students, and will enable them to identify, define, and explore a number of potential problems and projects, for follow-up in the capstone course sequence. Prerequisite: Permission of instructor.

DATA 4381. DATA CAPSTONE PROJECT 1. 3 Hours.

This is the first of a two-semester sequence that will involve deep engagement in a team or individual project in Data Science. Presentation of written and oral reports will be required. Prerequisite: DATA 4380.

DATA 4382. DATA CAPSTONE PROJECT 2. 3 Hours.

This is the second of a two-semester sequence that will involve deep engagement in a team or individual project in Data Science. Presentation of written and oral reports will be required. Prerequisite: DATA 4381.