400

DS 401 Machine & Statistical Learning

Machine and Statistical Learning is a collection of mathematical and statistical techniques used to detect, classify, and infer patterns in large and/or complex data sets. Examples of machine and statistical learning algorithms are all around us: speech recognition on your phone, text prediction in internet searches, medical school placement algorithms, and the prediction of what you may want to watch next on your video stream. This course gives an overview of many concepts, techniques, and algorithms in modern machine and statistical learning including both supervised and unsupervised learning. Topics include linear regression, classification, cross validation, dimension reduction, nonlinear regression, tree-based methods, support vector machines, principal component analysis, artificial neural networks, and clustering. The course will give students the ideas and intuition behind modern machine and statistical learning methods as well as a more formal understanding of how, why, and when they work. The underlying theme in the course is application of the algorithms to real data sets.

DS 405 Computing for Data Science

This course will explore current techniques in computation for data science, including (but not limited to) parallel and distributed coding algorithms and modern programming languages.

DS 430 Senior Project

This course will present students with a substantial experience in data analysis. Students will investigate and analyze data from a variety of sources, working both as individuals and in project teams. This serves as a capstone experience for the DS major.