Notice: For the latest information and guidance on Purdue's response to COVID-19 please visit:

Introduction to Data Science

Data Science


Milind Kulkarni, Associate Professor in the School of Electrical and Computer Engineering

Course Outcomes

A student who successfully fulfills the course requirements will have demonstrated the abilities to write data analyses in Python, to build statistical models and use them for prediction, and to design analyses/models to solve engineering problems.

These outcomes are extremely high level. In more detail, after taking this course you will be able to explain data analysis and modeling algorithms like sampling, estimation, and regression; to write basic data analyses in Python, taking advantage of language features such as higher-order functions (map/reduce) and complex data structures (including NumPy arrays); and to use these tools to propose, design, and implement a set of data analyses to solve engineering problems, then visualize and present the results.

Course Topics include Python Basics (loops, functions, arrays, lists); histograms; higher-order functions, closures, map/reduce; distribution; N-grams; Estimation: sampling, mean, variance, confidence intervals, significance tests; Numpy arrays and matrices; linear regression and prediction; dealing with missing data; classification; clustering; and neural networks.

CEUs: 1.5

Course Opens: Monday, January 13, 2020

Last Day to Register: Friday, January 17, 2020

Course Closes: Monday, February 17, 2020

For further information and to register, please visit Purdue Online's course catalog.

Questions? Email