IS 517 Methods for Data Science

A dramatic increase in computing power has enabled new areas of data science to develop in statistical modeling and analysis. These areas cover predictive and descriptive learning bridge ideas and theory in statistics, computer science and artificial intelligence. We will cover many of these new methods including predictive learning such as estimating models from data to predict future outcomes, notably regression and classification models. Regression topics include linear regression with recent advances to deal with large numbers of variables, smoothing techniques, additive models, and local regression. Classification topics include discriminant analysis, logistic regression, support vector machines, generalized additive models, naive Bayes, mixture models and nearest neighbor methods. Lastly we develop neural networks and deep learning techniques, bridging the theory introduced in the earlier parts of the class to purely empirical methods. We situate these methods in the "data science lifecycle" as part of the larger set of practices in the discovery and communication of scientific findings.

Previously IS590MD.

Recent syllabus

Scheduled Offerings

  • Spring 2020

    • IS590MD - Methods for Data Science
      Wed, 9:00 am - 11:50 am
      Room 131
      Prerequisites are LIS/IS542 or equivalent (e.g. intro probability/stats STAT 100, CS361, or ECON 202) and LIS490IDS/CS398IDS/STAT430IDS or CS101 or equivalent; or consent of the instructor. Linear Algebra recommended at the level of MATH 125 or higher; Calculus recommended at the level of MATH 220.
      Yang Wang
      16 weeks
      Credit hours

Textbooks and Course Materials

Available from the Illinois Union Bookstore (IUB).