Reinforcement Learning: Theory and Algorithms

ECE59500

Credit Hours:

Learning Objective:

A student who successfully fulfills the course requirements will have demonstrated an ability to:

Explain different problem formulations for reinforcement learning
Apply various algorithmic solutions to a wide range of sequential decision-making problems
Analyze the performance capabilities and limitations of different algorithm for sequential decision making
An ability to conduct a research project by collaborating with one or more partners and write a scientific report of the research findings

Description:

This course introduces the foundations and he recent advances of reinforcement learning, an area of machine learning closely tied to optimal control that studies sequential decision-making under uncertainty. This course aims to create a deep understanding of the theoretical and algorithmic foundations of reinforcement learning while discussing the practical considerations and various extensions of reinforcement learning.

Topics Covered:

Week	Lecture Topics
1	Introduction, motivation, overview of relevant background
2	Dynamic programming and policy evaluation
3	Policy iteration and value iteration
4	Monte Carlo and temporal difference methods
5	Computational complexity and statistical limits
6	Linear quadratic regulators (LQR) and optimal control
7	Optimal control for nonlinear systems (Iterative LQR)
8	Prediction, estimation, and Kalman filtering
9	Model-based and model-free reinforcement learning
10	Approximate policy iteration and deep Q-learning
11	Conservative policy iteration and trust region methods
12	Stochastic gradient descent and policy gradient
13	Exploration in reinforcement learning and multi-armed bandits
14	Partially observable Markov decision processes and risk-averse reinforcement learning
15	Inverse reinforcement learning, meta-learning, transfer learning, and multi-agent reinforcement learning

Prerequisites:

Undergraduate understanding of linear algebra, probability, calculus

Web Address:

https://purdue.brightspace.com

Textbooks:

Required:

None

Bandit Algorithms, Lattimore, Tor; Szepesvari, Csaba, Cambridge University Press, 2020
Dynamic Programming and Optimal Control, Bertsekas, Dimitri P., Athena Scientific, 2011
Foundations of Deep Reinforcement Learning, Graesser, Laura; Keng, Wah Loon, Addision-Wesley Professional, 2019
Markov Decision Processes: Discrete Stochastic Dynamic Programming, Puterman, Martin L., John Wiley & Sons, 2014
Neuro-dynamic Programming, Bertsekas, Dimitri P.; Tsisiklis, John N., Athena Scientific, 1996
Reinforcement Learning: An Introduction, Sutton, Richard S.; Barto, Andrew G., MIT Press, 2018
Reinforcement Learning: Theory and Algorithms, Agarwal, Alekh; Jiang, Nan; Kakade, Sham M.; Sun, Wen, 2019

ProEd Minimum Requirements:

view

Instructor(s)

About Us

WHY PURDUE UNIVERSITY

LEARN MORE

OUR TEAM & RESOURCES

Online Programs & Courses

DOCTORAL PROGRAMS

MASTER PROGRAMS

GRADUATE CERTIFICATIONS

COURSES

Courses

PROFESSIONAL CERTIFICATIONS

PROFESSIONAL COURSES

What’s Next for Prospective Students

APPLY

DEGREE PROGRRAM APPLICANTS

GRADUATE CERTIFICATE & NON-DEGREE ADMISSIONS INFORMATION

Student Resources

QUICK RESOURCES

YOUR COURSES

EXAMS

Reinforcement Learning: Theory and Algorithms

ECE59500

Credit Hours:

Learning Objective:

Description:

Topics Covered:

Prerequisites:

Web Address:

Required:

Recommended:

ProEd Minimum Requirements:

Instructor(s)

Mahsa Ghasemi

Discover

Explore

Connect

People