Reinforcement Learning and its applications

Interdisciplinary Areas: Data/Information/Computation, Smart City, Infrastructure, Transportation

Project Description

In this project, we will consider fundamental analysis of reinforcement learning. Further, the concepts of reinforcement learning and bandits will be applied for problems in networking, transportation, and manufacturing. 

Start Date


Postdoc Qualifications

PhD in Electrical Engineering, Computer Science, Statistics, or related areas.  


Vaneet Aggarwal, IE,

Shweta Singh, ABE, 


1. C. Pike-Burke, S. Agrawal, S. Grunewalder, C. Szepesvari, Bandits with Delayed, Aggregated Anonymous Feedback, ICML 2018.

2. S. Agrawal, R. Jia, "Optimistic posterior sampling for reinforcement learning: worst-case regret bounds". NIPS 2017

3. Yu, Zhe, Yunjian Xu, and Lang Tong. "Deadline scheduling as restless bandits." IEEE Transactions on Automatic Control (2018).

4. Maillard, Odalric-Ambrym, and Shie Mannor. "Latent Bandits." International Conference on Machine Learning. 2014.