Reinforcement Learning and its applications

Interdisciplinary Areas: Data/Information/Computation, Smart City, Infrastructure, Transportation

Project Description

In this project, we will consider fundamental analysis of reinforcement learning. Further, the concepts of reinforcement learning and bandits will be applied for problems in networking, transportation, and manufacturing. 

Start Date

05/01/2019

Postdoc Qualifications

PhD in Electrical Engineering, Computer Science, Statistics, or related areas.  

Co-advisors

Vaneet Aggarwal, IE, https://engineering.purdue.edu/CLANLabs

Shweta Singh, ABE, https://sites.google.com/site/shwetasinghlab/ 

References

1. C. Pike-Burke, S. Agrawal, S. Grunewalder, C. Szepesvari, Bandits with Delayed, Aggregated Anonymous Feedback, ICML 2018.

2. S. Agrawal, R. Jia, "Optimistic posterior sampling for reinforcement learning: worst-case regret bounds". NIPS 2017

3. Yu, Zhe, Yunjian Xu, and Lang Tong. "Deadline scheduling as restless bandits." IEEE Transactions on Automatic Control (2018).

4. Maillard, Odalric-Ambrym, and Shie Mannor. "Latent Bandits." International Conference on Machine Learning. 2014.