I research and design hardware architectures and software systems that improve
performance, energy-efficiency and programmer productivity. Most of my work has focused on
architectural changes to general-purpose hardware accelerators like GPUs.
In 2016 I started as an assistant
professor of Computer Engineering at Purdue University,
where I lead the AALP research group.
Academically, I am a computer architect. I study the design of the hardware components that efficiently solve problems written in software. My research has been recognized with an NSF Career Award in 2020, an NVIDIA graduate fellowship in 2013, best paper nominations at MICRO 2012, PPoPP 2017 and ISPASS 2021, an IEEE Micro Top Picks in Computer Architecture in 2013, as a Communications of the ACM Research Highlight in 2014, and my PhD thesis was nominated for the Governor General of Canada's Gold Medal and the 2016 ACM Doctoral Dissertation Award. My teaching has been recognized with multiple Outstanding Engineering Teacher citations, the 2018 Ruth and Joel Spira Award for Excellence in Teaching, the 2020 Hesselberth Award for Teaching Excellence, and the 2022 College of Engineering Excellence in Early Career Teaching Award. I completed my PhD in Computer Architecture at the University of British Columbia in 2015. During the course of my PhD, I interned for the research divisions of both AMD and NVIDIA, where I worked on the design of future GPU computing microarchitectures. Prior to entering graduate school, I worked as a software engineer at Electronic Arts where I gained insight into how industrial software is really made.
I am actively seeking motivated, resourceful, dedicated PhD students who are interested in computer architecture and software systems to start both immediately and in the fall of 2020. Choosing a PhD advisor is probably one of the most important things you do in the first few years of graduate school. I think it is very important to choose one who not only does research in a field you find interesting, but also that you work well with. Graduate school is a lot of work, but given the right mix of motivation, creativity and diligence, it can be very rewarding and a lot of fun.
HPCA |
Aaron Barnes, Fangjia Shen, and Timothy G. Rogers. Mitigating GPU Core Partitioning Performance Effects. In proceedings of the 29th IEEE International Symposium on High-Performance Computer Architecture (HPCA 2023). Acceptance rate: (91/364) = 25%. |
MICRO |
Mahmoud Khairy, Ahmad Alawneh, Aaron Barnes, and Timothy G. Rogers. SIMR: Single Instruction Multiple Request Processing for Energy-Efficient Data Center Microservices. In proceedings of the 55th IEEE/ACM International Symposium on Microarchitecture (MICRO). Acceptance rate: (86/348) = 22%. |
ISPASS |
Ahmad Alawneh, Mahmoud Khairy and Timothy G Rogers. A SIMT Analyzer for Multi-Threaded CPU Applications. Poster in The 2022 IEEE International Symposium on Performance Analysis of Systems and Software, Singapore, May 2022 |
MICRO |
Cesar Avalos, Mahmoud Khairy, Roland N. Green, Mathias Payer, Timothy G. Rogers. Principal Kernel Analysis: A Tractable Methodology to Simulate Scaled GPU Workloads. In proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO). Acceptance rate: (94/430) = 21.9%. |
MICRO |
Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Amogh Manjunath, Junrui Pan, Timothy G. Rogers, Tor Aamodt, Nikos Hardavellas. AccelWattch: A Power Modeling Framework for Modern GPUs. In proceedings of the 54th IEEE/ACM International Symposium on Microarchitecture (MICRO). Acceptance rate: (94/430) = 21.9%. |
ISPASS |
Mengchi Zhang, Ahmad Alawneh, Timothy G Rogers. Characterizing Massively Parallel Polymorphism. In the proceedings of the 2021 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS 2021) . Acceptance rate: (24/65) = 36.9%. (Best Paper Nominee) |
ASPLOS |
Mengchi Zhang, Ahmad Alawneh, Timothy G Rogers. Judging a Type by its Pointer: Optimizing Virtual Function Calls on GPUs. In the 26th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS-2021). Acceptance rate: (75/398) = 18.8%. |
HPCA |
Tsung Tai Yeh, Matthew D. Sinclair, Brad Beckmann, Timothy G Rogers. Deadline-Aware Offloading for High-Throughput Accelerators. In the 27th IEEE International Symposium on High-Performance Computer Architecture (HPCA-2021). Acceptance rate: (63/258) = 24.4%. |
MICRO |
Mahmoud Khairy, Vadim Nikiforov, David Nellans, Timothy G Rogers. Locality-Centric Data and Threadblock Management for Massive GPUs. In proceedings of the 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO). Acceptance rate: (82/424) = 19.3%. |
MICRO |
Yuan Hsi Chou, Christopher Ng, Shaylin Cattel, Jeremy Intan, Mattew Sinclair, Joseph Devietti, Timothy G Rogers, Tor M. Aamodt. Deterministic Atomic Buffering. In proceedings of the 53rd IEEE/ACM International Symposium on Microarchitecture (MICRO). Acceptance rate: (82/424) = 19.3%. |
ISCA |
Mahmoud Khairy, Zhesheng Shen, Tor M. Aamodt, Timothy G Rogers. Accel-Sim: An Extensible Simulation Framework for Validated GPU Modeling. In proceedings of the 47th IEEE/ACM International Symposium on Computer Architecture (ISCA). Acceptance rate: (77/421) = 18.3%. |
ASPLOS |
Tsung Tai Yeh, Roland N Green, Timothy G Rogers. Dimensionality-Aware Redundant SIMT Instruction Elimination. In proceedings of the 25th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). Acceptance rate: (86/476) = 18.1%. |
PACT |
Mengchi Zhang, Roland N Green, Timothy G Rogers. POSTER: Quantifying the Direct Overhead of Virtual Function Calls on Massively Parallel Architectures Proceedings of the IEEE 28th International Conference on Parallel Architectures and Compilation Techniques (PACT) |
TOPC |
Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, and Timothy G. Rogers, Pagoda: A GPU Runtime System For Narrow Tasks. In the ACM Transactions on Parallel Computing (Invited publication) |
ISPASS |
Mahmoud Khairy, Akshay Jain, Tor Aamodt and Timothy G. Rogers, A Detailed Model for Contemporary GPU Memory Systems. Poster in the 2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-2019) |
ISPASS |
Jonathan Lew, Deval Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew Sinclair, Timothy G. Rogers and Tor Aamodt, Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. Poster in the 2019 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-2019). Full Arxiv Paper. |
Synthesis Lectures on Computer Architecture |
Tor M. Aamodt, Wilson Wai Lun, Timothy G. Rogers, General-Purpose Graphics Processor Architectures. Morgan & Claypool Publishers. May 2018. |
SIGMETRICS |
Akshay Jain, Mahmoud Khairy, Timothy G. Rogers, A Quantitative Evaluation of Contemporary GPU Simulation Methodology. To appear in the 2018 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Science (SIGMETRICS-2018), Irvine, California (acceptance rate 18.5%). |
ISPASS |
Mengchi Zhang, Roland Green, Timothy G. Rogers, Characterizing the Runtime Effects of Object-Oriented Workloads on GPUs. Poster in the 2018 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS-2018) |
HPCA |
Anthony Gutierrez, Bradford M. Beckmann, Alexandru Dutu, Joseph Gross, John Kalamatianos, Onur Kayiran, Michael LeBeane, Matthew Poremba, Brandon Potter, Sooraj Puthoor, Matthew D. Sinclair, Mark Wyse, Jieming Yin, Xianwei Zhang, Akshay Jain , Timothy G. Rogers, Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level. In the 24th IEEE International Symposium on High-Performance Computer Architecture (HPCA-2018). |
PPoPP |
Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, Timothy G. Rogers, Pagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks. To appear in proceedings of 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP-2017), Austin, Texas. (acceptance rate 29/132 = 22.0%) Best Paper Nominee |
PACT |
POSTER: Pagoda: A Runtime System to Maximize GPU Utilization in Data Parallel Tasks with Limited Parallelism
Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, Timothy G. Rogers PACT '16 Proceedings of the 2016 International Conference on Parallel Architectures and Compilation, 2016 |
ISCA |
A Variable Warp Size Architecture
Timothy G. Rogers, Daniel R. Johnson, Mike O'Connor, Stephen W. Keckler ISCA '15 Proceedings of the 42nd Annual International Symposium on Computer Architecture, 2015. (acceptance rate: 58/305 ≈ 19.0%) |
CACM |
MICRO |
Divergence-aware warp scheduling
Timothy G. Rogers, Mike O'Connor, Tor M. Aamodt MICRO-46 Proceedings of the 46th Annual IEEE/ACM International Symposium on Microarchitecture, 2013. (acceptance rate: 39/239 ≈ 16.3%) |
Top Picks |
Timothy G. Rogers, Mike O'Connor, Tor M. Aamodt, Cache-Conscious Thread Scheduling for Massively Multithreaded Processors, IEEE Micro, Special Issue: Micro's Top Picks from 2012 Computer Architecture Conferences, 2013. |
MICRO |
Timothy G. Rogers, Mike O'Connor, Tor M. Aamodt, Cache-Conscious Wavefront Scheduling, In proceedings of the 45th IEEE/ACM International Symposium on Microarchitecture (MICRO-45), pp. 72-83 Vancouver, BC, December 1-5, 2012. (acceptance rate: 40/228 ≈ 17.5%) (slides, poster, simulator code, benchmarks). Best Paper Runner-Up, Selected for IEEE Micro Top Picks |
ISPASS |
Tayler H. Hetherington, Timothy G. Rogers, Lisa Hsu, Mike O'Connor, Tor M. Aamodt, Characterizing and Evaluating a Key-Value Store Application on Heterogeneous CPU-GPU Systems, In Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), New Brunswick, NJ, April 1-3, 2012. code, slides. (acceptance rate: 20/65 ≈ 30.8%) |
Timothy G Rogers, Bradford M Beckmann, James M O'connor, Data Processor and Method of Lane Realignment. US Patent Application No. 14/045114. Applied 2013. |
Timothy G Rogers, Bradford M Beckmann, James M O'connor, High level software execution mask override. US Patent No. 9,317,296. Applied 2012. Granted 2016. |
Timothy G Rogers, Bradford M Beckmann, James M O'connor, Creating SIMD efficient code by transferring register state through common memory. US Patent No. 9,354,892. Applied 2012. Granted 2016. |