Publications | Tim Rogers

Fangjia Shen, Aaron Barnes, Anusuya Nallathambi, Tim Rogers (2025). RayFlex: An Open-Source RTL Implementation of the Hardware Ray Tracer Datapath. In ISPASS 2025.

Ahmad Alawneh, Ni Kang, Mahmoud Khairy, Tim Rogers (2024). ThreadFuser: A SIMT Analysis Framework for MIMD Programs. In MICRO 2024.

PDF

Aaron Barnes, Fangjia Shen, Tim Rogers (2024). Extending GPU Ray-Tracing Units for Hierarchical Search Acceleration. In MICRO 2024.

PDF

Ni Kang, Ahmad Alawneh, Mengchi Zhang, Tim Rogers (2024). Concurrency-Aware Register Stacks for Efficient GPU Function Calls. In MICRO 2024.

PDF

Christin Bose, Cesar Avalos, Junrui Pan, Mahmoud Khairy, Tim Rogers (2024). MAccel-sim: A multi-gpu simulator for architectural exploration. In IISWC 2024 (poster).

PDF

Junrui Pan, Tim Rogers (2024). CRISP: Concurrent Rendering and Compute Simulation Platform for GPUs. In IISWC 2024. Best Paper Nominee.

PDF

Mahmoud Khairy, Zhesheng Shen, Tor M. Aamodt, Tim Rogers (2023). RETROSPECTIVE: Accel-sim: An Extensible Simulation Framework for Validated GPU Modeling. In ISCA@50 25-year Retrospective 1996-2020.

PDF Cite Code Project Press

Aaron Barnes, Fangjia Shen, Tim Rogers (2023). Mitigating GPU Core Partitioning Performance Effects. In HPCA 2023.

PDF DOI

Mahmoud Khairy, Ahmad Alawneh, Aaron Barnes, Tim Rogers (2022). SIMR: Single Instruction Multiple Request Processing for Energy-Efficient Data Center Microservices. In MICRO 2022.

PDF DOI

Ahmad Alawneh, Mahmoud Khairy, Tim Rogers (2022). A SIMT Analyzer for Multi-Threaded CPU Applications. In ISPASS 2022.

PDF DOI

Cesar Avalos, Mahmoud Khairy, Roland Green, Mathias Payer, Tim Rogers (2021). Principal Kernel Analysis: A Tractable Methodology to Simulate Scaled GPU Workloads. In MICRO 2021.

PDF DOI

Vijay Kandiah, Scott Peverelle, Mahmoud Khairy, Amogh Manjunath, Junrui Pan, Tim Rogers, Tor M. Aamodt, Nikos Hardavellas (2021). AccelWattch: A Power Modeling Framework for Modern GPUs. In MICRO 2021.

PDF DOI

Mengchi Zhang, Ahmad Alawneh, Tim Rogers (2021). Judging a type by its pointer: optimizing GPU virtual functions. In ASPLOS 2021.

PDF DOI

Mengchi Zhang, Ahmad Alawneh, Tim Rogers (2021). Characterizing Massively Parallel Polymorphism. In ISPASS 2021. Best Paper Nominee.

PDF DOI

Tsung Tai Yeh, Matthew D. Sinclair, Bradford M. Beckman, Tim Rogers (2021). Deadline-Aware Offloading for High-Throughput Accelerators . In HPCA 2021.

PDF DOI

Mahmoud Khairy, Dima Nikiforov, David Nellans, Tim Rogers (2020). Locality-Centric Data and Threadblock Management for Massive GPUs. In MICRO 2020.

PDF DOI

Yuan Hsi Chou, Christopher Ng, Shaylin Cattel, Jeremy Intan, Mattew D. Sinclair, Joseph Devietti, Tim Rogers, Tor M. Aamodt (2020). Deterministic Atomic Buffering. In MICRO 2020.

PDF DOI

Mahmoud Khairy, Zhesheng Shen, Tor M. Aamodt, Tim Rogers (2020). Accel-Sim: An Extensible Simulation Framework for Validated GPU Modeling. In ISCA 2020.

PDF DOI

Tsung Tai Yeh, Roland N. Green, Tim Rogers (2020). Dimensionality-Aware Redundant SIMT Instruction Elimination. In ASPLOS 2020.

PDF DOI

Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagool, Rudolf Eigenmann, Tim Rogers (2019). Pagoda: A GPURuntime System for Narrow Tasks. In TOPC 2021. Invited Paper.

PDF DOI

Mengchi Zhang, Roland N. Green, Tim Rogers (2019). POSTER: Quantifying the Direct Overhead of Virtual Function Calls on Massively Parallel Architectures. In PACT 2019.

PDF DOI

Jonathan Lew, Deval Shah, Suchita Pati, Shaylin Cattell, Mengchi Zhang, Amruth Sandhupatla, Christopher Ng, Negar Goli, Matthew D. Sinclair, Tim Rogers, Tor M. Aamodt (2019). Analyzing Machine Learning Workloads Using a Detailed GPU Simulator. In ISPASS 2019.

PDF DOI

Mahmoud Khairy, Akshay Jain, Tor M. Aamodt, Tim Rogers (2019). A Detailed Model for Contemporary GPU Memory Systems. In ISPASS 2019.

PDF DOI

Tor M. Aamodt, Wilson Wai Lun Fung, Tim Rogers (2018). General-Purpose Graphics Processor Architectures. In * Synthesis Lectures on Computer Architecture*.

DOI

Akshay Jain, Mahmoud Khairy, Tim Rogers (2018). A Quantitative Evaluation of Contemporary GPU Simulation Methodology. In SIGMETRICS 2018..

PDF DOI

Mengchi Zhang, Ahmad Alawneh, Tim Rogers (2018). Characterizing the Runtime Effects of Object-Oriented Workloads on GPUs. In ISPASS 2018.

PDF DOI

Anthony Gutierrez, Bradford M. Beckmann, Alexandru Dutu, Joseph Gross, John Kalamatianos, Onur Kayiran, Michael LeBeane, Matthew Poremba, Brandon Potter, Sooraj Puthoor, Matthew D. Sinclair, Mark Wyse, Jieming Yin, Xianwei Zhang, Akshay Jain, Tim Rogers (2018). Lost in Abstraction: Pitfalls of Analyzing GPUs at the Intermediate Language Level. In HPCA 2018.

PDF DOI

Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagol, Rudolf Eigenmann, Tim Rogers (2017). Pagoda: Fine-Grained GPU Resource Virtualization for Narrow Tasks. In PPoPP 2017. Best Paper Nominee.

PDF DOI

Tsung Tai Yeh, Amit Sabne, Putt Sakdhnagol, Rudolf Eigenmann, Tim Rogers (2016). POSTER: Pagoda: A Runtime System to Maximize GPU Utilization in Data Parallel Tasks with Limited Parallelism. In PACT 2016.

PDF DOI

Tim Rogers, Daniel R. Johnson, Mike O’Connor, Stephen W. Keckler (2015). A Variable Warp Size Architecture. In ISCA 2015.

PDF DOI

Tim Rogers, Mike O’Connor, Tor M. Aamodt (2014). Learning Your Limit: Managing Massively Multithreaded Caches Through Scheduling. In CACM Research Highlight.

PDF DOI

Tim Rogers, Mike O’Connor, Tor M. Aamodt (2013). Divergence-Aware Warp Scheduling. In MICRO 2013.

PDF DOI

Tim Rogers, Mike O’Connor, Tor M. Aamodt (2013). Cache-Conscious Thread Scheduling for Massively Multithreaded Processors. In TOP-PICKS 2013.

PDF DOI

Tim Rogers, Mike O’Connor, Tor M. Aamodt (2012). Cache-Conscious Wavefront Scheduling. In MICRO 2012. Best Paper Nominee. Top Picks. CACM Research Highlight..

PDF DOI

Tayler Hetherington, Tim Rogers, Mike O’Connor, Tor M. Aamodt (2012). Characterizing and Evaluating a Key-value Store Application on Heterogeneous CPU-GPU Systems. In ISPASS 2012.

PDF DOI