Research papers that used Cetus

Scaling Performance via Self-Tuning Approximation for Graphics Engines
Mehrzad Samadi, Janghaeng Lee, D. Anoushe Jamshidi, Scott Mahlke, and Amir Hormati.
ACM Transactions on Computing Systems
2014
OpenARC: open accelerator research compiler for directive-based, efficient heterogeneous computing
Seyong Lee and Jeffrey S. Vetter.
In Proceedings of the 23rd international symposium on High-performance parallel and distributed computing (HPDC)
2014, pp. 115-120
OpenARC: extensible OpenACC compiler framework for directive-based accelerator programming study
Seyong Lee and Jeffrey S. Vetter.
In Proceedings of the First Workshop on Accelerator Programming using Directives (WACCPD)
2014, pp. 1-11
The BonaFide C Analyzer: automatic loop-level characterization and coverage measurement
Sergio Aldea, Diego R. Llanos, and Arturo Gonzalez-Escribano.
The Journal of Supercomputing
2014
Leveraging GPUs using cooperative loop speculation
Mehrzad Samadi, Amir Hormati, Janghaeng Lee, and Scott Mahlke.
ACM Transactions on Architecture
2014
On Expressing Strategies for Directive-Driven Multicore Programing Models
Ricardo Nobre, Pedro Pinto, Tiago Carvalho, Joao M. P. Cardoso, and Pedro C. Diniz.
Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and Design Tools and Architectures for Multicore Embedded Computing Platforms (PARMA-DITAM)
2014
Compiler-Assisted Checkpointing of Parallel Codes: The Cetus and LLVM Experience
Gabriel Rodriguez, Maria J. Martin, Patricia Gonzalez, Juan Tourino, and Ramon Doallo.
International Journal on Parallel Programming
2013, pp. 782-805
Detecting source code similarity using code abstraction
Seongsoo Park, Seungcheol Ko, Jungsik Choi, Hwansoo Han, Seong-Je Cho, and Jongmoo Choi.
7th International Conference on Ubiquitous Information Management and Communication (ICUIMC)
2013
Efficient compilation of CUDA kernels for high-performance computing on FPGAs
Alexandros Papakonstantinou, Karthik Gururaj, John A. Stratton, Deming Chen, Jason Cong, and Wen-Mei W. Hwu.
ACM Transactions on Embedded Computing Systems
2013
Paralysis: An Extensible Multi-tiered Guidance Environment for Program Parallelization and Analysis
McCool, S.; Ran Shao; Milligan, P.; Kurugollu, F.
21st Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP)
2013, pp. 400-404
SAGE: self-tuning approximation for graphics engines
Mehrzad Samadi, Janghaeng Lee, D. Anoushe Jamshidi, Amir Hormati, and Scott Mahlke
46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO-46)
2013, pp. 13-24
Scaling large-data computations on multi-GPU accelerators
Amit Sabne, Putt Sakdhnagool, and Rudolf Eigenmann
27th ACM International Conference on Supercomputing (ICS)
2013, pp. 443-454
Portable section-level tuning of compiler parallelized applications
Mustafa, D.; Eigenmann, R.
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
2012, pp. 1-11
Patus for convenient high-performance stencils: Evaluation in earthquake simulations
Christen, M.; Schenk, O.; Yifeng Cui
International Conference for High Performance Computing, Networking, Storage and Analysis (SC)
2012, pp. 1-10
OpenMDSP: Extending OpenMP to Program Multi-Core DSP
Jiangzhou He; Wenguang Chen; Guangri Chen; Weimin Zheng; Zhizhong Tang; Handong Ye
International Conference on Parallel Architectures and Compilation Techniques (PACT)
2011, pp. 288-297
Hauberk: Lightweight Silent Data Corruption Error Detector for GPGPU
Keun Soo Yim; Cuong Pham; Saleheen, M.; Kalbarczyk, Z.; Iyer, R
IEEE International Parallel & Distributed Processing Symposium (IPDPS)
2011, pp. 287-300
Towards a Compiler Framework for Thread-Level Speculation
Aldea, S., Llanos, D.R. & Gonzá andlez-Escribano, A.
Parallel, Distributed and Network-Based Processing (PDP), 2011 19th Euromicro International Conference on
2011, pp. 267 -271
Performance Analysis and Tuning of Automatically Parallelized OpenMP Applications
Mustafa, Dheya, Aurangzeb & Eigenmann, Rudolf
Proc. of the International Workshop on OpenMP, IWOMP
Springer Verlag, 2011, Vol. 6665, pp. 150-164
PATUS: A Code Generation and Autotuning Framework For Parallel Iterative Stencil Computations on Modern Microarchitectures
Christen, M., Schenk, O. & Burkhart, H.
Parallel and Distributed Processing, 2011. IPDPS 2011. IEEE International Symposium on
2011
Analysis of Performance-impacting Factors on Checkpointing Frameworks: The CPPC Case Study
Rodríguez, Gabriel, Martín, María J., González, Patricia & Touriño, Juan
The Computer Journal, 2011
Region-based parallelization of irregular reductions on explicitly managed memory hierarchies
Kim, Seonggun, Han, Hwansoo & Choe, Kwang-Moo
The Journal of Supercomputing, Springer Netherlands, 2011, Vol. 56, pp. 25-55
A GPGPU compiler for memory optimization and parallelism management
Yang, Yi, Xiang, Ping, Kong, Jingfei & Zhou, Huiyang
Proceedings of the 31st ACM SIGPLAN Conference on Programming Language Design and Implementation
ACM, 2010, pp. 86-97
C code parallelization with paragraph
Bluemke, I. & Fugas, J.
Information Technology (ICIT), 2010 2nd International Conference on
2010, pp. 163 -166
memCUDA: map device memory to host memory on GPGPU platform
Jin, Hai, Li, Bo, Zheng, Ran, Zhang, Qin & Ao, Wenbing
Proceedings of the 2010 IFIP international conference on Network and parallel computing
Springer-Verlag, 2010, pp. 299-313
OpenMPC: Extended OpenMP Programming and Tuning for GPUs
Lee, Seyong & Eigenmann, Rudolf
SC'10: Proceedings of the 2010 ACM/IEEE conference on Supercomputing, Won the Best Student Paper award
IEEE press, 2010
CPPC: a compiler-assisted tool for portable checkpointing of message-passing applications
Rodrguez, Gabriel, Martn, Mara J., González, Patricia, Touri no, Juan & Doallo, Ramón
Concurr. Comput. : Pract. Exper., John Wiley and Sons Ltd., 2010, Vol. 22, pp. 749-766
A Tool Supporting C code Parallelization
Bluemke, Ilona & Fugas, Joanna
Sobh, T. & Elleithy, K. (ed.)
Innovations in Computing Sciences and Software Engineering
Springer Netherlands, 2010, pp. 259-264
A GPGPU compiler for memory optimization and parallelism management
Yang, Yi, Xiang, Ping, Kong, Jingfei & Zhou, Huiyang
Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation
ACM, 2010, pp. 86-97
An optimizing compiler for GPGPU programs with input-data sharing
Yang, Yi, Xiang, Ping, Kong, Jingfei & Zhou, Huiyang
Proceedings of the 15th ACM SIGPLAN symposium on Principles and practice of parallel programming
ACM, 2010, pp. 343-344
FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs
Papakonstantinou, A., Gururaj, K., Stratton, J.A., Chen, D., Cong, J. & Hwu, W.-M.W.
Application Specific Processors, 2009. SASP '09. IEEE 7th Symposium on
2009, pp. 35 -42
Cetus: A Source-to-Source Compiler Infrastructure for Multicores
Bae, Hansang, Bachega, Leonardo, Dave, Chirag, Lee, Sang-Ik, Lee, Seyong, Min, Seung-Jai, Eigenmann, Rudolf & Midkiff, Samuel
Proc. of the 14th Int'l Workshop on Compilers for Parallel Computing (CPC'09)
2009, pp. 14 pages
Euro-Par 2008 Workshops - Parallel Processing
Castillo, R., Corbera, F., Navarro, A., Asenjo, R. & Zapata, E.L.
César, E., Alexander, M., Streit, A., Träff, J. L., Cérin, C., Knüpfer, A., Kranzlmüller, D. & Jha, S. (ed.)
Chapter Complete Def-Use Analysis in Recursive Programs with Dynamic Data Structures
Springer-Verlag, 2009, pp. 273-282
Cetus: A Source-to-Source Compiler Infrastructure for Multicores
Dave, Chirag, Bae, Hansang, Min, Seung-Jai, Lee, Seyong, Eigenmann, Rudolf & Midkiff, Samuel
IEEE Computer, IEEE, 2009, Vol. 42(12), pp. 36-42
Automatically Tuning Parallel and Parallelized Programs
Dave, Chirag & Eigenmann, Rudolf
LCPC '09: Proceedings of the 22nd International Workshop on Languages and Compilers for Parallel Computing
2009
OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization
Lee, Seyong, Min, Seung-Jai & Eigenmann, Rudolf
PPoPP '09: Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
ACM, 2009, pp. 101-110
A cross-input adaptive framework for GPU program optimizations
Yixun Liu, Zhang, Eddy Z. & Shen, Xipeng
Proceedings of the 2009 IEEE International Symposium on Parallel&Distributed Processing
IEEE Computer Society, 2009, pp. 1-10
High-performance CUDA kernel execution on FPGAs
Papakonstantinou, Alexandros, Gururaj, Karthik, Stratton, John A., Chen, Deming, Cong, Jason & Hwu, Wen-Mei W.
Proceedings of the 23rd international conference on Supercomputing
ACM, 2009, pp. 515-516
Parallelizing irregular C codes assisted by interprocedural shape analysis
Asenjo, R., Castillo, R., Corbera, F., Navarro, A., Tineo, A. & Zapata, E.L.
Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on
2008, pp. 1 -12
Incorporation of OpenMP Memory Consistency into Conventional Dataflow Analysis
Basumallik, Ayon & Eigenmann, Rudolf
Proc. of the International Workshop on OpenMP, IWOMP
Springer Verlag, 2008, Vol. 5004
Languages and Compilers for Parallel Computing
Stratton, John A., Stone, Sam S. & Hwu, Wen-Mei W.
Amaral, J. N. (ed.)
Chapter MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs
Springer-Verlag, 2008, pp. 16-30
The OpenTM Transactional Application Programming Interface
Baek, Woongki, Minh, Chi Cao, Trautmann, Martin, Kozyrakis, Christos & Olukotun, Kunle
Proceedings of the 16th International Conference on Parallel Architecture and Compilation Techniques
IEEE Computer Society, 2007, pp. 376-387
MIDHARD: Middleware and hardware solutions in high performance computation: Application to multimedia and simulation TIN2004-07797-C02
Bruguera, J. D & Biempica, R. D
2007
Speculative Thread Decomposition Through Empirical Optimization
Johnson, Troy A., Vijaykumar, T.N. & Eigenmann, Rudolf
Proceedings of the ACM Symposium on the Principles and Practice of Parallel Programming
2007
KeY-C: A Tool for Verification of C Programs
Mürk, Oleg, Larsson, Daniel & Hähnle, Reiner
Proceedings of the 21st international conference on Automated Deduction: Automated Deduction
Springer-Verlag, 2007, pp. 385-390
Artemis: practical runtime monitoring of applications for execution anomalies
Fei, Long & Midkiff, Samuel P.
Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation
ACM, 2006, pp. 84-95
Shape analysis for dynamic data structures based on Coexistent Links Sets
Tineo, A., Corbera, F., Navarro, A., Asenjo, R. & Zapata, E.L.
12th Workshop on Compilers for Parallel Computers, CPC
2006
Argus: Online statistical bug detection
Fei, Long, Lee, Kyungwoo, Li, Fei & Midkiff, Samuel P.
Fundamental Approaches to Software Engineering, 2006, pp. 308-323
Towards a Versatile Pointer Analysis Framework
Castillo, R., Tineo, A., Corbera, F., Navarro, A., Asenjo, R. & Zapata, E.
Nagel, W., Walter, W. & Lehner, W. (ed.)
Euro-Par 2006 Parallel Processing
Springer Berlin / Heidelberg, 2006, Vol. 4128, pp. 323-333
CycleMeter: detecting fraudulent peers in internet cycle sharing
Zhang, Zheng, Hu, Y. Charlie & Midkiff, Samuel P.
Proceedings of the 2006 ACM/IEEE conference on Supercomputing
ACM, 2006
SOBER: statistical model-based bug localization
Liu, Chao, Yan, Xifeng, Fei, Long, Han, Jiawei & Midkiff, Samuel P.
Proceedings of the 10th European software engineering conference held jointly with 13th ACM SIGSOFT international symposium on Foundations of software engineering
ACM, 2005, pp. 286-295
Empowering Software Debugging Through Architectural Support for Program Rollback
Teodorescu, Radu & Torrellas, Josep
Workshop on the Evaluation of Software Defect Detection Tools (BUGS), in conjunction with PLDI
2005
Experiences in Using Cetus for Source-to-Source Transformations
Johnson, Troy A., Lee, Sang-Ik, Fei, Long, Basumallik, Ayon, Upadhyaya, Gautam, Eigenmann, Rudolf & Midkiff, Samuel P.
Proc. of the Workshop on Languages and Compilers for Parallel Computing (LCPC'04)
Springer Verlag, Lecture Notes in Computer Science, 2004, pp. 1-14
AccMon: Automatically Detecting Memory-Related Bugs via Program Counter-Based Invariants
Zhou, Pin, Liu, Wei, Fei, Long, Lu, Shan, Qin, Feng, Zhou, Yuanyuan, Midkiff, Samuel & Torrellas, Josep
Proceedings of the 37th annual IEEE/ACM International Symposium on Microarchitecture
IEEE Computer Society, 2004, pp. 269-280
Cetus -- An Extensible Compiler Infrastructure for Source-to-Source Transformation
Lee, Sang-Ik, Johnson, Troy A. & Eigenmann, Rudolf
Proc. of the Workshop on Languages and Compilers for Parallel Computing(LCPC'03)
Springer Verlag, Lecture Notes in Computer Science 2958, 2003, pp. 539-553