Saurabh Bagchi

Electrical & Computer Engineering

Computer Science

Director, CRISP Center

DCSL research group

Research group at Purdue


Hot Links

Research Overview



New Stuff

Legend : Presentations Papers Students Awards News

    [January 2018] Want to work on leading-edge Department of Energy accelerator-based systems? Here is an RA announcement on a project on error detection for GPUs using Machine Learning. We plan to finalize the position hire by mid February. [ WWW ]

    [January 2018] A large new project worth $39M and slated for 5 years starts. The project, funded by Lilly Endowment, will develop IoT systems to enable smart agriculture and smart manufacturing. DCSL researchers, Saurabh, Edgardo, and Heng, will be leading the charge on building a reliable wireless mesh network, data analytics to detect failing sensor nodes, and visualization of the data and the network status. [ WWW ]

    [November 2017] Our work on security and game theory for interdependent systems is highlighted in a Purdue news story. [ WWW ]

    [September 2017] Two new funded projects start on security of interdependent systems, one from NSF SaTC and the other from Sandia. [ Details ]

    [September 2017] 5 students graduate, 4 new students start in DCSL. [ Details ]

    [September 2017] Want to work in graduate school with a great bunch of faculty and students working on different aspects of computer systems? Consider applying to the Computer Engineering program within ECE. [ Details ]

Archives (previous news items)


Short Bio

I am a Professor in the School of Electrical and Computer Engineering and the Department of Computer Science (by courtesy) at Purdue University in West Lafayette, Indiana. I am the founding Director of the College of Engineering-wide center on resilience called CRISP, started in 2017. I am an ACM Distinguished Scientist (2013), a Senior Member of IEEE (2007) and ACM (2009), a Distinguished Speaker for ACM (2012), and an IMPACT Faculty Fellow at Purdue. I am the Cyber lead for the GE-funded PRIAM center at Purdue ($10M, 2015-20) and was the Cybersecurity Lead for the NSF Center at Purdue called NEEScomm ($25M, 2010-15). I am a Fellow of the campus-wide security center called CERIAS.

Our work on fault tolerance in distributed systems has been rewarded with recognition of best papers or runner-up awards at several conferences (BCB 2015, Sensys 2011, Supercomputing 2009, SecureComm 2008, etc.) and through an IBM Award (2012), a Google Faculty Award (2015), and an AT&T Labs VURI Award (2016). I have been elected to serve on the IEEE Computer Society Board of Governors for 2017-19. I have been fortunate to be awarded the Seed for Success award at Purdue University for research contributions three times (2011, 2012, 2016).

I received the MS and PhD degrees from the University of Illinois, Urbana-Champaign in the Computer Science department, in 1998 and 2001, respectively. I worked with Prof. Ravishankar Iyer and Dr. Zbigniew Kalbarczyk there. My Ph.D. dissertation was on error detection protocols in distributed systems [ pdf ]. During my Ph.D. I worked on an adaptive fault-tolerant middleware system called Chameleon. Earlier I had done my MS in June 1998 from the same school and under the same advisors. My undergraduate alma mater is the Indian Institute of Technology at Kharagpur where I did Computer Science and Engineering. I worked at the IBM T. J. Watson Research Center in Hawthorne, New York in the Distributed Messaging Systems group on a project called Gryphon in 2001. 

If you want the gory details, here is the full CV (pdf) (html). [Last update: January 2018]

If you want a meandering look, here is the non-linear version of my bio. (html)

 Research Interests

"Dependability meets Data Analytics, and at Large Scales"

The above sums up our current research direction. We work on software systems to enable them to perform their functionality in the face of natural and malicious failures. We apply and adapt data analytic techniques to work with the noise of computer systems and at large system scales. Current application domains come from distributed software systems, embedded systems, cellular systems, and bioinformatics.

I am interested in the question of how to build heterogeneous large-scale distributed systems that are reliable. Since many business and life critical functions are being performed by distributed systems, they need to be reliable while meeting their performance goals. Thus, there is need for smart error detection, diagnosis and recovery protocols. There is need for architectures that can combine fault tolerance aspects with performance aspects in an adaptive manner, adapting to different user requirements and different runtime environments. I consider intrusions to be an increasingly important class of faults and we are looking at the design of intrusion tolerant systems.

Wireless networks of embedded nodes cooperating among themselves for information gathering and analysis are becoming an important platform in several domains, leading toward the vision of "Internet of Things". The nodes are placed in situ in the environment to be monitored and have the capacity for sensing, communication, computation and sometimes, mobility and actuation. Since the nodes have limited power resource, all the tasks need to be performed under power constraints. The reliability challenges come from the unpredictability of the environment in which the networks are based and the security challenges come from the fact that the networks are often deployed in open environments where network-based and physical compromise-based attacks are possible. I am investigating the issues in building embedded networks to meet high-level reliability requirements in the face of these challenges.

For details of my research projects, take a look at the home page of the Dependable Computing Systems Lab (DCSL) and the research overview document. If you are interested in working in the research group, please take a look at the process for this outlined here.

Project thrusts at DCSL
Project Summary
Fault tolerance for distributed applications Error detection, prediction, and localization for a variety of distributed applications, currently, scientific clusters and applications, web services, and storage systems
Intelligent wireless networks Reliability of emerging class of wireless platforms - resource-constrained embedded wireless networks and cellular networks
Distributed intrusion tolerant systems Detection of intrusion attempts against distributed infrastructure and intrusion prevention and response

Our funding comes from the National Science Foundation, Department of Defense (Army Research Lab, Missile Defense Agency), Indiana 21st Century Research and Technology Fund, multiple private organizations (Northrop Grumman, IBM, AT&T; previously from Lockheed Martin, Avaya, Tellabs, Motorola, Intel), and Purdue's Research Foundation.

Contact Information

Room 329 Electrical Engineering Bldg
465 Northwestern Avenue
West Lafayette, IN 47907.
Phone: 765.494.3362 (O) 765.427.5708 (C)
Fax: 765.494.2706
Email: sbagchi_at_purdue_dot_edu

Administrative Assistant: Mary-Ann Satterfield
Room 326B
Phone: 765.494.6389
Email: msaterfi_at_purdue_dot_edu



Saurabh Bagchi [Contact info]

Last modified: January 30, 2018

finger command

PGP public key