Director, CRISP Center
Research group at Purdue
I am a Professor in the School of Electrical and Computer Engineering and the Department of Computer Science (by courtesy) at Purdue University in West Lafayette, Indiana. I am the founding Director of a university-wide resilience center CRISP, started in 2017. I serve on the Steering Committee of the premier conference in the field of dependability, DSN. I am an ACM Distinguished Scientist (2013), a Senior Member of IEEE (2007) and of ACM (2009), and a Distinguished Speaker for ACM (2012). I received an IBM Faculty Award (2014), a Google Faculty Award (2015), the AT&T Labs VURI Award (2016), and an Adobe Research Award (2017). I was elected to the IEEE Computer Society Board of Governors for the 2017-19 term. My research interests are in distributed systems and dependable computing. I am proudest of the 20 PhD students who have graduated from our research group and are in various stages of building wonderful careers in industry or academia. In our group of 12 graduate students, 3 undergraduate students, and 3 Research Scientists, we have far too much fun building and breaking real systems. Our work on dependability in various domains has been rewarded with recognition of best paper awards at BCB 2015, Sensys 2011, SecureComm 2008, SUTC 06 and runner-up best paper awards at Supercomputing 2009, 2012, HPDC 2006, IMS 2005, and DSN 2005.
I received the MS and PhD degrees from the University of Illinois, Urbana-Champaign in the Computer Science department, in 1998 and 2001, respectively. I worked with Prof. Ravishankar Iyer and Dr. Zbigniew Kalbarczyk there. My Ph.D. dissertation was on error detection protocols in distributed systems [ pdf ]. During my Ph.D. I worked on an adaptive fault-tolerant middleware system called Chameleon. My undergraduate alma mater is the Indian Institute of Technology at Kharagpur where I did Computer Science and Engineering. I worked at the IBM T. J. Watson Research Center in Hawthorne, New York in the Distributed Messaging Systems group on a project called Gryphon in 2001.
If you want the gory details, here is the full CV (pdf) (html). [Last update: March 2018]
If you want a meandering look, here is the non-linear version of my bio. (html)
The above sums up our current research direction. We work on software systems to enable them to perform their functionality in the face of natural and malicious failures. We apply and adapt data analytic techniques to work with the noise of computer systems and at large system scales. Current application domains come from distributed software systems, embedded systems, cellular systems, and bioinformatics.
I am interested in the question of how to build heterogeneous large-scale distributed systems that are reliable. Since many business and life critical functions are being performed by distributed systems, they need to be reliable while meeting their performance goals. Thus, there is need for smart error detection, diagnosis and recovery protocols. There is need for architectures that can combine fault tolerance aspects with performance aspects in an adaptive manner, adapting to different user requirements and different runtime environments. I consider intrusions to be an increasingly important class of faults and we are looking at the design of intrusion tolerant systems.
Wireless networks of embedded nodes cooperating among themselves for information gathering and analysis are becoming an important platform in several domains, leading toward the vision of "Internet of Things". The nodes are placed in situ in the environment to be monitored and have the capacity for sensing, communication, computation and sometimes, mobility and actuation. Since the nodes have limited power resource, all the tasks need to be performed under power constraints. The reliability challenges come from the unpredictability of the environment in which the networks are based and the security challenges come from the fact that the networks are often deployed in open environments where network-based and physical compromise-based attacks are possible. I am investigating the issues in building embedded networks to meet high-level reliability requirements in the face of these challenges.
For details of my research projects, take a look at the home page of the Dependable Computing Systems Lab (DCSL) and the research overview document. If you are interested in working in the research group, please take a look at the process for this outlined here.
|Fault tolerance for distributed applications||Error detection, prediction, and localization for a variety of distributed applications, currently, scientific clusters and applications, web services, and storage systems|
|Intelligent wireless networks||Reliability of emerging class of wireless platforms - resource-constrained embedded wireless networks and cellular networks|
|Distributed intrusion tolerant systems||Detection of intrusion attempts against distributed infrastructure and intrusion prevention and response|
Our funding comes from the National Science Foundation, Department of Defense (Army Research Lab, Missile Defense
Agency), Indiana 21st Century Research and Technology Fund, multiple private organizations (Northrop Grumman, IBM,
AT&T; previously from Lockheed Martin, Avaya, Tellabs, Motorola, Intel), and Purdue's Research Foundation.
Room 329 Electrical Engineering Bldg
Administrative Assistant: Mary-Ann Satterfield
Saurabh Bagchi [Contact info]
September 28, 2018
PGP public key