The Dependable Computing Systems Laboratory (DCSL) at Purdue University investigates the question of how to build dependable, heterogeneous, large-scale distributed systems.
“Dependability meets Data Analytics, and at Large Scales”
The above sums up our current research direction. We work on software systems to enable them to perform their functionality in the face of natural and malicious failures. We apply and adapt data analytic techniques to work with the noise of computer systems and at large system scales. Current application domains come from distributed software systems, embedded systems, cellular systems, and bioinformatics.
Since many business and life critical functions are being performed by distributed systems, they need to be dependable while meeting their performance goals. Thus, there is need for smart error detection, diagnosis, and recovery protocols. Since many of these systems operate on vast amounts of data and the patterns of errors or normal operation are approximate and noisy, we have to adapt leading-edge machine learning tools to these systems problems. There is also need for architectures that can combine dependability and security aspects without significantly degrading performance and do this in an adaptive manner, adapting to different user requirements and different runtime environments. This is our mission at DCSL.
Our application contexts come from various domains, many from our industrial colleagues. These include: security-critical enterprise (with Missile Defense Agency, Northrop Grumman and Lockheed Martin), mobile and cloud platforms (in collaboration with AT&T and IBM), large-scale scientific clusters and applications (in collaboration with Lawrence Livermore National Lab and Argonne National Lab), and cyber physical systems (in collaboration with GE Global Research Center and Sandia).
Our goal is to do cutting edge practical systems work that makes a splash in the academic community and a difference in the wider technology sphere through strategic collaborations with industrial partners. DCSL is the founding lab within the Purdue College of Engineering Center for Resilient Infrastructures, Systems, and Processes (CRISP).
- December 2018: Our report from the NSF workshop on Grand Challenges in Computer Systems Research is now out. A succinct 20 pages of reading will lay out for you the road ahead. [ PDF or on ACM DL ]
- September 2018: One paper, 7 authors, 5 different organizations. Purdue and 4 different companies: Adobe Research, Facebook, LinkedIn, and Google. Our Middleware 18 paper Pythia. Congrats to Ran for shepherding this diverse bunch. The paper shows how to co-locate multiple workloads on datacenter machines while avoiding resource contention. [ PDF ]
- September 2018: 3 new graduate students join DCSL. Shikhar Suryavansh (dependable edge computing, co-advised with Mung Chiang), Manish Nagaraj (blockchain-based security in embedded systems, co-advised with Aniket Kate of CS), and Chandan Bothra (cellular device-network cooperation). 1 new REU undergraduate student joins DCSL – Alex Medoff. Alex will be working with Ashraf on reconfiguration in distributed systems. Welcome!
- April 2018: Our system, VideoChef, on approximation in video analytics is accepted into Usenix ATC. Congratulations to Ran and the team! 76 of 378 papers are accepted for an acceptance rate of 20.1%. Here is the paper. [ PDF ]
- January 2018: A large new project worth $39M for 5 years starts. The project, funded by Lilly Endowment, will develop IoT systems to enable smart agriculture and smart manufacturing. DCSL researchers, Saurabh, Edgardo, and Heng, will be leading the charge on building a reliable wireless mesh network, data analytics to detect failing sensor nodes, and visualization of the data and the network status. [ WWW ] [ WWW ]