The Dependable Computing Systems Laboratory (DCSL) at Purdue University investigates how to build dependable, heterogeneous, large-scale distributed systems.
“Dependability meets Data Analytics, and at Large Scales”
The above sums up our current research direction. We work on software systems so that they continue to function in the face of natural and malicious failures. We apply and adapt data analytics techniques to cope with the noise of computer systems and to operate at large system scales. Current application domains come from distributed software systems, embedded systems, cellular systems, and bioinformatics.
- March 2022: Two papers, one at OSDI and one at SIGMETRICS. The first is on creating a performance model for serverless workflows/DAGs and the second is on reducing tail end-to-end latency. Both papers use production serverless traces from Microsoft Azure, and we plan to release a subset. Congrats to Ashraf, who led the charge, and to Edgardo, Karthick, and Eshaan, who ably assisted. [ Post ]
- March 2022: Two papers, one at CVPR and one at Security and Privacy. The first is on scheduling complex vision tasks on mobile GPUs and the second is on learning from behavioral economists how to incentivize securing interdependent systems. Congrats to Ran and Mustafa, who led the charge. [ Post ]
- January 2022: Two papers, one at EuroSys and one at AISTATS. The first is on approximate video analytics on mobile GPUs and the second is on privacy-preserving data representation for distributed data. Congrats to Ran and Azam, who led the charge. [ Post ]
- December 2021: Two DCSL-ers graduate with their PhDs, Edgardo Barsallo and Ran Xu. We are excited to see what great things they will do next. [ Post ]
- November 2021: Saurabh presents at Google’s event on Federated Learning and Analytics. This brought together a set of academics with Googlers to discuss and debate the progress and future challenges in FL. [ Post ]
- August 2021: Our Assured Autonomy Innovation Institute (A2I2), which is a 5-year project (2020-25) funded by the Army Research Lab (ARL), is expanding. There are multiple Graduate Research Assistants joining this Fall and an opening for a Post-doctoral Fellow. [ WWW ]
Research Theme at DCSL
Since many business-critical and life-critical functions are performed by distributed systems, these systems need to be dependable while meeting their performance goals. Thus, there is a need for smart error detection, diagnosis, and recovery protocols. Since many of these systems operate on vast amounts of data and the patterns of errors or normal operation are approximate and noisy, we have to adapt leading-edge machine learning tools to these systems problems. There is also a need for architectures that combine dependability and security without significantly degrading performance, and that adapt to different user requirements and different runtime environments. This is our mission at DCSL.
Our application contexts come from various domains, many from our industrial colleagues. These include: security-critical enterprise (with Missile Defense Agency, Northrop Grumman, and Lockheed Martin), mobile and cloud platforms (in collaboration with AT&T and IBM), large-scale scientific clusters and applications (in collaboration with Lawrence Livermore National Lab and Argonne National Lab), and cyber-physical systems (in collaboration with GE Global Research Center and Sandia).
DCSL is the founding lab within the Purdue College of Engineering Center for Resilient Infrastructures, Systems, and Processes (CRISP). DCSL is the co-lead in the WHIN consortium, leading the thrust on “IoT Systems and Networking”.