Changes in ECE 572 Fault-Tolerant Computer SystemsEngineering Faculty Document No. 16-06 November 9, 2006 TO: The Faculty of the College of Engineering FROM: The Faculty of the School of Electrical and Computer Engineering RE: ECE 572 Changes in Course Description, Content, and Prerequisites The faculty of the From: ECE 572 – Fault-Tolerant Computer
Systems Sem. 2, Class 3, cr. 3 Prerequisite: ECE 302 and 565 or ECE 302, 365, and consent of instructor An introduction
to methodologies for specifying, modeling and designing fault-tolerant systems
supported by case studies and real systems, a term project and relevant papers. Topics include fault classification,
measurement and evaluation, techniques for fault detection and recovery,
combinatorial and Markov modeling techniques. To: ECE 572 – Fault-Tolerant Computer
Systems Sem. 2, Class 3, cr. 3 Prerequisite: ECE 302 and 368. An introduction to the hardware and software methodologies for specifying, modeling, and designing fault-tolerant systems supported by case studies of real systems. The material presents a broad spectrum of hardware and software error detection and recovery techniques that can be used to build reliable networked systems. The lectures discuss how the hardware and software techniques interplay, what techniques can be provided in COTS hardware, what can be embedded into operating system and network communication layers, and what can be provided via a distributed software layer and in the application itself. Reason: The course description and prerequisites have been changed to reflect the updated content of the course. . Mark Smith, Head School of Electrical & Computer Engineering
Engineering Faculty Document No. 16-06 November 9, 2006 Page 1 of 1 ECE 572 Fault Tolerant Computer Systems Course Outline Saurabh Bagchi Electrical and
Computer Engineering Department, 1285 EE Building, Email: sbagchi@purdue.edu Text Book D. P. Siewiorek and R. S. Swarz, Reliable Computer Systems - Design and Evaluation, 3rd edition, 1999, A.K. Peters, Limited. Reference D. K. Pradhan, ed., Fault Tolerant Computer System Design, 1st edition, 1996, Prentice-Hall. K. Trivedi, Probability and Statistics with Reliability, Queuing and Computer Science Applications, 2nd edition, 2001, John Wiley & Sons. Prerequisites ECE 302 and ECE 368. Equivalent courses may be used in satisfying the prerequisites with the consent of the instructor. Description An introduction to the hardware and software methodologies for specifying, modeling and designing fault-tolerant systems supported by case studies of real systems. The material presents a broad spectrum of hardware and software error detection and recovery techniques that can be used to build reliable networked systems. The lectures discuss how the hardware and software techniques interplay, what techniques can be provided in COTS hardware, what can be embedded into operating system and network communication layers, and what can be provided via a distributed software layer and in the application itself. Course Outcomes A student who
successfully fulfills the course requirements will have demonstrated: i. an ability to evaluate the dependability of a system. [1,2,4;a,b,e] ii. an ability to analyze a system for performance-dependability tradeoffs. [1,4;a,b,c,e,k] iii. an ability to select the appropriate detection techniques (hardware and software) for a given environment. [1,4;a,c,e,k] iv. an ability to select the appropriate recovery techniques (hardware and software) for a given environment. [1,4;a,c,e,k] v. an ability to select the appropriate points in an end-to-end system to embed fault-tolerant techniques. [1,4;a,c,e,k] Student assessment of the course outcomes will be in the form of a midterm exam, a final exam, and the grading of a design and implementation project. Each student working in a group of two will choose a project from a list. Each project will focus on one aspect of fault-tolerant system design and will test the ability to design, model or implement, execute experiments and perform evaluation.
Class Outline
|