Apr 28, 2024  
2021-2022 Undergraduate Catalog [ARCHIVED CATALOG]


ECE (CPSC) 4740 - Fault Tolerance and Reliability in High-Performance Computing

3 Credits (3 Contact Hours)
Survey of current fault tolerance and reliability issues on high-performance computing (HPC) systems. Topics include taxonomy of failures and errors, checkpoint-restart, fault injection techniques, soft error detection schemes, and lossy compression. May also be offered as CPSC 4740. Preq: CPSC 3220 or ECE 3220 or ECE 3290, each with a grade of C or higher. ECE 4730 is recommended, but not required.

This 4000-level course has a 6000-level counterpart. Students should refer to the Graduate Catalog for the 6000-level description and requirements.


