Title:

Fault Tolerant Systems

Code:SOD
Ac.Year:2010/2011
Term:Summer
Curriculums:
ProgrammeBranchYearDuty
CSE-PHD-4DVI4-Elective
IT-PHD-3DIT3-Elective
Language:Czech
Completion:examination (written)
Type of
instruction:
Hour/semLecturesSem. ExercisesLab. exercisesComp. exercisesOther
Hours:390000
 ExaminationTestsExercisesLaboratoriesOther
Points:1000000
Guarantee:Drábek Vladimír, doc. Ing., CSc., DCSY
Lecturer:Drábek Vladimír, doc. Ing., CSc., DCSY
Instructor:Drábek Vladimír, doc. Ing., CSc., DCSY
Faculty:Faculty of Information Technology BUT
Department:Department of Computer Systems FIT BUT
 
Learning objectives:
  To inform the students about different types of redundancy and its application for the design of computer systems being able to function correctly even under presence of faults and data errors.
Description:
  Principles of fault tolerance, structures and techniques. Codes for control and correction of information. Cyclic codes, Fire codes, Galois fields, BCH and RS codes. Convolutional codes. Modelling, estimation and control of reliability. Fail-safe systems. Architecture of FT systems. Fault tolerance at VLSI level. Radiation safety. Fault tolerance in computer units, computer systems and communication networks. Radiation fault tolerance. A380 control system. Distributed tolerant systems, fault tolerant software.
Knowledge and skills required for the course:
  Computer design and software tools.
Subject specific learning outcomes and competences:
  Skills and approaches to building fault tolerance using hardware and codes.
Generic learning outcomes and competences:
  To get know a novel approach to ensure availability and safety of technical means.
Syllabus of lectures:
 
  • FT design methodology, structures and techniques.
  • Error control codes. Parity codes, arithmetic codes.
  • Residue codes, Hamming codes, sparse parity codes.
  • Cyclic codes, Fire codes.
  • Galois fields, BCH and Reed-Solomon codes, byte error detection.
  • Convolution codes.
  • Time redundancy, alternating logic.
  • Reliability modeling, combinatorial models, MIL-HDBK-217. Markov reliability models.
  • Safe systems.
  • FT architectures.
  • VLSI fault tolerance. Radiation fault tolerance. 
  • FT in computer units and computer systems.
  • FT systems architectures.
  • Software for FT systems.
Syllabus - others, projects and individual work of students:
 To process the individual project assignment.
Fundamental literature:
 
  • Lin S. - Costello D.J.: Error Control Coding: Fundamentals and Applications, Prentice-Hall, 1983
Study literature:
 
  • Lin S. - Costello D.J.: Error Control Coding: Fundamentals and Applications, Prentice-Hall, 1983
Controlled instruction:
  Exam, project presentation.