Title:

Fault-Tolerant Systems

Code:SOP
Ac.Year:ukončen 2006/2007 (Not opened)
Term:Summer
Curriculums:
ProgrammeBranchYearDuty
EI-MSC-3VTN3rdElective
EI-MSC-5VTI2nd Stage/3rd YearElective
Language:Czech
Private info:http://www.fit.vutbr.cz/study/courses/SOP/private/
Credits:5
Completion:accreditation+exam (written)
Type of
instruction:
Hour/semLecturesSem. ExercisesLab. exercisesComp. exercisesOther
Hours:3900013
 ExaminationTestsExercisesLaboratoriesOther
Points:6000040
Guarantee:Drábek Vladimír, doc. Ing., CSc., DCSY
Lecturer:Drábek Vladimír, doc. Ing., CSc., DCSY
Faculty:Faculty of Information Technology BUT
Department:Department of Computer Systems FIT BUT
Prerequisites: 
Diagnosis and Safe Systems (DIA), DCSY
 
Learning objectives:
  To inform the students about different types of redundancy and its application for the design of computer systems being able to function correctly even under presence of faults and data errors.
Description:
  Principles of fault tolerance, structures and techniques. Codes for control and correction of information. Cyclic codes, Fire codes, BCH and RS. Convolutional codes. Modelling, estimation and control of reliability. Fail-safe systems. Architecture of FT systems. Fault tolerance at VLSI level. Fault tolerance in computer units, computer systems and communication networks. Distributed tolerant systems, fault tolerant software.
Knowledge and skills required for the course:
  Principles of computer organization.
Subject specific learning outcomes and competences:
  Skills and approaches to building fault tolerance using hardware and codes.
Generic learning outcomes and competences:
  Understandig a new trend in computer design.
Syllabus of lectures:
 
  • Introduction, FT design methodology. Hardware redundancy, TMR.
  • Information redundancy, parity codes, arithmetic codes, Residue codes, Hamming codes.
  • Cyclic codes, Fire codes.
  • Galois fields, BCH and Reed-Solomon codes, byte error detection.
  • Convolution codes.
  • Reliability modeling, combinatorial models, MIL-HDBK-217. Markov reliability models.
  • Fail-safe systems.
  • Time redundancy, alternating logic, RESO, RESWO, REDWC.
  • Architecture of fault tolerant systems.
  • VLSI reconfiguration techniques.
  • Fault tolerant units, systems and networks.
  • Distributed FT systems.
  • Software for FT systems.
Syllabus of laboratory exercises:
 
  • digital systems laboratory
Syllabus of computer exercises:
 
  • Model of error correction using Fire code.
  • Model of linear-feedback shift register.
  • Parallel generation of CRC.
  • Model of Galois field GF(2|n).
  • Model of reconfiguration in 2D processor array.
  • Model of error correction using convolutional code.
  • Model of error correcction using BCH code.
  • Reliability models.
Syllabus - others, projects and individual work of students:
 
  • Individual assignment of projects.
Fundamental literature:
 
  • Lin, S., Costello, D.J.: Error Control Coding: Fundamentals and Applications, Prentice-Hall, 1983, ISBN 0-13-283796-X 
Study literature:
 
  • Hlavička, J., Racek, S., Golan, P., Blažek, T.: Číslicové systémy odolné proti poruchám, Vydavatelství ČVUT, Praha, 1992 (in Czech), ISBN 80-01-00852-5
Controlled instruction:
   
Progress assessment:
  Mid-term exam, PC lab and a project.
Exam prerequisites:
  Passing mid-term exam, PC lab and a project solution.