Title: Parallel System Architecture and Programming

Code: ARC
Ac.Year: 2012/2013
Sem: Summer
Curriculums:
Programme   Field/Specialization   Year   Duty
IT-MSC-2    MBI                    -      Compulsory-Elective - group C
IT-MSC-2    MBS                    -      Elective
IT-MSC-2    MGM                    -      Compulsory-Elective - group C
IT-MSC-2    MIN                    -      Elective
IT-MSC-2    MIS                    -      Elective
IT-MSC-2    MMI                    -      Compulsory-Elective - group C
IT-MSC-2    MMM                    -      Elective
IT-MSC-2    MPV                    1st    Compulsory
IT-MSC-2    MSK                    1st    Compulsory
Language of Instruction: Czech
Credits: 5
Completion: credit+exam (written)
Type of instruction:
Hour/sem   Lectures   Seminar Exercises   Laboratory Exercises   Computer Exercises   Other
Hours:     39         0                   0                      0                    13
           Exams   Tests   Exercises   Laboratories   Other
Points:    60      10      0           0              30
Guarantor: Dvořák Václav, prof. Ing., DrSc. (DCSY)
Lecturer: Bidlo Michal, Ing., Ph.D. (DCSY)
  Dvořák Václav, prof. Ing., DrSc. (DCSY)
Instructor: Dobai Roland, Ing., Ph.D. (DCSY)
  Dvořák Václav, prof. Ing., DrSc. (DCSY)
  Petrlík Jiří, Ing. (DCSY)
Faculty: Faculty of Information Technology BUT
Department: Department of Computer Systems FIT BUT
Substitute for:
  Advanced Computer Architecture (ARP), DCSY
  Practical Parallel Programming (PPP), DCSY
 
Learning objectives:
  To become familiar with the parallel systems available on the market, to be able to assess the communication and computing capabilities of a particular architecture, and to predict the performance of parallel applications. To become acquainted with the most important parallel programming tools (MPI, OpenMP), to learn their practical use, and to solve problems in parallel.
Description:
  The course covers the architecture and programming of parallel systems with functional and data parallelism. First, parallel system theory and program parallelization are discussed. Programming for shared memory systems in OpenMP follows, and then the most widespread multi-core multiprocessors (SMP) and the advanced DSM NUMA systems are described. The course continues with message-passing programming using the standardized MPI interface. Interconnection networks are treated separately, and then their role in clusters, many-core chips, and the most powerful systems is explained. Finally, SIMD accelerators and GPGPU are covered.
Knowledge and skills required for the course:
  Von Neumann computer architecture, computer memory hierarchy, cache memories and their organization, programming in assembly language and in C/C++.
Subject specific learning outcomes and competencies:
  Overview of the principles of parallel system design and of interconnection networks, communication techniques, and algorithms. Survey of parallelization techniques for fundamental scientific problems, knowledge of parallel programming in MPI and OpenMP. The use of SIMD accelerators and GPGPU.
Generic learning outcomes and competencies:
  Knowledge of the capabilities and limitations of parallel processing, ability to estimate the performance of parallel applications. Language constructs for process/thread communication and synchronization. Competence in hardware-software platforms for high-performance computing and simulations.
Syllabus of lectures:
1. Introduction to parallel processing
2. Patterns for parallel programming
3. Shared memory programming - Introduction to OpenMP
4. Synchronization and performance awareness in OpenMP
5. Shared memory and cache coherency
6. Components of symmetrical multiprocessors
7. CC NUMA DSM architectures
8. Message passing interface
9. Collective communications
10. Interconnection networks: topology and routing algorithms
11. Interconnection networks: switching, flow control, message processing and performance
12. Message passing architectures
13. Data-parallel architectures and programming
Syllabus of numerical exercises:
 Tutorials are not scheduled for this course.
Syllabus - others, projects and individual work of students:
 
  • Performance prediction of the given parallel application on a compute cluster. 
  • Development of an application on SMP in OpenMP.
  • A parallel program in MPI on the blade cluster.
Fundamental literature:
 
  1. Pacheco, P.: An Introduction to Parallel Programming. Morgan Kaufmann Publishers, 2011, 392 p., ISBN: 9780123742605
  2. Hennessy, J.L., Patterson, D.A.: Computer Architecture - A Quantitative Approach. 5th edition, Morgan Kaufmann Publishers, Inc., 2012, 856 p., ISBN: 9780123838728
Study literature:
 
  • current PPT slides for lectures

Progress assessment:
  Three small projects lasting 5, 4, and 4 hours; midterm examination.
Exam prerequisites:
  To complete the semester work successfully and be admitted to the examination, a student must obtain at least 20 of the maximum 40 points.
 
