On Communication-Computation Overlap in High-Performance Computing

dc.contributor.advisorGabriel, Edgar
dc.contributor.committeeMemberSubhlok, Jaspal
dc.contributor.committeeMemberShah, Shishir Kirit
dc.contributor.committeeMemberGrabow, Lars C.
dc.creatorBarigou, Youcef 1987-
dc.date.accessioned2018-07-17T17:03:08Z
dc.date.available2018-07-17T17:03:08Z
dc.date.createdMay 2016
dc.date.issued2016-05
dc.date.submittedMay 2016
dc.date.updated2018-07-17T17:03:08Z
dc.description.abstractThe number of compute nodes and cores per node have increased many fold on high end computer systems over the last decade. For a parallel application to scale to tens or even hundreds of thousands of processes, all non-computing related operations have to be kept at an absolute minimum, including communication operations. Non-blocking-collective operations extend the concept of collective operations by offering the additional benefit of being able to overlap communication and computation. However, it has been demonstrated that collective operations have to be carefully tuned for a given platform and application scenario to maximize their performance. Also, using non-blocking-collective operations in real-world applications is non-trivial. Application codes often have to be restructured significantly in order to maximize the communication-computation overlap. The goal of this dissertation is to optimize non-blocking collective-communication operations and facilitate their utilization at the end-user level. This is achieved by an automatic run-time tuning of non-blocking collective-communication operations, which allows the communication library to maximize communication-computation overlap, and choose the best performing implementation for a non-blocking-collective operation on a case by case basis. Specifically, an approach to maximize the communication-computation overlap for hybrid OpenMP/MPI applications is developed. It leverages automatic parallelization by extending existing concepts to utilize non-blocking-collective operations. It also integrates the run-time auto-tuning techniques of non-blocking-collective operations, optimizing both, the algorithms used for the non-blocking-collective operations as well as location and frequency of accompanying progress-function calls.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.citationPortions of this document have appeared in: Barigou, Youcef, Vishwanath Venkatesan, and Edgar Gabriel. "Auto-tuning non-blocking collective communication operations." In Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International, pp. 1204-1213. IEEE, 2015. DOI: 10.1109/IPDPSW.2015.15.
dc.identifier.urihttp://hdl.handle.net/10657/3274
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. UH Libraries has secured permission to reproduce any and all previously published materials contained in the work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectHigh performance computing
dc.subjectMessage Passing Interface
dc.subjectMPI
dc.subjectCollective Operations
dc.subjectCommunication-computation Overlap
dc.subjectOptimization
dc.subjectDistributed memory
dc.subjectAutomatic Parallelization
dc.titleOn Communication-Computation Overlap in High-Performance Computing
dc.type.dcmiText
dc.type.genreThesis
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
BARIGOU-DISSERTATION-2016.pdf
Size:
6.63 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: