Performance of Serialization Libraries in a High Performance Computing Environment

dc.contributor.advisorGnawali, Omprakash
dc.contributor.committeeMemberGabriel, Edgar
dc.contributor.committeeMemberShi, Weidong
dc.creatorCasey, Allen
dc.creator.orcid0000-0002-9031-020X
dc.date.accessioned2022-12-29T01:49:15Z
dc.date.available2022-12-29T01:49:15Z
dc.date.createdMay 2022
dc.date.issued2022-04-28
dc.date.updated2022-12-29T01:49:16Z
dc.description.abstractHigh performance computing is a subset of distributed computing, and is a paradigm that involves building a cluster of interconnected machines capable of performing operations in parallel. This parallelization enables the cluster to reduce the time needed to perform operations by distributing the work across multiple cluster nodes. The process is heavily dependent on internode communication, and requires nodes to coordinate and communicate by passing messages among themselves. High performance computing requires that this messaging be very efficient. The messaging process involves serializing the message contents prior to transmission, and deserializing it upon receipt by the receiver. Several libraries have emerged to facilitate serialization and deserialization including Protocol Buffers, FlatBuffers, and MessagePack. The goal of this thesis is to evaluate the performance of these libraries within the context of a high performance computing software package. As an evaluation infrastructure, a parallelized mass spectrometry tool currently under development at the University of Houston is used, and a new mechanism for serialization is contributed to this tool using each of these three serialization libraries. The libraries are evaluated holistically within the context of the above software package; with many metrics being observed including their performance in terms of execution time and hardware utilization, as well as their general ease of development.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10657/13140
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectSerialization
dc.subjectHigh performance computing
dc.titlePerformance of Serialization Libraries in a High Performance Computing Environment
dc.type.dcmiText
dc.type.genreThesis
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelMasters
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CASEY-THESIS-2022.pdf
Size:
394.49 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.43 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: