ACCELERATING DATA-INTENSIVE COMPUTATIONS THROUGH DYNAMIC NETWORK TRAFFIC OPTIMIZATION

dc.contributor.advisorSubhlok, Jaspal
dc.contributor.committeeMemberGurkan, Deniz
dc.contributor.committeeMemberJohnsson, Lennart
dc.creatorDaga, Anand Arun 1989-
dc.date.accessioned2014-11-21T21:37:04Z
dc.date.available2014-11-21T21:37:04Z
dc.date.createdDecember 2012
dc.date.issued2012-12
dc.date.updated2014-11-21T21:37:05Z
dc.description.abstractHadoop has been emerging as a popular distributed framework for data intensive computing in clustered environments. The main usage has been in parallel computing problems where interconnected clusters would transfer parts of the data between individual compute nodes to accomplish one job. The clusters are usually connected with shared network infrastructure where other applications also access and transfer on the same bandwidth. Specifi cally, Hadoop MapReduce jobs su ffer when running in parallel with other tra ffic in the underlying network due to their sensitivity to delay between compute phases. We propose a dynamic priority mechanism realized by OpenFlow protocol on such an infrastructure with a preferred QoS policy over all other tra ffic. Moreover, our proposed priority mechanism can be enhanced if additional network information on traffi c in the underlying network is provided. We propose to use the emerging ALTO (Application Layer Tra ffic Optimization) server to provide network tra ffic information to Hadoop. The ALTO server will be based on the industry standard, IF-MAP (interface to metadata access points protocol), to leverage publish/subscribe capabilities and the flexible schema defi nitions.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10657/791
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectHadoop
dc.subject.lcshComputer science
dc.titleACCELERATING DATA-INTENSIVE COMPUTATIONS THROUGH DYNAMIC NETWORK TRAFFIC OPTIMIZATION
dc.type.dcmiText
dc.type.genreThesis
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelMasters
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ThesisReportRevised_v2.0.pdf
Size:
1.48 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.12 KB
Format:
Plain Text
Description: