
dc.contributor.advisor: Shi, Weidong
dc.creator: Prakash, Varun Shimoga 1989-
dc.date.accessioned: 2014-03-13T22:12:09Z
dc.date.available: 2014-03-13T22:12:09Z
dc.date.created: December 2013
dc.date.issued: 2013-12
dc.identifier.uri: http://hdl.handle.net/10657/571
dc.description.abstract: With the advent of Big Data technologies, that is, the capacity to store and efficiently process large sets of data, doors of opportunity have opened for developing business intelligence that was previously unknown. Each phase in the processing of this data requires specialized infrastructure. One such phase, the preservation and archiving of data, has proven its usefulness time and again. Data archives are processed using novel data mining methods to elicit vital data gathered over long periods of time and to efficiently audit the growth of a business or an organization. Data preservation is also an important aspect of business processes, helping to avoid the loss of important information due to system failures, human errors and natural calamities. This thesis investigates the need for, discusses the possibilities of, and presents a novel, highly cost-effective, unified, long-term storage solution for data. Some of the common processes followed in large-scale data warehousing systems are analyzed for overlooked, inordinate shortcomings, and an economically feasible solution is conceived for them. The gap between the general needs of 'efficient' long-term storage and common, current functionalities is analyzed. An attempt to bridge this gap is made through the use of a hybrid, hierarchical-media-based, performance-enhancing middleware and a monolithic namespace filesystem in a new storage architecture, Tape Cloud. Our studies examine the effects of using heterogeneous storage media in terms of operational behavior, average latency of data transactions and power consumption. The results show the advantages of the new storage system by demonstrating the difference in operating costs, personnel costs and total cost of ownership from varied perspectives in a business model.
dc.format.mimetype: application/pdf
dc.language.iso: eng
dc.rights: The author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subject: Data Storage
dc.subject: Backup
dc.subject: Archiving
dc.subject: Cloud
dc.subject: Data Centers
dc.subject: Cost Efficiency
dc.subject: Magnetic Tapes
dc.subject: Middleware
dc.subject: Read Probability Weight
dc.subject: Priority Queue
dc.subject.lcsh: Computer science
dc.title: Resurrection: Rethinking Magnetic Tapes For Cost Efficient Data Preservation
dc.date.updated: 2014-03-13T22:12:15Z
dc.type.genre: Thesis
thesis.degree.name: Master of Science
thesis.degree.level: Masters
thesis.degree.discipline: Computer Science
thesis.degree.grantor: University of Houston
thesis.degree.department: Computer Science, Department of
dc.contributor.committeeMember: Tsekos, Nikolaos V.
dc.contributor.committeeMember: Yuan, Xiaojing
dc.type.dcmi: Text
dc.format.digitalOrigin: born digital
dc.description.department: Computer Science, Department of
thesis.degree.college: College of Natural Sciences and Mathematics

