Stylistically Aware Representations of Books

dc.contributor.advisorSolorio, Thamar
dc.contributor.committeeMemberGonzalez, Fabio A.
dc.contributor.committeeMemberVilalta, Ricardo
dc.contributor.committeeMemberEick, Christoph F.
dc.creatorMaharjan, Suraj 1986-
dc.date.accessioned2018-11-30T16:13:46Z
dc.date.available2018-11-30T16:13:46Z
dc.date.createdMay 2018
dc.date.issued2018-05
dc.date.submittedMay 2018
dc.date.updated2018-11-30T16:13:46Z
dc.description.abstractThe conscious or unconscious choices made by an author to use some language forms constantly over other possible forms constitute the style of the author. Capturing style embedded in documents has a wide range of applications across many domains. In this dissertation, we propose a multitude of hand-crafted lexical, syntactic, and stylistic features together with novel deep learning methods to capture different stylistic markers embedded in documents. The methods are general enough to be applied to any domain. Here, we evaluate on an interesting and important domain: Books. The deeper study of stylistic variations will reveal the dos and don'ts of successful authors, which might help authors in shaping their writings and readers discover new books suited to their taste. We empirically show that traditional hand-crafted features and deep learning methods capture complementary information which upon careful combination yield better performance. Moreover, we find that adding an auxiliary task of genre classification to the primary task of success prediction improves results. Next, we propose a novel multimodal neural architecture that incorporates genre supervision to assign weights to individual feature types. As compared to previous ad-hoc feature combinations, which is time consuming and rigid, this method is capable of dynamically tailoring weights given to feature types based on the characteristics of each book. We then explore the authors' dexterity in use of emotion flow across the entire books to captivate readers. We show that modeling the sequential flow of emotions depicted across entire book performs better than without taking this information into account. Finally, we propose a novel method to learn stylistically aware embeddings for authors by feeding in the stylistic traits from their writings. These embeddings also prove to be assets in predicting books' likability.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.urihttp://hdl.handle.net/10657/3451
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectNLP
dc.subjectEmotion Flow
dc.subjectAuthor Style Embeddings
dc.subjectMultitask
dc.subjectDeep learning
dc.subjectGenre-aware Attention Neural Model
dc.titleStylistically Aware Representations of Books
dc.type.dcmiText
dc.type.genreThesis
local.embargo.lift2020-05-01
local.embargo.terms2020-05-01
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
MAHARJAN-DISSERTATION-2018.pdf
Size:
1.24 MB
Format:
Adobe Portable Document Format
No Thumbnail Available
Name:
suraj-thesis.zip
Size:
1.63 MB
Format:
Unknown data format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.43 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: