Stylistically Aware Representations of Books

Maharjan, Suraj 1986-

dc.contributor.advisor	Solorio, Thamar
dc.contributor.committeeMember	Gonzalez, Fabio A.
dc.contributor.committeeMember	Vilalta, Ricardo
dc.contributor.committeeMember	Eick, Christoph F.
dc.creator	Maharjan, Suraj 1986-
dc.date.accessioned	2018-11-30T16:13:46Z
dc.date.available	2018-11-30T16:13:46Z
dc.date.created	May 2018
dc.date.issued	2018-05
dc.date.submitted	May 2018
dc.date.updated	2018-11-30T16:13:46Z
dc.description.abstract	The conscious or unconscious choices an author makes to use some language forms consistently over other possible forms constitute the author's style. Capturing the style embedded in documents has a wide range of applications across many domains. In this dissertation, we propose a variety of hand-crafted lexical, syntactic, and stylistic features, together with novel deep learning methods, to capture different stylistic markers embedded in documents. The methods are general enough to be applied to any domain. Here, we evaluate them on an interesting and important domain: books. A deeper study of stylistic variation can reveal the dos and don'ts of successful authors, which might help authors shape their writing and help readers discover new books suited to their taste. We empirically show that traditional hand-crafted features and deep learning methods capture complementary information, which, when carefully combined, yields better performance. Moreover, we find that adding an auxiliary task of genre classification to the primary task of success prediction improves results. Next, we propose a novel multimodal neural architecture that incorporates genre supervision to assign weights to individual feature types. Compared to previous ad hoc feature combinations, which are time-consuming and rigid, this method can dynamically tailor the weights given to feature types based on the characteristics of each book. We then explore authors' dexterity in using emotion flow across entire books to captivate readers. We show that modeling the sequential flow of emotions depicted across an entire book performs better than ignoring this information. Finally, we propose a novel method to learn stylistically aware embeddings for authors by feeding in the stylistic traits of their writings. These embeddings also prove to be assets in predicting books' likability.
dc.description.department	Computer Science, Department of
dc.format.digitalOrigin	born digital
dc.format.mimetype	application/pdf
dc.identifier.uri	http://hdl.handle.net/10657/3451
dc.language.iso	eng
dc.rights	The author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subject	NLP
dc.subject	Emotion Flow
dc.subject	Author Style Embeddings
dc.subject	Multitask
dc.subject	Deep learning
dc.subject	Genre-aware Attention Neural Model
dc.title	Stylistically Aware Representations of Books
dc.type.dcmi	Text
dc.type.genre	Thesis
local.embargo.lift	2020-05-01
local.embargo.terms	2020-05-01
thesis.degree.college	College of Natural Sciences and Mathematics
thesis.degree.department	Computer Science, Department of
thesis.degree.discipline	Computer Science
thesis.degree.grantor	University of Houston
thesis.degree.level	Doctoral
thesis.degree.name	Doctor of Philosophy

Files

Original bundle

Name: MAHARJAN-DISSERTATION-2018.pdf
Size: 1.24 MB
Format: Adobe Portable Document Format

Name: suraj-thesis.zip
Size: 1.63 MB
Format: Unknown data format

License bundle

Name: PROQUEST_LICENSE.txt
Size: 4.43 KB
Format: Plain Text

Name: LICENSE.txt
Size: 1.81 KB
Format: Plain Text

Collections

Published ETD Collection