Computational Methods for Tweet Summarization and Emotion Extraction

dc.contributor.advisorEick, Christoph F.
dc.contributor.committeeMemberShi, Weidong
dc.contributor.committeeMemberLendasse, Amaury
dc.creatorKummari, Anjana
dc.creator.orcid0000-0002-7251-2238
dc.date.accessioned2020-06-02T04:56:50Z
dc.date.available2020-06-02T04:56:50Z
dc.date.createdMay 2020
dc.date.issued2020-05
dc.date.submittedMay 2020
dc.date.updated2020-06-02T04:56:51Z
dc.description.abstractThe process of gathering insights from social media has gained significant importance in the last decade. Since social media data is growing larger and larger, frameworks that can analyze social media content automatically are of critical importance. Twitter is a micro-blog service that generates a massive amount of textual content every day. Throughout our research, we concentrate on using Twitter for the task of sentiment analysis, the most popular micro-blogging site. We demonstrate how to compile a corpus automatically for purposes of sentiment analysis and opinion mining. Sentiment analysis classifies texts based on the sentimental orientation of opinions and emotions they contain. In this project, we are interested in evaluating popular sentiment analysis tools that automatically determine emotions in tweets and to develop computational methods that summarize the content of a large set of tweets. For the comparison of sentiment analysis tools, we created different benchmarks of manually annotated tweet datasets, and then evaluated the tools using these benchmarks. We also addressed some of the most popular sentiment analysis challenges. As far as summarization of tweets is concerned, we designed and developed algorithms that extract keywords and key sentences as a summary for a set of tweets. Finally, we developed a tool that creates a distance matrix for a set of tweets relying on the popular TF-IDF framework.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10657/6617
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectNLP, Twitter Analytics, Tweet Summarization, Sentiment Analysis
dc.titleComputational Methods for Tweet Summarization and Emotion Extraction
dc.type.dcmiText
dc.type.genreThesis
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelMasters
thesis.degree.nameMaster of Science

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
KUMMARI-THESIS-2020.pdf
Size:
1.78 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.43 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: