Computational Methods for Tweet Summarization and Emotion Extraction
dc.contributor.advisor | Eick, Christoph F. | |
dc.contributor.committeeMember | Shi, Weidong | |
dc.contributor.committeeMember | Lendasse, Amaury | |
dc.creator | Kummari, Anjana | |
dc.creator.orcid | 0000-0002-7251-2238 | |
dc.date.accessioned | 2020-06-02T04:56:50Z | |
dc.date.available | 2020-06-02T04:56:50Z | |
dc.date.created | May 2020 | |
dc.date.issued | 2020-05 | |
dc.date.submitted | May 2020 | |
dc.date.updated | 2020-06-02T04:56:51Z | |
dc.description.abstract | Gathering insights from social media has gained significant importance in the last decade. As the volume of social media data continues to grow, frameworks that can analyze social media content automatically are of critical importance. Twitter is a micro-blogging service that generates a massive amount of textual content every day. Throughout our research, we concentrate on using Twitter, the most popular micro-blogging site, for the task of sentiment analysis. We demonstrate how to compile a corpus automatically for the purposes of sentiment analysis and opinion mining. Sentiment analysis classifies texts based on the sentiment orientation of the opinions and emotions they contain. In this project, we are interested in evaluating popular sentiment analysis tools that automatically determine emotions in tweets and in developing computational methods that summarize the content of a large set of tweets. To compare sentiment analysis tools, we created several benchmarks of manually annotated tweet datasets and then evaluated the tools using these benchmarks. We also addressed some of the most common sentiment analysis challenges. For tweet summarization, we designed and developed algorithms that extract keywords and key sentences as a summary of a set of tweets. Finally, we developed a tool that creates a distance matrix for a set of tweets, relying on the popular TF-IDF framework. | |
dc.description.department | Computer Science, Department of | |
dc.format.digitalOrigin | born digital | |
dc.format.mimetype | application/pdf | |
dc.identifier.uri | https://hdl.handle.net/10657/6617 | |
dc.language.iso | eng | |
dc.rights | The author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s). | |
dc.subject | NLP, Twitter Analytics, Tweet Summarization, Sentiment Analysis | |
dc.title | Computational Methods for Tweet Summarization and Emotion Extraction | |
dc.type.dcmi | Text | |
dc.type.genre | Thesis | |
thesis.degree.college | College of Natural Sciences and Mathematics | |
thesis.degree.department | Computer Science, Department of | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | University of Houston | |
thesis.degree.level | Masters | |
thesis.degree.name | Master of Science |