Detecting Objectionable Content in Online Media

Shafaei, Mahsa

Detecting Objectionable Content in Online Media

dc.contributor.advisor	Solorio, Thamar
dc.contributor.committeeMember	Gabriel, Edgar
dc.contributor.committeeMember	Kakadiaris, Ioannis A.
dc.contributor.committeeMember	Gonzalez, Fabio A.
dc.creator	Shafaei, Mahsa
dc.creator.orcid	0000-0001-5428-3497
dc.date.accessioned	2022-06-17T21:59:02Z
dc.date.created	December 2021
dc.date.issued	2021-12
dc.date.submitted	December 2021
dc.date.updated	2022-06-17T21:59:03Z
dc.description.abstract	In this dissertation, we discuss methods toward having a system to automatically detect objectionable content in online media. Movies, animations, trailers, and video blogs are vastly accessible by younger audiences through the movie service providers (e.g. Amazon and Netflix), YouTube, movie theatres, and generally the web. The online content helps us learn and inspire societal changes. But it can also contain objectionable content that negatively affects viewers' behavior, especially children. For some media content (like movies, books and trailers), we do have a rating system. For example, the rating system for movies is adopted from the Motion Picture Association of America (MPAA), consists of manual inspection of movies to assign an age rating. However, there are some issues regarding this rating system. First, the current system announces a single rating for the whole content. Yet, suitability is partially related to the culture, people's background, emotional and cognitive skills of children. Thus, having a single rating is not always helpful, and more details are needed. Second, this manual process does not scale to an ever-increasing number of online videos available on the internet. As the first step towards the main goal of this dissertation (detecting objectionable content), we design, implement, and evaluate a system that is capable to predict movies and trailers age suitability rating without a human observation to explore different models for the task. The system that we propose either employs only the script of the movies as the input, or it takes advantage of all modalities and combines all cues from acoustic, visual and textual information for detecting the objectionable content. The script-based system can be utilized at the early steps of the production when we only have the script. The multi-modal version, however, can be used after the production when a video is fully ready. Finally, we expand our multi-modal model to automatically generate the list of objectionable elements in any kind of video. In this dissertation, we focus exclusively on "Comic Mischief" elements, which no one has attempted previously. Along with the system, we propose the biggest corpus of movie scripts that comes with metadata, poster images, and movie trailers that are rated by the MPAA institution. We also compile a dataset including a wide range of videos that are tagged with comic mischief elements in video scenes. Finally, we make the implementation and data resources available for further research.
dc.description.department	Computer Science, Department of
dc.format.digitalOrigin	born digital
dc.format.mimetype	application/pdf
dc.identifier.citation	Portions of this document appear in: Shafaei, Mahsa, Niloofar Safi Samghabadi, Sudipta Kar, and Thamar Solorio. "Age Suitability Rating: Predicting the MPAA Rating Based on Movie Dialogues." In Proceedings of The 12th Language Resources and Evaluation Conference, pp. 1327-1335. 2020; and in: Shafaei, Mahsa, Christos Smailis, Ioannis A. Kakadiaris, and Thamar Solorio. "A Case Study of Deep Learning Based Multi-Modal Methods for Predicting the Age-Suitability Rating of Movie Trailers." RANLP (2021).
dc.identifier.uri	https://hdl.handle.net/10657/9270
dc.language.iso	eng
dc.rights	The author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. UH Libraries has secured permission to reproduce any and all previously published materials contained in the work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subject	Objectionable Content
dc.subject	Deep Learning Architecture
dc.title	Detecting Objectionable Content in Online Media
dc.type.dcmi	Text
dc.type.genre	Thesis
dcterms.accessRights	The full text of this item is not available at this time because the student has placed this item under an embargo for a period of time. The Libraries are not authorized to provide a copy of this work during the embargo period.
local.embargo.lift	2023-12-01
local.embargo.terms	2023-12-01
thesis.degree.college	College of Natural Sciences and Mathematics
thesis.degree.department	Computer Science, Department of
thesis.degree.discipline	Computer Science
thesis.degree.grantor	University of Houston
thesis.degree.level	Doctoral
thesis.degree.name	Doctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1

Name:: SHAFAEI-DISSERTATION-2021.pdf
Size:: 1.55 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: PROQUEST_LICENSE.txt
Size:: 4.43 KB
Format:: Plain Text
Description:

Download

Name:: LICENSE.txt
Size:: 1.81 KB
Format:: Plain Text
Description:

Download

Collections

Published ETD Collection