Visual Summarization of Lecture Videos to Enhance Navigation

dc.contributor.advisorSubhlok, Jaspal
dc.contributor.committeeMemberShah, Shishir Kirit
dc.contributor.committeeMemberSolorio, Thamar
dc.contributor.committeeMemberWayne, Chad M.
dc.creatorRahman, Mohammad Rajiur
dc.creator.orcid0000-0002-4462-0274
dc.date.accessioned2022-06-29T23:04:29Z
dc.date.available2022-06-29T23:04:29Z
dc.date.createdMay 2021
dc.date.issued2021-05
dc.date.submittedMay 2021
dc.date.updated2022-06-29T23:04:30Z
dc.description.abstractRecorded lecture video is a popular and essential learning resource. A fundamental limitation of lecture video is the inability to access any content of interest quickly. Several lecture video management portals introduced additional navigation features like indexing, captioning, search, etc. Lecture video indexing is the automatic partitioning of videos into smaller segments, each discussing a particular topic. However, these indexes do not describe the content of the segment. My goal is to create a visual summary containing a subset of images extracted from a lecture video segment to enhance navigation. The quality of a visual summary depends on the uniqueness and importance of the images. The uniqueness is achieved by ensuring a diverse set of images that has low similarity between them. The importance is the desirability of an image to be included in the summary. Experimental results indicate a combination of keypoints-match and color histograms work best to identify unique objects, and a combination of the size and the number of keypoints can closely approximate the desirability of an image for including in the summary. This dissertation presents a graph-based algorithm that selects a subset of unique and important images for a visual summary. The results from this research are implemented into a real-world lecture video management portal called Videopoints. The evaluation is based on summaries provided by Videopoints users on a dataset of 120 video segments. The graph-based heuristic algorithm for identifying summary images achieves 66% F1-measure with frequently-selected images as the ground truth and 79% F1-measure with the union of all user-selected images as the ground truth. For 93.8% of algorithm selected visual summary images, at least one user also selected that image for their summary or considered it similar to another image they selected. Over 70% of automatically generated summaries were rated as good or very good by the users on a 4-point scale from poor to very good. Overall, the results establish that the methodology introduced in this dissertation produces good quality visual summaries that are practically useful for lecture video navigation.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.citationPortions of this document appear in: Rahman, Mohammad Rajiur, Shishir Shah, and Jaspal Subhlok. "Visual summarization of lecture video segments for enhanced navigation." In 2020 IEEE International Symposium on Multimedia (ISM), pp. 154-157. IEEE, 2020.
dc.identifier.urihttps://hdl.handle.net/10657/10197
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. UH Libraries has secured permission to reproduce any and all previously published materials contained in the work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectLecture Video Summarization, Visual Summarization, Summarization
dc.titleVisual Summarization of Lecture Videos to Enhance Navigation
dc.type.dcmiText
dc.type.genreThesis
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
RAHMAN-DISSERTATION-2021.pdf
Size:
8.84 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.44 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.82 KB
Format:
Plain Text
Description: