Human Detection in the Wild

dc.contributor.committeeMemberKakadiaris, Ioannis A.
dc.contributor.committeeMemberPrasad, Saurabh
dc.contributor.committeeMemberEick, Christoph F.
dc.contributor.committeeMemberGabriel, Edgar
dc.creatorShi, Lei
dc.creator.orcid0000-0002-9536-336X
dc.date.accessioned2021-06-09T17:33:00Z
dc.date.createdAugust 2020
dc.date.issued2020-08
dc.date.submittedAugust 2020
dc.date.updated2021-06-09T17:33:02Z
dc.description.abstractHuman detection remains a challenging task due to the problems caused by occlusion variance. Visible-body bounding boxes are typically used as an extra supervision signal to improve the performance of human detection. However, visible-body assisted approaches produce a large number of false positives, which result from a lack of adequate and discriminative full-body contextual information. As the most discriminative features of head and human, face detection has attracted much attention. Despite the great progress that has been achieved for accurate face detection, detecting multi-scale faces, especially for small faces, remains a challenging problem. Existing approaches that tackle multi-scale face detection problem could be categorized into two-stage face detectors and single-stage face detectors. Regarding two-stage face detectors, to learn discriminative facial features at various scales, the input pyramids or multi-scale feature maps are deployed to provide more facial information for the network to learn features in various scales. However, they could increase the training difficulty and complexity of the network. Regarding single-stage face detectors, feature fusion and context aggregation have been used to enrich contextual information. However, treating reliable information and noise equally could result in much noise in the fused features at different levels. Moreover, dilated convolutions in the context aggregation module could result in the gridding artifacts problem. The goal of this dissertation is to design, develop, and evaluate human detection algorithms to solve the above problems. Three contributions made in this dissertation could be summarized as follows: (i) A decoupled visible region network for human detection was designed, developed, and evaluated to overcome the occlusion challenge. The proposed human detector improved performance from MR$^{-2}$ of $11.24$ to MR$^{-2}$ of $10.50$ when compared to Bi-box which is inspired by our work on the CityPersons dataset. (ii) A two-stage face detector was designed, developed, and evaluated to overcome scale challenge. It improves performance by mAP of $12.1\%$ when compared to our baseline on the WIDER FACE dataset. (iii) A single-stage face detector was designed, developed, and evaluated to overcome scale challenge. The proposed method achieves the best performance with an mAP of $77.0\%$ on the UFDD dataset.
dc.description.departmentComputer Science, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.citationPortions of this document appear in: L. Shi, X. Xu, I. A. Kakadiaris. “SSFD+: A Robust Two-stage Face Detector,”IEEETransaction on Biometrics, Behavior, and Identity Science, 1(2019), 181-191.; L. Shi, X. Xu, I. A. Kakadiaris. “SANet: Smoothed Attention Netwok for Single-stage FaceDetector,” InProceedings of IEEE International Conference on Biometrics, (Crete, Greece,2019), pp. 11-20.; L. Shi, X. Xu, I. A. Kakadiaris. “SSFD: A Face Detector using A Single-scale Feature Map,”InProceedings of IEEE International Conference on Biometrics: Theory, Applications, andSystems, (LA, CA, 2018), pp. 1-10.
dc.identifier.urihttps://hdl.handle.net/10657/7759
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. UH Libraries has secured permission to reproduce any and all previously published materials contained in the work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectHuman detection, face detection
dc.titleHuman Detection in the Wild
dc.type.dcmiText
dc.type.genreThesis
local.embargo.lift2022-08-01
local.embargo.terms2022-08-01
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentComputer Science, Department of
thesis.degree.disciplineComputer Science
thesis.degree.grantorUniversity of Houston
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
SHI-DISSERTATION-2020.pdf
Size:
9.7 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.42 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: