Rainfall Prediction Based on Environmental Data

dc.contributor.advisorAzencott, Robert
dc.contributor.advisorJun, Mikyoung
dc.contributor.committeeMemberLabate, Demetrio
dc.contributor.committeeMemberSchumacher, Courtney
dc.creatorSun, Chunmei
dc.creator.orcid0000-0002-8849-1056
dc.date.accessioned2024-01-20T21:00:53Z
dc.date.createdAugust 2023
dc.date.issued2023-08
dc.date.updated2024-01-20T21:00:53Z
dc.description.abstractOur study explores the feasibility of predicting the rare rainfall events in the tropical East Pacific area using the environmental variables from the profiles of humidity, temperature, and wind. The prediction was conducted over three different rain types (deep convective, stratiform, and shallow convective) from GPM satellite radar. Environmental variables were from MERRA-2 reanalysis data. The environmental features at 28 pressure levels at the target pixel with the features at the neighboring pixels were used as our predictors for predicting the rare events at the target pixel. Prior to fitting the model, we applied feature rarity recoding. Rather than predicting rainfall intensity, our objective was to predict the occurrence of rare rainfall events. Our study defined three rare rainfall events by three threshold values, namely, 99.5\%, 99\%, and 95\% quantiles of rain levels. To address the issue that rainfall patterns in the large East Pacific area are not uniform, we proposed dividing it into 15 distinct geographic zones. Each rare event prediction task focused on a specific combination of rain type, rare event, and geographic zone. Multiple independent random forest classifiers were employed to construct an optimal classifier for each prediction task, with evaluation conducted on the validation set. The results suggest that our methodology achieved an accuracy of over 90\% in predicting rare rainfall events. The validation set accuracies for rare rainfall defined by 99.5\% and 99\% quantiles are higher than those defined by 95\% quantiles. The validation set accuracies of predicting deep convective and stratiform rain types are comparable and higher than those for predicting shallow convective rain. To assess the impact of environmental features on rare rainfall prediction, we employed the permutation importance method to evaluate feature importance. This analysis was implemented for the corresponding optimal classifier and systematically on regrouped features across 5 successive pressure levels. Our findings indicate that humidity and temperature play a critical role in predicting rare rainfall events, surpassing the importance of wind features. Specifically, humidity features at middle altitudes are important for deep convective and stratiform rain, while low altitude humidity features are important for shallow convective rain. Temperature features at low altitudes are crucial for deep convective and stratiform rain, while both low and high altitude temperature features are relevant for shallow convective rain. Wind features at low altitudes exhibit importance across all three rain types. Furthermore, we applied the K-means clustering algorithm to regroup the 15 geographic zones into 4 distinct clusters. This clustering approach allowed us to explore similarities and patterns across the geographic zones.
dc.description.departmentMathematics, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10657/15933
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectRainfall
dc.subjectRare events
dc.subjectPrediction
dc.subjectRandom forest classifier
dc.subjectPermutation importance
dc.titleRainfall Prediction Based on Environmental Data
dc.type.dcmitext
dc.type.genreThesis
dcterms.accessRightsThe full text of this item is not available at this time because the student has placed this item under an embargo for a period of time. The Libraries are not authorized to provide a copy of this work during the embargo period.
local.embargo.lift2025-08-01
local.embargo.terms2025-08-01
thesis.degree.collegeCollege of Natural Sciences and Mathematics
thesis.degree.departmentMathematics, Department of
thesis.degree.disciplineMathematics
thesis.degree.grantorUniversity of Houston
thesis.degree.levelDoctoral
thesis.degree.nameDoctor of Philosophy

Files

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.43 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: