Linguistic Diversity On Africa: Clustering Methods Application On Language Typology Data

dc.contributor.advisorGrigorenko, Elena L.
dc.contributor.committeeMemberFrancis, David J.
dc.contributor.committeeMemberRakhlin, Natalia V.
dc.creatorMarkov, Ilia
dc.creator.orcid0000-0002-1639-7968
dc.date.accessioned2024-01-24T19:35:52Z
dc.date.available2024-01-24T19:35:52Z
dc.date.createdDecember 2023
dc.date.issued2023-12
dc.date.updated2024-01-24T19:35:52Z
dc.description.abstractOne of the main goals in language research is to understand the distribution of linguistic diversity and its underlying principles. Linguistic typology allows us to characterize diverse languages in terms of their linguistic features, which can be used to create a relatively comprehensive description of any language. An important theoretical issue concerning linguistic diversity is whether it should be considered stochastic (randomly determined) or deterministic (based on a set of principles governing it). The latter may depend on a set of constraints imposed from inside or outside the linguistic system, i.e., language faculty in the narrow sense or the “interfaces” – aspects of perceptual, motor, and general cognition systems. The debate on the nature of language variation has not been settled. Africa, having the longest history of human settlement of any continent and hypothesized to be the place of origin of the Homo Sapiens, has the deepest genealogical relations between languages that are yet to be comprehensively and systematically described. By using a sample of the language typology data available in the World Atlas of Language Structures (WALS) and in the repository of crosslinguistic phonological inventory data (PHOIBLE 2) for languages of the African continent, we seek to address the following research objective: to investigate whether the typological diversity of languages in Africa can be characterized by clustering along two important structural divides: synthetic – analytic and tonal – non-tonal. Several methods, including latent class analysis and CFA models, hierarchical clustering, k-means family algorithms and CART modes, using feature networks focusing on relevant language domains were constructed to classify the data according to these structural divides. Those classification patterns can be further linked to several possible future interface hypotheses, leading to better understanding of human language.
dc.description.departmentPsychology, Department of
dc.format.digitalOriginborn digital
dc.format.mimetypeapplication/pdf
dc.identifier.urihttps://hdl.handle.net/10657/16117
dc.language.isoeng
dc.rightsThe author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subjectLinguistic typology
dc.subjectAfrican languages
dc.subjectcluster analysis
dc.subjectstructural equation modeling
dc.titleLinguistic Diversity On Africa: Clustering Methods Application On Language Typology Data
dc.type.dcmitext
dc.type.genreThesis
thesis.degree.collegeCollege of Liberal Arts and Social Sciences
thesis.degree.departmentPsychology, Department of
thesis.degree.disciplinePsychology
thesis.degree.grantorUniversity of Houston
thesis.degree.levelMasters
thesis.degree.nameMaster of Arts

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
MARKOV-THESIS-2023.pdf
Size:
1.33 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
4.43 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
1.81 KB
Format:
Plain Text
Description: