Linguistic Diversity On Africa: Clustering Methods Application On Language Typology Data

Markov, Ilia

Linguistic Diversity On Africa: Clustering Methods Application On Language Typology Data

dc.contributor.advisor	Grigorenko, Elena L.
dc.contributor.committeeMember	Francis, David J.
dc.contributor.committeeMember	Rakhlin, Natalia V.
dc.creator	Markov, Ilia
dc.creator.orcid	0000-0002-1639-7968
dc.date.accessioned	2024-01-24T19:35:52Z
dc.date.available	2024-01-24T19:35:52Z
dc.date.created	December 2023
dc.date.issued	2023-12
dc.date.updated	2024-01-24T19:35:52Z
dc.description.abstract	One of the main goals in language research is to understand the distribution of linguistic diversity and its underlying principles. Linguistic typology allows us to characterize diverse languages in terms of their linguistic features, which can be used to create a relatively comprehensive description of any language. An important theoretical issue concerning linguistic diversity is whether it should be considered stochastic (randomly determined) or deterministic (based on a set of principles governing it). The latter may depend on a set of constraints imposed from inside or outside the linguistic system, i.e., language faculty in the narrow sense or the “interfaces” – aspects of perceptual, motor, and general cognition systems. The debate on the nature of language variation has not been settled. Africa, having the longest history of human settlement of any continent and hypothesized to be the place of origin of the Homo Sapiens, has the deepest genealogical relations between languages that are yet to be comprehensively and systematically described. By using a sample of the language typology data available in the World Atlas of Language Structures (WALS) and in the repository of crosslinguistic phonological inventory data (PHOIBLE 2) for languages of the African continent, we seek to address the following research objective: to investigate whether the typological diversity of languages in Africa can be characterized by clustering along two important structural divides: synthetic – analytic and tonal – non-tonal. Several methods, including latent class analysis and CFA models, hierarchical clustering, k-means family algorithms and CART modes, using feature networks focusing on relevant language domains were constructed to classify the data according to these structural divides. Those classification patterns can be further linked to several possible future interface hypotheses, leading to better understanding of human language.
dc.description.department	Psychology, Department of
dc.format.digitalOrigin	born digital
dc.format.mimetype	application/pdf
dc.identifier.uri	https://hdl.handle.net/10657/16117
dc.language.iso	eng
dc.rights	The author of this work is the copyright owner. UH Libraries and the Texas Digital Library have their permission to store and provide access to this work. Further transmission, reproduction, or presentation of this work is prohibited except with permission of the author(s).
dc.subject	Linguistic typology
dc.subject	African languages
dc.subject	cluster analysis
dc.subject	structural equation modeling
dc.title	Linguistic Diversity On Africa: Clustering Methods Application On Language Typology Data
dc.type.dcmi	text
dc.type.genre	Thesis
thesis.degree.college	College of Liberal Arts and Social Sciences
thesis.degree.department	Psychology, Department of
thesis.degree.discipline	Psychology
thesis.degree.grantor	University of Houston
thesis.degree.level	Masters
thesis.degree.name	Master of Arts

Files

Original bundle

Now showing 1 - 1 of 1

Name:: MARKOV-THESIS-2023.pdf
Size:: 1.33 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 2 of 2

Name:: PROQUEST_LICENSE.txt
Size:: 4.43 KB
Format:: Plain Text
Description:

Download

Name:: LICENSE.txt
Size:: 1.81 KB
Format:: Plain Text
Description:

Download

Collections

Published ETD Collection