Data-driven identification of situated meanings in corpus data using Latent Class Analysis

Abstract: Identifying the meanings of grammatical elements in context is a major challenge for corpus-linguistic studies of grammatical variation. This study proposes a novel solution to this problem. I describe the situated meanings of grammatical elements as latent constructs, i.e., social concepts that cannot be observed directly but need to be inferred from the way that speakers behave. I use Latent Class Analysis (LCA) to create a data-driven typology of meanings for three modal periphrases in spoken Spanish and compare this typology to manual classification of the data in terms of modality. My findings show that (a) the situated meanings identified by the LCA do not directly correspond to the modal meanings that are commonly assumed to govern the variation between the three periphrases, and (b) the data-driven typology of meanings explains better the variation between these periphrases.

Location
Deutsche Nationalbibliothek Frankfurt am Main
Extent
Online-Ressource
Language
Englisch

Bibliographic citation
Data-driven identification of situated meanings in corpus data using Latent Class Analysis ; volume:10 ; number:1 ; year:2024 ; extent:30
Open linguistics ; 10, Heft 1 (2024) (gesamt 30)

Creator

DOI
10.1515/opli-2024-0029
URN
urn:nbn:de:101:1-2411211559022.376113609004
Rights
Open Access; Der Zugriff auf das Objekt ist unbeschränkt möglich.
Last update
15.08.2025, 7:36 AM CEST

Data provider

This object is provided by:
Deutsche Nationalbibliothek. If you have any questions about the object, please contact the data provider.

Associated

Other Objects (12)