Replication Data for: The decade construction rivalry in Russian: Using a corpus to study historical linguistics

DOI

This dataset contains 3 data files, 5 files with R code, and a short read-me file with documentation. The data files contain information about the development of two competing constructions in Russian temporal adverbials. The files with R code give the code for analysis of the databases.

ARTICLE ABSTRACT: What can a corpus do for the historical linguist? How can corpus data shed light on the diachronic development of so-called rival forms, i.e., words or grammatical constructions that appear to be synonyms? This article addresses these questions based on a detailed empirical analysis of two seemingly synonymous constructions in Russian. Corresponding to the English ‘decade construction’ in the twenties, Russian has two rival constructions, viz. v dvadcatye gody [lit. “in the twentieth years”] (with the numeral and noun in the accusative) and v dvadcatyx godax (with the numeral and noun in the locative case). Three hypotheses about rival forms are considered: leveling (whereby one form ousts its rival), sociolinguistic differentiation (whereby the two rivals survive in different varieties of a language) and semantic differentiation (whereby the two rivals develop different meanings over time). Contrary to what has been suggested in the literature, we find little evidence for semantic and sociolinguistic differentiation. Instead, we demonstrate that leveling is taking place, since the accusative construction is in the process of ousting its rival. While our study shows that corpus data facilitate detailed analysis of the interaction between leveling, sociolinguistic differentiation and semantic differentiation, our analysis also points to limitations, especially when it comes to corpus-based analysis of sociolinguistic and semantic factors.

Identifier
DOI https://doi.org/10.18710/QKHCVE
Related Identifier https://doi.org/10.1075/dia.16043.nes
Metadata Access https://dataverse.no/oai?verb=GetRecord&metadataPrefix=oai_datacite&identifier=doi:10.18710/QKHCVE
Provenance
Creator Nesset, Tore; Makarova, Anastasia
Publisher DataverseNO
Contributor Nesset, Tore; UiT The Arctic University of Norway; The Tromsø Repository of Language and Linguistics
Publication Year 2017
Rights CC0 1.0; info:eu-repo/semantics/openAccess; http://creativecommons.org/publicdomain/zero/1.0
OpenAccess true
Contact Nesset, Tore (UiT The Arctic University of Norway)
Representation
Resource Type Corpus data; Dataset
Format text/plain; text/csv; text/tab-separated-values
Size 6789; 6485190; 8037582; 3809376; 958895; 781; 805; 1069; 1721; 1169
Version 1.2
Discipline Humanities; Linguistics