Date of Degree
Computational Linguistics | Russian Linguistics
Russian loanwords, inflectional morphology, loanword detection, finite-state transducer, language model
This paper investigates recent English loanwords in Russian and explores ways in which computational methods can help further theoretical research. The goal of the study is two-fold: to find new, previously unattested loanwords borrowed over the last decade and to examine the rate of adaptation of the new borrowings, attested by the degree to which they conform to the constraints of the Russian language. First, we train a finite-state pipeline that combines character n-gram language models, which encode phonotactic and lexical properties of loanwords, with a binary classifier to detect loanwords. The model achieves state-of-the-art performance results during evaluation, surpassing previously established benchmarks. Secondly, we introduce a new and extended corpus of recent Russian loanwords that have been detected in Web texts by our model. The corpus includes loanwords together with their morphological features, part-of-speech tags, and sentences in which they occur. We conduct an analysis of inflectional morphology of the identified loanwords, investigating the rate of indeclinability of recent loanwords and stem-final consonant alternations in verbs.
Spektor, Yulia, "Detection and Morphological Analysis of Novel Russian Loanwords" (2021). CUNY Academic Works.