Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Andrij Rovenchak

Approaches to the classification of complex systems: Words, texts, and more

May 09, 2022

Andrij Rovenchak

Figure 1 for Approaches to the classification of complex systems: Words, texts, and more

Figure 2 for Approaches to the classification of complex systems: Words, texts, and more

Figure 3 for Approaches to the classification of complex systems: Words, texts, and more

Figure 4 for Approaches to the classification of complex systems: Words, texts, and more

Abstract:The Chapter starts with introductory information about quantitative linguistics notions, like rank--frequency dependence, Zipf's law, frequency spectra, etc. Similarities in distributions of words in texts with level occupation in quantum ensembles hint at a superficial analogy with statistical physics. This enables one to define various parameters for texts based on this physical analogy, including "temperature", "chemical potential", entropy, and some others. Such parameters provide a set of variables to classify texts serving as an example of complex systems. Moreover, texts are perhaps the easiest complex systems to collect and analyze. Similar approaches can be developed to study, for instance, genomes due to well-known linguistic analogies. We consider a couple of approaches to define nucleotide sequences in mitochondrial DNAs and viral RNAs and demonstrate their possible application as an auxiliary tool for comparative analysis of genomes. Finally, we discuss entropy as one of the parameters, which can be easily computed from rank--frequency dependences. Being a discriminating parameter in some problems of classification of complex systems, entropy can be given a proper interpretation only in a limited class of problems. Its overall role and significance remain an open issue so far.

* Chapter submitted to the book: Order, Disorder and Criticality: Advanced Problems of Phase Transition Theory. Ed. by Yu. Holovatch. Vol. 7, 2022, World Scientific, Singapore

Via

Access Paper or Ask Questions

Application of a Quantum Ensemble Model to Linguistic Analysis

Nov 23, 2010

Andrij Rovenchak, Solomija Buk

Figure 1 for Application of a Quantum Ensemble Model to Linguistic Analysis

Figure 2 for Application of a Quantum Ensemble Model to Linguistic Analysis

Figure 3 for Application of a Quantum Ensemble Model to Linguistic Analysis

Figure 4 for Application of a Quantum Ensemble Model to Linguistic Analysis

Abstract:A new set of parameters to describe the word frequency behavior of texts is proposed. The analogy between the word frequency distribution and the Bose-distribution is suggested and the notion of "temperature" is introduced for this case. The calculations are made for English, Ukrainian, and the Guinean Maninka languages. The correlation between in-deep language structure (the level of analyticity) and the defined parameters is shown to exist.

* Physica A, Volume 390, Issue 7, Pages 1326-1331 (2011)
* 13 pages; 4 figures; 1 table

Via

Access Paper or Ask Questions

Distribution of complexities in the Vai script

Oct 01, 2008

Andrij Rovenchak, Ján Mačutek, Charles Riley

Figure 1 for Distribution of complexities in the Vai script

Figure 2 for Distribution of complexities in the Vai script

Figure 3 for Distribution of complexities in the Vai script

Figure 4 for Distribution of complexities in the Vai script

Abstract:In the paper, we analyze the distribution of complexities in the Vai script, an indigenous syllabic writing system from Liberia. It is found that the uniformity hypothesis for complexities fails for this script. The models using Poisson distribution for the number of components and hyper-Poisson distribution for connections provide good fits in the case of the Vai script.

* Glottometrics 18, 1-12 (2009)
* 13 pages

Via

Access Paper or Ask Questions

Some properties of the Ukrainian writing system

Feb 28, 2008

Solomija Buk, Ján Mačutek, Andrij Rovenchak

Figure 1 for Some properties of the Ukrainian writing system

Figure 2 for Some properties of the Ukrainian writing system

Figure 3 for Some properties of the Ukrainian writing system

Figure 4 for Some properties of the Ukrainian writing system

Abstract:We investigate the grapheme-phoneme relation in Ukrainian and some properties of the Ukrainian version of the Cyrillic alphabet.

* Glottometrics 16, 63-79 (2008)
* 17 pages

Via

Access Paper or Ask Questions

Online-concordance "Perekhresni stezhky" ("The Cross-Paths"), a novel by Ivan Franko

Jan 21, 2008

Solomiya Buk, Andrij Rovenchak

Abstract:In the article, theoretical principles and practical realization for the compilation of the concordance to "Perekhresni stezhky" ("The Cross-Paths"), a novel by Ivan Franko, are described. Two forms for the context presentation are proposed. The electronic version of this lexicographic work is available online.

* Ivan Franko: Spirit, Science, Thought, Will (Proceedings of the International Scientific Congress dedicated to the 150th anniversary (Lviv, 27 September -- 1 October 2006, Lviv University Press, Vol. 2, pp. 203-211, 2010)
* in Ukrainian

Via

Access Paper or Ask Questions

Menzerath-Altmann Law for Syntactic Structures in Ukrainian

Jan 30, 2007

Solomija Buk, Andrij Rovenchak

Figure 1 for Menzerath-Altmann Law for Syntactic Structures in Ukrainian

Figure 2 for Menzerath-Altmann Law for Syntactic Structures in Ukrainian

Figure 3 for Menzerath-Altmann Law for Syntactic Structures in Ukrainian

Figure 4 for Menzerath-Altmann Law for Syntactic Structures in Ukrainian

Abstract:In the paper, the definition of clause suitable for an automated processing of a Ukrainian text is proposed. The Menzerath-Altmann law is verified on the sentence level and the parameters for the dependences of the clause length counted in words and syllables on the sentence length counted in clauses are calculated for "Perekhresni Stezhky" ("The Cross-Paths"), a novel by Ivan Franko.

* Glottotheory. Vol. 1, No. 1, pp 10-17 (2008)
* 8 pages; submitted to the Proceedings of the International scientific conference on Modern Methods in Linguistics held in honour of the anniversary of Prof. Gabriel L. Altmann (October 23rd and 24th, 2006, Budmerice Castle, Slovakia)

Via

Access Paper or Ask Questions

Statistical Parameters of the Novel "Perekhresni stezhky" ("The Cross-Paths") by Ivan Franko

Dec 28, 2005

Solomija Buk, Andrij Rovenchak

Figure 1 for Statistical Parameters of the Novel "Perekhresni stezhky" ("The Cross-Paths") by Ivan Franko

Figure 2 for Statistical Parameters of the Novel "Perekhresni stezhky" ("The Cross-Paths") by Ivan Franko

Figure 3 for Statistical Parameters of the Novel "Perekhresni stezhky" ("The Cross-Paths") by Ivan Franko

Figure 4 for Statistical Parameters of the Novel "Perekhresni stezhky" ("The Cross-Paths") by Ivan Franko

Abstract:In the paper, a complex statistical characteristics of a Ukrainian novel is given for the first time. The distribution of word-forms with respect to their size is studied. The linguistic laws by Zipf-Mandelbrot and Altmann-Menzerath are analyzed.

* Quantitative Linguistics 62: Exact methods in the study of language and text: dedicated to Professor Gabriel Altmann on the occasion of his 75th birthday / Ed. by P. Grzybek and R. Kohler (Berlin; New York: de Gruyter), 39-48 (2007)
* 11 pages

Via

Access Paper or Ask Questions