Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian

Vasyl Lytvyn; Victoria Vysotska; Petro Pukach; Zinovii Nytrebych; Ihor Demkiv; Andriy Senyk; Oksana Malanchuk; Svitlana Sachenko; Roman Kovalchuk; Nadiia Huzyk

Journal ArticleOPEN ACCESS

Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian

Eastern-European Journal of Enterprise Technologies (2018) 6(2-96) 19-31

DOI: 10.15587/1729-4061.2018.149596

41Citations

9Readers

Abstract

A formal approach was proposed to implement text content attribution. The study was conducted with Ukrainian scientific and technical texts. The results of application of the designed algorithms of automatic attribution of the text content based on the NLP and stylemetry methods were analyzed. Prospects and features of application of stylemetry information technologies for attribution of the text content were considered. Quantitative content analysis of scientific and technical text content takes advantage of content monitoring and text content analysis based on NLP, Web-Mining and stylemetry methods to identify the multitude of authors whose talking style is similar to that of the analyzed text fragment. This narrows the range of search for further use in the stylemetry methods to determine the degree of belonging of the analyzed text to a particular author. Decomposition of the attribution method was carried out based on analysis of such talking coefficients as lexical diversity, degree (measure) of syntactic complexity, talking coherence, indexes of exclusivity and concentration of the text. At the same time, author's style parameters such as the number of words in a certain text, the total number of words of this text, the number of sentences, the number of prepositions, the number of conjunctions, the number of words with occurrence frequency 1, the number of words with occurrence frequency 10 or more were analyzed. Further experimental study requires testing of the proposed method in identifying keywords of texts of other categories: scientific humanitarian, artistic, journalistic, etc.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Lytvyn, V., Vysotska, V., Pukach, P., Nytrebych, Z., Demkiv, I., Senyk, A., … Huzyk, N. (2018). Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian. Eastern-European Journal of Enterprise Technologies, 6(2–96), 19–31. https://doi.org/10.15587/1729-4061.2018.149596

Readers over time

Readers' Seniority

Professor / Associate Prof. 2

40%

Lecturer / Post doc 1

20%

PhD / Post grad / Masters / Doc 1

20%

Researcher 1

20%

Readers' Discipline

Business, Management and Accounting 2

33%

Mathematics 2

33%

Social Sciences 1

17%

Arts and Humanities 1

17%

Analysis of the developed quantitative method for automatic attribution of scientific and technical text content written in Ukrainian

Abstract

Author supplied keywords

References Powered by Scopus

Data mining for Web personalization

Method of integration and content management of the information resources network

Intellectual system design for content formation

Cited by Powered by Scopus

Design of a recommendation system based on Collaborative Filtering and machine learning considering personal needs of the user

Design of the architecture of an intelligent system for distributing commercial content in the internet space based on seo-technologies, neural networks, and machine learning

Identification of authorship of ukrainianlanguage texts of journalistic style using neural networks

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline