Letter Frequency Analysis of Languages Using Latin Alphabet

  • Grigas G
  • Juškevičienė A
N/ACitations
Citations of this article
11Readers
Mendeley users who have this article in their library.

Abstract

The evaluation of the peculiarities of alphabets, particularly the frequency of letters is essential when designing keyboards, analysing texts, designing alphabet-based games, and doing some text mining. Thus, it is important to determine what might be useful for designers of text input tools, and of other technologies related to sets of letters. Knowledge of common features among different languages gives an opportunity to take advantage of the experience of other languages. Nowadays an increasing amount of texts is published on the Internet. In order to adequately compare the frequencies of letters in different languages used in the online space, Wikipedia texts have been selected as a source material for investigation. This paper presents the Method of the Adjacent Letter Frequency Differences in the frequency line, which helps to evaluate frequency breakpoints. This is a uniform evaluation criterion for 25 main languages using Latin script in order to highlight the similarities and differences among them. Research focuses on the letter frequency analysis in the area of rarely used native letters and frequently used foreign letters in a particular language. The frequency of the letters is one of the factors that determines the location of the keys for the language specific letters on the keyboard.

Cite

CITATION STYLE

APA

Grigas, G., & Juškevičienė, A. (2018). Letter Frequency Analysis of Languages Using Latin Alphabet. International Linguistics Research, 1(1), p18. https://doi.org/10.30560/ilr.v1n1p18

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free