MBLA social corpus: Multipurpose multidimensional corpus on cyber-language

0Citations
Citations of this article
4Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Technological advances have made it possible for areas such as Corpus Linguistics and Computational Linguistics to advance exponentially. However, the basic evolution followed by corpora, as an essential tool in these areas, has been fundamentally in size. Proof of this is the Google nGram project, which has digitized a vast number of books from 1505 to the present day, allowing studies to be carried out on corpora. However, and as a result of the continuous evolution of new communication media and social networks, we have witnessed the birth of a new genre, called cyber-language, situated between orality and textuality, of which there are no specialized corpora. Our proposal is to design a tool to create a large multidimensional corpus based on the social network Twitter and a set of specific tools to generate subcorpora, conduct quantitative studies and visualize the stored information, from the perspective of bigdata manipulation.

Cite

CITATION STYLE

APA

Maroto Conde, Á. L., & Bermúdez Vázquez, M. (2019). MBLA social corpus: Multipurpose multidimensional corpus on cyber-language. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11755 LNAI, pp. 283–298). Springer. https://doi.org/10.1007/978-3-030-30135-4_21

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free