We present the Latvian Twitter Eater Corpus - a set of tweets in the narrow domain related to food, drinks, eating and drinking. The corpus has been collected over time-span of over 8 years and includes over 2 million tweets entailed with additional useful data. We also separate two sub-corpora of question and answer tweets and sentiment annotated tweets. We analyse the contents of the corpus and demonstrate use-cases for the sub-corpora by training domain-specific question-answering and sentiment-analysis models using the data from the corpus.
CITATION STYLE
Sproagis, U., & Rikters, M. (2020). What can we learn from almost a decade of food tweets. In Frontiers in Artificial Intelligence and Applications (Vol. 328, pp. 191–198). IOS Press BV. https://doi.org/10.3233/faia200622
Mendeley helps you to discover research relevant for your work.