The SUMMA platform: A scalable infrastructure for multi-lingual multi-media monitoring

Abstract

The open-source SUMMA Platform is a highly scalable distributed architecture for monitoring a large number of media broadcasts in parallel, with a lag behind actual broadcast time of at most a few minutes. The Platform offers a fully automated media ingestion pipeline capable of recording live broadcasts, detecting and transcribing spoken content, translating all text (original or transcribed) into English, recognizing and linking Named Entities, detecting topics, clustering related media items and summarizing them with cross-lingual multi-document summarization, and extracting and storing the factual claims made in these news items. Browser-based graphical user interfaces provide users with aggregated information as well as structured access to individual news items stored in the Platform's database. This paper describes the intended use cases and provides an overview of the system's implementation.

Cite

Germann, U., Liepiņs, R., Barzdins, G., Gosko, D., Miranda, S., & Nogueira, D. (2018). The SUMMA platform: A scalable infrastructure for multi-lingual multi-media monitoring. In ACL 2018 - 56th Annual Meeting of the Association for Computational Linguistics, Proceedings of System Demonstrations (pp. 99–104). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/p18-4017
