Hadoop framework for entity recognition within high velocity streams using deep learning

Abstract

Social media platforms such as Twitter and Facebook are sources of stream data. They generate unstructured formal text on various topics, containing emotions expressed about persons, organizations, locations, movies, etc. Such stream data is characterized by high velocity and volume, and is often incomplete, incorrect, cryptic, and noisy. In our earlier work, a Hadoop framework was proposed for recognising and resolving entities within semi-structured data such as e-catalogs. This paper extends that framework to recognise and resolve entities in unstructured data such as tweets. Such a system can be used for data integration, de-duplication, event detection, and sentiment analysis. The proposed framework recognises predefined entities in streams, using Natural Language Processing (NLP) to extract local context features and MapReduce for entity resolution. Test results showed that the proposed entity recognition system could identify predefined entities such as location, organization, and person entities with an accuracy of 72%.
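The abstract does not give implementation details, so the following is only a minimal sketch of the general idea: a map step that recognises predefined entities in each tweet and a reduce step that resolves mentions sharing a normalized key. It runs in plain Python rather than on Hadoop, and it swaps in a toy dictionary lookup for the NLP and deep learning recognizer. The names `GAZETTEER`, `map_phase`, `reduce_phase`, and `run_job`, as well as the sample tweets, are hypothetical and do not come from the paper.

```python
from collections import defaultdict

# Hypothetical toy gazetteer standing in for a trained recognizer (assumption).
GAZETTEER = {
    "london": "LOCATION",
    "google": "ORGANIZATION",
    "obama": "PERSON",
}


def map_phase(tweet_id, text):
    """Emit (normalized mention, (entity type, tweet id, surface form)) pairs."""
    for token in text.split():
        # Strip hashtag/mention markers and common punctuation from the token.
        surface = token.strip("#@.,!?:;\"'")
        key = surface.lower()
        if key in GAZETTEER:
            yield key, (GAZETTEER[key], tweet_id, surface)


def reduce_phase(key, values):
    """Resolve all mentions sharing a key into a single entity record."""
    values = list(values)
    return {
        "entity": key,
        "type": values[0][0],
        "tweets": sorted({tweet_id for _, tweet_id, _ in values}),
        "surface_forms": sorted({surface for _, _, surface in values}),
    }


def run_job(tweets):
    """Simulate the shuffle: group mapper output by key, then reduce each group."""
    groups = defaultdict(list)
    for tweet_id, text in tweets.items():
        for key, value in map_phase(tweet_id, text):
            groups[key].append(value)
    return {key: reduce_phase(key, values) for key, values in groups.items()}


# Made-up sample tweets for illustration only.
TWEETS = {
    1: "Flying to #London tomorrow!",
    2: "LONDON is rainy, Google says so.",
    3: "@Obama spoke today",
}
RESOLVED = run_job(TWEETS)
```

In this sketch, the mentions "#London" and "LONDON" from two different tweets end up in one record under the key `london`, which illustrates resolution by normalized key. A real deployment would presumably replace the lookup with a statistical or neural recognizer and run the two phases as Hadoop mapper and reducer tasks.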

Cite

Vasavi, S., & Prabhakar Benny, S. (2018). Hadoop framework for entity recognition within high velocity streams using deep learning. In Advances in Intelligent Systems and Computing (Vol. 542, pp. 247–257). Springer Verlag. https://doi.org/10.1007/978-981-10-3223-3_23
