Indonesian news classification application with named entity recognition approach

  • Nurchim N
  • Nurmalitasari N
  • Long Z

Abstract

Many internet users now look for news through search engines, which return vast amounts of information. Because the set of news articles that appears changes quickly and dynamically, it is increasingly difficult to pick out what matters, so news information must be extracted in a way that surfaces the core content of each article. This is especially problematic for Indonesian, whose noun phrase entities take varied structures under shallow parsing or grammatical induction. Named Entity Recognition (NER) can address this because it extracts news entities in depth, starting from proper nouns in text documents, and supports tasks such as information retrieval, machine translation, question answering, and automatic summarization. This study aims to apply NER to Indonesian-language news classification. It follows Design-Based Research, whose process consists of (1) pre-implementation, (2) design, (3) implementation and revision, and (4) reflection and evaluation. The application was developed in Python using the Streamlit, BeautifulSoup, gnews, and spaCy libraries. In accuracy testing, the application achieved an F1-score of 89.69% across all entity types: place, figure, day, date, and organization.
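The abstract reports an F1-score over extracted entities but does not say how it was computed. As a rough illustration only, the sketch below shows one common way to score entity extraction: micro-averaged precision, recall, and F1 over exact (text, label) matches. The function name `entity_f1`, the label names, and the sample annotations are all hypothetical and are not taken from the study.

```python
from collections import Counter


def entity_f1(gold, predicted):
    """Micro-averaged precision, recall and F1 over (text, label) pairs."""
    gold_counts = Counter(gold)
    pred_counts = Counter(predicted)
    # An entity is correct only if both its text and its label match.
    true_positives = sum((gold_counts & pred_counts).values())
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical annotations for one headline (not data from the study).
gold = [
    ("Jakarta", "PLACE"),
    ("Joko Widodo", "FIGURE"),
    ("Senin", "DAY"),
    ("Bank Indonesia", "ORGANIZATION"),
]
predicted = [
    ("Jakarta", "PLACE"),
    ("Joko Widodo", "FIGURE"),
    ("Bank Indonesia", "PLACE"),  # right span, wrong label: counted as an error
]

precision, recall, f1 = entity_f1(gold, predicted)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

Exact matching is the strictest choice; an evaluation that gives credit for partial span overlap, or that averages per entity type, would yield different numbers.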

Cite

Nurchim, N., Nurmalitasari, N., & Long, Z. A. (2023). Indonesian news classification application with named entity recognition approach. JURNAL INFOTEL, 15(2), 130–134. https://doi.org/10.20895/infotel.v15i2.909
