The length of a news article may influence people's interest in reading it. Text summarization can help by creating a shorter, representative version of an article, reducing reading time. This paper proposes using weighted word embeddings based on Word2Vec, FastText, and bidirectional encoder representations from transformers (BERT) models to enhance the TextRank summarization algorithm. Weighting the word embeddings is intended to produce better sentence representations and, in turn, more accurate summaries. The results show that using (unweighted) word embeddings significantly improves the performance of the TextRank algorithm, with the best performance achieved by the summarization system using BERT word embeddings. When each word embedding is weighted using term frequency-inverse document frequency (TF-IDF), the performance of all systems improves further and significantly over their unweighted counterparts, with the largest gains achieved by the systems using Word2Vec (6.80% to 12.92% increase) and FastText (7.04% to 12.78% increase). Overall, our systems using weighted word embeddings outperform the TextRank method by up to 17.33% in ROUGE-1 and 30.01% in ROUGE-2, demonstrating the effectiveness of weighted word embeddings in the TextRank algorithm for text summarization.
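As a rough sketch of the kind of pipeline the abstract describes (not the authors' exact implementation), the Python example below builds sentence vectors as TF-IDF-weighted averages of pretrained static word embeddings (in the style of Word2Vec or FastText; contextual BERT embeddings would require a model forward pass rather than a dictionary lookup) and then ranks sentences with PageRank over a cosine-similarity graph, as TextRank does. The function name summarize and the embeddings word-to-vector dictionary are illustrative assumptions.

# Illustrative sketch of TF-IDF-weighted embedding TextRank; `summarize`
# and the `embeddings` lookup are assumptions, not the paper's code.
import numpy as np
import networkx as nx
from sklearn.feature_extraction.text import TfidfVectorizer

def summarize(sentences, embeddings, dim=300, top_k=3):
    """Rank sentences with PageRank over a similarity graph built from
    TF-IDF-weighted averages of word embeddings."""
    tfidf = TfidfVectorizer()
    weights = tfidf.fit_transform(sentences)   # sparse (n_sentences, vocab)
    vocab = tfidf.vocabulary_
    analyze = tfidf.build_analyzer()

    # Sentence vector = TF-IDF-weighted average of its word embeddings.
    vecs = np.zeros((len(sentences), dim))
    for i, sent in enumerate(sentences):
        total = 0.0
        for word in analyze(sent):
            if word in vocab and word in embeddings:
                w = weights[i, vocab[word]]
                vecs[i] += w * embeddings[word]
                total += w
        if total > 0.0:
            vecs[i] /= total

    # Cosine-similarity graph between sentences (negative similarities
    # clipped to zero so PageRank edge weights stay valid).
    norms = np.linalg.norm(vecs, axis=1, keepdims=True)
    norms[norms == 0.0] = 1.0
    unit = vecs / norms
    sim = np.clip(unit @ unit.T, 0.0, None)
    np.fill_diagonal(sim, 0.0)

    scores = nx.pagerank(nx.from_numpy_array(sim))

    # Return the top-k sentences, restored to original document order.
    top = sorted(sorted(scores, key=scores.get, reverse=True)[:top_k])
    return [sentences[i] for i in top]

Setting every weight w to 1 in this sketch recovers the unweighted-embedding baseline that the abstract compares against, which is the design difference the reported ROUGE gains attribute to TF-IDF weighting.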
Yulianti, E., Pangestu, N., & Jiwanggi, M. A. (2023). Enhanced TextRank using weighted word embedding for text summarization. International Journal of Electrical and Computer Engineering, 13(5), 5472–5482. https://doi.org/10.11591/ijece.v13i5.pp5472-5482