GloSOPHIA: An Enhanced Textual Based Clustering Approach by Word Embeddings

2Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Textual case based reasoning (TCBR) is a challenging problem because a single case may consist of different topics and complex linguistic terms. Many efforts have been made to enhance retrieval process in TCBR using clustering methods. This paper proposes an enhanced clustering approach called GloSOPHIA (GloVe SOPHIA). It is based on extending SOPHIA by integrating word embeddings technique to enhance knowledge discovery in TCBR. To evaluate the quality of the proposed method, we will apply the GloSOPHIA to an Arabic newspaper corpus called watan-2004 and will compare the results with SOPHIA (SOPHisticated Information Analysis), K-means, and Self-Organizing Map (SOM) with different types of evaluation criteria. The results show that GloSOPHIA outperforms the 3 other clustering methods in most of the evaluation criteria.

Cite

CITATION STYLE

APA

Terra, E., Mohammed, A., & Hefny, H. A. (2020). GloSOPHIA: An Enhanced Textual Based Clustering Approach by Word Embeddings. In Advances in Intelligent Systems and Computing (Vol. 1058, pp. 700–710). Springer. https://doi.org/10.1007/978-3-030-31129-2_64

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free