Textual case-based reasoning (TCBR) is a challenging problem because a single case may cover several topics and contain complex linguistic terms. Many efforts have been made to improve the retrieval process in TCBR using clustering methods. This paper proposes an enhanced clustering approach called GloSOPHIA (GloVe SOPHIA), which extends SOPHIA (SOPHisticated Information Analysis) by integrating word embeddings to enhance knowledge discovery in TCBR. To evaluate the quality of the proposed method, we apply GloSOPHIA to an Arabic newspaper corpus called watan-2004 and compare the results with SOPHIA, K-means, and Self-Organizing Map (SOM) under several evaluation criteria. The results show that GloSOPHIA outperforms the three other clustering methods on most of the evaluation criteria.
CITATION STYLE
Terra, E., Mohammed, A., & Hefny, H. A. (2020). GloSOPHIA: An Enhanced Textual Based Clustering Approach by Word Embeddings. In Advances in Intelligent Systems and Computing (Vol. 1058, pp. 700–710). Springer. https://doi.org/10.1007/978-3-030-31129-2_64