Abstract
This paper presents an alternative algorithm based on the singular value decomposition (SVD) that creates vector representation for linguistic units with reduced dimensionality. The work was motivated by an application aimed to represent text segments for further processing in a multi-document summarization system. The algorithm tries to compensate for SVD's bias towards dominant-topic documents. Our experiments on measuring document similarities have shown that the algorithm achieves higher average precision with lower number of dimensions than the baseline algorithms - the SVD and the vector space model.
Cite
CITATION STYLE
Huang, F., & Wilks, Y. (2007). Clustered sub-matrix singular value decomposition. In NAACL-HLT 2007 - Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, Companion Volume: Short Papers (pp. 69–72). Association for Computational Linguistics (ACL). https://doi.org/10.3115/1614108.1614126
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.