Canonical correlation analysis (CCA) is a method for reducing the dimension of data represented using two views. It has been previously used to derive word embeddings, where one view indicates a word, and the other view indicates its context. We describe a way to incorporate prior knowledge into CCA, give a theoretical justification for it, and test it by deriving word embeddings and evaluating them on a myriad of datasets.
CITATION STYLE
Osborne, D., Narayan, S., & Cohen, S. B. (2016). Encoding Prior Knowledge with Eigenword Embeddings. Transactions of the Association for Computational Linguistics, 4, 417–430. https://doi.org/10.1162/tacl_a_00108
Mendeley helps you to discover research relevant for your work.