Efficient algorithm for math formula semantic search

8Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

Mathematical formulae play an important role in many scientific domains. Regardless of the importance of mathematical formula search, conventional keyword-based retrieval methods are not sufficient for searching mathematical formulae, which are structured as trees. The increasing number as well as the structural complexity of mathematical formulae in scientific articles lead to the necessity for large-scale structure-aware formula search techniques. In this paper, we formulate three types of measures that represent distinctive features of semantic similarity of math formulae, and develop efficient hash-based algorithms for the approximate calculation. Our experiments using NTCIR-11 Math-2 Task dataset, a large-scale test collection for math information retrieval with about 60-million formulae, show that the proposed method improves the search precision while also keeps the scalability and runtime efficiency high.

Cite

CITATION STYLE

APA

Ohashi, S., Kristianto, G. Y., Topić, G., & Aizawa, A. (2016). Efficient algorithm for math formula semantic search. IEICE Transactions on Information and Systems, E99D(4), 979–988. https://doi.org/10.1587/transinf.2015DAP0023

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free