Improvement in XML Keyword Search and Ranking for Data Analytics

Vasudev Yadav; Pradeep Tomar; Prabhjot Singh; Gurjit Kaur

Conference Proceedings

Improvement in XML Keyword Search and Ranking for Data Analytics

Smart Innovation, Systems and Technologies (2020) 141 339-349

DOI: 10.1007/978-981-13-8406-6_33

2Citations

1Readers

Get full text

Abstract

The success of web search engine for an ordinary user (Initially, search engine requires very precise query which only expert can write.) motivates the search engine for XML database. XML-based search engine requires DOM parser to parse the XML database. DOM parser produces a tree, which developed only in main memory. But generally XML database is larger than the main memory. Therefore, DOM parser has a disadvantage in case of large database. Instead of using DOM parser, Sax parser is used. SAX parser parses the XML file character by character. Means no requirement of the whole file in main memory, and unlikely DOM parser SAX parser requires no tree. SAX parser consumes less time than DOM Parser also. Searching take a lot of time by hitting the database again and again to fetch the same or recently used data. The solution is a simple cache memory. Cache memory is developed by storing recently used data into hashmap because hash map provides the O(1) search time complexity. Ranking use only use IDF*TF score to calculate the result. But this algorithm does not provide the best ranking. Ranking using cosine similarity algorithm is a better approach. (Basically, Cosine algorithm is used to find similarity between two documents.).

Author supplied keywords

Cite

CITATION STYLE

APA

Yadav, V., Tomar, P., Singh, P., & Kaur, G. (2020). Improvement in XML Keyword Search and Ranking for Data Analytics. In Smart Innovation, Systems and Technologies (Vol. 141, pp. 339–349). Springer. https://doi.org/10.1007/978-981-13-8406-6_33

Improvement in XML Keyword Search and Ranking for Data Analytics

Abstract

Author supplied keywords

Cite

Register to see more suggestions