We present a system for resolving entities and disambiguating locations based on publicly available web data in the domain of ancient Hindu Temples. Scarce, unstructured information poses a challenge to Entity Resolution(ER) and snippet ranking. Additionally, because the same set of entities may be associated with multiple locations, Location Disambiguation(LD) is a problem. The mentions and descriptions of temples1 exist in the order of hundreds of thousands, with such data generated by various users in various forms such as text (Wikipedia pages), videos (YouTube videos), blogs, etc. We demonstrate an integrated approach using a combination of grammar rules for parsing and unsupervised (clustering) algorithms to resolve entity and locations with high confidence. A demo of our system is accessible at tinyurl.com/templedemos2. Our system is open source and available on GitHub3
CITATION STYLE
Maheshwari, A., Kumar, V., Ramakrishnan, G., & Saketha Nath, J. (2018). Entity Resolution and Location Disambiguation in the Ancient Hindu Temples Domain using Web Data. In NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Demonstrations Session (pp. 46–50). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n18-5010
Mendeley helps you to discover research relevant for your work.