Abstract
Data discoverability is a challenge in science gateway architectures. As the volume of data managed and shared through a science gateway grows, it is imperative to expose a search functionality which enables users to quickly navigate to files within their own data sets as well as to identify relevant files in shared or public data sets. Desirable qualities in a file search feature include scalability to arbitrary data sizes, rapid and responsive indexing triggered by user activity, and easy maintainability by development teams without specialist knowledge of search algorithms. We describe a search architecture built around Elasticsearch that meets each of these criteria, and which has been successfully implemented at the Texas Advanced Computing Center to enhance data discoverability in several science gateway projects.
Author supplied keywords
Cite
CITATION STYLE
Rosenberg, J., Coronel, J. B., Meiring, J., Gray, S., & Brown, T. (2019). Leveraging elasticsearch to improve data discoverability in science gateways. In ACM International Conference Proceeding Series. Association for Computing Machinery. https://doi.org/10.1145/3332186.3332230
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.