CORE, a harvesting service offering access to millions of open access research papers from around the world, has shifted its harvesting process from following a monolithic approach to the adoption of a microservices infrastructure. In this paper, we explain how we re-arranged and re-scheduled our old ingestion pipeline, present CORE’s move to managing microservices and outline the tools we use in a new and optimised ingestion system. In addition, we discuss the inefficiencies of our old harvesting process, the advantages, and challenges of our new ingestion system and our future plans. We conclude that via the adoption of microservices architecture we managed to achieve a scalable and distributed system that would assist with CORE’s future performance and evolution.
CITATION STYLE
Cancellieri, M., Pontika, N., Pearce, S., Anastasiou, L., & Knoth, P. (2017). Building scalable digital library ingestion pipelines using microservices. In Communications in Computer and Information Science (Vol. 755, pp. 275–285). Springer Verlag. https://doi.org/10.1007/978-3-319-70863-8_27
Mendeley helps you to discover research relevant for your work.