Towards a Terabyte Digital Library System

Hao Ding; Yun Lin; Bin Liu

Journal Article

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2004) 2690 1042-1046

DOI: 10.1007/978-3-540-45080-1_147

Abstract

In the China-US Million Book Digital Library, the digitization process has produced more than one terabyte of text in OEB and PDF formats. To access these data quickly and accurately, we are developing a distributed terabyte text retrieval system. To address interoperability and extensibility across different information resources, we introduced solutions based on three kinds of metadata schemes. Furthermore, because of the complexity of the Chinese language, we developed a word segmentation approach to improve the efficiency and response time of the DL system. In the testbed, we added an extra layer in the cache server and designed a new algorithm based on the vector space model (VSM). With the query cache, the system can search less data while maintaining acceptable retrieval accuracy. © Springer-Verlag 2003.
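
The abstract does not specify the cache algorithm, so the following is only a minimal sketch of how a VSM-based query cache might work in general, not the authors' method. It assumes queries are already segmented into whitespace-separated tokens, uses raw term frequencies as vector weights, and reuses cached results when the cosine similarity to a stored query reaches a threshold. The names `QueryCache`, `tf_vector`, `cosine`, and the threshold value 0.8 are all hypothetical.

```python
import math
from collections import Counter


def tf_vector(text):
    """Build a term-frequency vector from whitespace-separated tokens.

    Assumption: the text is already word-segmented (for Chinese, a
    segmenter would have to run first).
    """
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-frequency vectors."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class QueryCache:
    """Hypothetical cache that matches new queries to stored ones by VSM similarity."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # assumed cut-off, not taken from the paper
        self.entries = []           # list of (query_vector, results)

    def store(self, query, results):
        self.entries.append((tf_vector(query), results))

    def lookup(self, query):
        """Return cached results of the most similar stored query, or None on a miss."""
        query_vec = tf_vector(query)
        best_score, best_results = 0.0, None
        for vec, results in self.entries:
            score = cosine(query_vec, vec)
            if score > best_score:
                best_score, best_results = score, results
        return best_results if best_score >= self.threshold else None


cache = QueryCache(threshold=0.8)
cache.store("digital library retrieval", ["doc1", "doc2"])
hit = cache.lookup("retrieval digital library")   # same terms, different order
miss = cache.lookup("chinese word segmentation")  # no shared terms
```

In this sketch a hit avoids searching the full index at all, which is one way a cache layer could let a system "search less data"; the trade-off is that a lower threshold returns more hits but risks serving results for a query that only loosely matches.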

Cite

Ding, H., Lin, Y., & Liu, B. (2004). Towards a Terabyte Digital Library System. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2690, 1042–1046. https://doi.org/10.1007/978-3-540-45080-1_147
