Use and maintenance of histograms for large scientific database access planning: A case study of a pharmaceutical data repository

0Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Scientific databases, and in particular chemical and biological databases, have reached massive sizes in recent years due to the improvement of bench-side high throughput screening tools used by scientists. This rapid increase has caused a shift in the bottleneck in discovery and product development from the bench side to the computational side, thus, creating a need for new computational tools that can facilitate the access and interpretation of such massive data. This paper discusses the design and implementation of the computation of a histogram to speed up access to large pharmaceutical databases. As opposed to traditional histograms in which approximate value distributions is obtained by grouping attribute values into buckets, the computation histogram proposed in this paper records the retrieval time and the calculation time of descriptors in a pharmaceutical drug candidate database. Both on-line and off-line update techniques are proposed to update the computation histogram so that an efficient query plan can be generated. The efficiency of the proposed computation histogram is demonstrated by using a drug candidate database which is used in the pharmaceutical drug discovery process. The histogram allows the result of a query to be either computed using a computational algorithm or retrieved from the database. In addition to the pharmaceutical drug candidate database, the proposed approach is applicable to other scientific databases such as biological and agroscience databases.

Cite

CITATION STYLE

APA

Miled, Z. B., Liu, J., Bukhres, O., Li, H., Martin, J., Balagopalakrishna, C., & Oppelt, R. (2004). Use and maintenance of histograms for large scientific database access planning: A case study of a pharmaceutical data repository. Journal of Intelligent Information Systems, 23(2), 145–178. https://doi.org/10.1023/B:JIIS.0000039533.13569.8e

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free