This paper presents an extensible architecture that can be used to support the integration of heterogeneous biological data sets. In our architecture, a clustering approach has been developed to support distributed biological data sources with inconsistent identification of biological objects. The architecture uses the AutoMed data integration toolkit to store the schemas of the data sources and the semi-automatically generated transformations from the source data into the data of an integrated warehouse. AutoMed supports bi-directional, extensible transformations which can be used to update the warehouse data as entities change, are added, or are deleted in the data sources. The transformations can also be used to support the addition or removal of entire data sources, or evolutions in the schemas of the data sources or of the warehouse itself. The results of using the architecture for the integration of existing genomic data sets are discussed. © Springer-Verlag Berlin Heidelberg 2005.
CITATION
Maibaum, M., Zamboulis, L., Rimon, G., Orengo, C., Martin, N., & Poulovassilis, A. (2005). Cluster based integration of heterogeneous biological databases using the AutoMed toolkit. In Lecture Notes in Bioinformatics (Subseries of Lecture Notes in Computer Science) (Vol. 3615, pp. 191–207). Springer Verlag. https://doi.org/10.1007/11530084_16