On-line tools for sequence retrieval and multivariate statistics in molecular biology

Guy Perriere; Jean Thioulouse

Journal Article, Open Access

Bioinformatics (1996) 12(1) 63-69

DOI: 10.1093/bioinformatics/12.1.63

Abstract

We have developed a World-Wide Web server for browsing sequence collections structured under the ACNUC format and for performing multivariate analyses on sequences. General collections (such as GenBank or EMBL) and specialized data banks (such as Hovergen and NRSub) can be accessed. The system allows complex queries to be constructed, and the result of each query, a list of sequences, is stored on the server. This list can then be reused to run multivariate analyses on the sequences. Two example applications are shown. The first is a study of codon usage, using correspondence analysis on all the protein genes of Haemophilus influenzae Rd. This study allows the highly expressed genes and the integral membrane proteins of this organism to be identified. The second is an ordination of 70 aligned growth hormone protein sequences using principal coordinate analysis. With this method, we are able to re-establish the patterns of relationships between the sequences previously determined with tree-building programs. © 1996, Oxford University Press.
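To make the first application more concrete, the following is a minimal sketch of correspondence analysis applied to a genes-by-codons count table, written with NumPy. It is not the server's implementation: the function name `correspondence_analysis`, the `n_axes` parameter, and the toy `counts` table are all assumptions introduced here for illustration, and the sketch assumes every gene and every codon has a non-zero total count.

```python
import numpy as np

def correspondence_analysis(counts, n_axes=2):
    """Correspondence analysis of a contingency table via SVD.

    Hypothetical helper for illustration; rows are genes, columns are codons.
    Assumes no row or column of `counts` sums to zero.
    """
    counts = np.asarray(counts, dtype=float)
    P = counts / counts.sum()          # correspondence matrix (relative frequencies)
    r = P.sum(axis=1)                  # row masses (one per gene)
    c = P.sum(axis=0)                  # column masses (one per codon)
    expected = np.outer(r, c)          # frequencies expected under independence
    S = (P - expected) / np.sqrt(expected)   # standardized residuals
    U, sing, Vt = np.linalg.svd(S, full_matrices=False)
    # Principal coordinates of rows and columns on the first n_axes axes
    row_coords = (U[:, :n_axes] * sing[:n_axes]) / np.sqrt(r)[:, None]
    col_coords = (Vt.T[:, :n_axes] * sing[:n_axes]) / np.sqrt(c)[:, None]
    inertia = sing ** 2                # inertia (eigenvalue) carried by each axis
    return row_coords, col_coords, inertia

# Made-up codon counts for four genes and three codons (not real data).
counts = np.array([
    [30, 10, 5],
    [28, 12, 6],
    [5, 10, 30],
    [6, 12, 28],
])
row_coords, col_coords, inertia = correspondence_analysis(counts)
```

In a codon usage study, genes with similar codon preferences would be expected to lie close together on the first axes of `row_coords`, which is the kind of pattern the abstract describes using to single out groups of genes.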

Cite

Perriere, G., & Thioulouse, J. (1996). On-line tools for sequence retrieval and multivariate statistics in molecular biology. Bioinformatics, 12(1), 63–69. https://doi.org/10.1093/bioinformatics/12.1.63
