Motivation: Gene regulation often depends on functional modules which feature a detectable internal organisation. Overall sequence similarity of these modules is often insufficient for detection by general search methods like FASTA of even Gapped BLAST. However, it is of interest to evaluate whether modules, often known from experimental analysis of single sequences, are present in other regulatory sequences. Results: We developed a new method (FastM) which combines a search algorithm for individual transcription factor binding sited (MatInspector) with a distance correlation function. FastM allows fast definition of a model of correlated binding sited derived from as little as a single promotor or enhancer. ModelInspector results are suitable for evaluation of the significance of the model. We used FastM to define a model for the experimentally verified NFκB/IRF1 regulatory module from the major histocompatibility complex (MHC) class 1 HLA-B gene promoter. Analysis is of a test set of sequences as well as a database searches with this model showed excellent correlation of the model with the biological function of the module. These results could not be obtained by searches using FASTA of Gapped BLAST, which are based on sequence similarity. We were also able to demonstrate association of a hypothetical GRE-GRE module with viral sequences based on analysis of several GenBank sections with this module. Availability: The WWW version of FastM is accessible at: http://www.gsf.de/cgi-bin/fastm.pl and http://genomatix.gsf.de/cgi-bin/fastm2/fastm.pl. Contact: werner@@@gsf.de.
CITATION STYLE
Klingenhoff, A., Frech, K., Quandt, K., & Werner, T. (1999). Functional promoter modules can be detected by formal models independent of overall nucleotide sequence similarity. In Bioinformatics (Vol. 15, pp. 180–186). Oxford University Press. https://doi.org/10.1093/bioinformatics/15.3.180
Mendeley helps you to discover research relevant for your work.