Abstract
Background: To understand biology and differences among various tissues or cell types, one typically searches for molecular features that display characteristic abundance patterns. Several specificity metrics have been introduced to identify tissue-specific molecular features, but these either require an equal number of replicates per tissue or cannot handle replicates at all. Results: We describe a non-parametric specificity score that is compatible with unequal sample group sizes. To demonstrate its usefulness, the specificity score was calculated on all GTEx samples, detecting known and novel tissue-specific genes. A web tool was developed to browse these results for genes or tissues of interest. An example Python implementation of SPECS is available at https://github.com/celineeveraert/SPECS. The precalculated SPECS results on the GTEx data are available through a user-friendly browser at specs.cmgg.be. Conclusions: SPECS is a non-parametric method that identifies known and novel specifically expressed genes. In addition, SPECS could be adopted for other features and applications.
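The abstract does not spell out the score itself, so the following is only a minimal sketch of how a rank-based (non-parametric) score can tolerate unequal group sizes; it is not the authors' implementation. It assumes a score that, for each tissue, averages the estimated probability that a sample from that tissue exceeds a sample from each other tissue, giving every other tissue equal weight. The function names `pairwise_exceedance` and `specificity_scores` and the expression values are hypothetical.

```python
from itertools import product


def pairwise_exceedance(target, other):
    """Estimate P(x > y) + 0.5 * P(x == y) for x in target, y in other."""
    wins = 0.0
    for x, y in product(target, other):
        if x > y:
            wins += 1.0
        elif x == y:
            wins += 0.5  # ties count as half a win
    return wins / (len(target) * len(other))


def specificity_scores(groups):
    """Per group, average the exceedance probability over all other groups.

    Assumption: each other group contributes equally, whatever its size,
    so large groups do not dominate the score.
    """
    scores = {}
    for name, values in groups.items():
        others = [v for k, v in groups.items() if k != name]
        total = sum(pairwise_exceedance(values, o) for o in others)
        scores[name] = total / len(others)
    return scores


# Hypothetical expression values for one gene; group sizes differ on purpose.
expression = {
    "liver": [9.1, 8.7, 9.4],
    "brain": [1.2, 0.8],
    "heart": [2.0, 1.5, 1.1, 1.9],
}
scores = specificity_scores(expression)
print(scores)
```

In this toy input every liver sample exceeds every brain and heart sample, so liver receives the maximum score of 1.0, while the other two groups score well below it. Because each comparison is normalized by the number of sample pairs, no group needs to be subsampled to a common size.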
Citation
Everaert, C., Volders, P. J., Morlion, A., Thas, O., & Mestdagh, P. (2020). SPECS: A non-parametric method to identify tissue-specific molecular features for unbalanced sample groups. BMC Bioinformatics, 21(1). https://doi.org/10.1186/s12859-020-3407-z