High-dimensional data, or high-feature variables, are often used to describe the characteristics of microRNA sequence and microarray data. As a consequence, the curse of high dimension often becomes a problem. High-dimension variables lead to many difficulties in processing and can be hard to understand. On the other aspect, as the sample size rather limited, the more variables, the more statistical error would be produced in the data processing. For the purpose of decreasing the dimension of variables, a degenerated kmer method was suggested. To enhance the statistical robustness, the gapped k-mer method was introduced. In the last part of this chapter, some traditional supervised and unsupervised mathematical methods that used to decrease the dimensionality of the data are also described.
CITATION STYLE
Hu, Y., Lan, W., & Miller, D. (2017). Handling high-dimension (high-feature) microRNA data. Methods in Molecular Biology, 1617, 179–186. https://doi.org/10.1007/978-1-4939-7046-9_13
Mendeley helps you to discover research relevant for your work.