Even though the interest in machine learning studies is growing significantly, especially in medicine, the imbalance between study results and clinical relevance is more pronounced than ever. The reasons for this include data quality and interoperability issues. Hence, we aimed at examining site- and study-specific differences in publicly available standard electrocardiogram (ECG) datasets, which in theory should be interoperable by consistent 12-lead definition, sampling rate, and measurement duration. The focus lies upon the question of whether even slight study peculiarities can affect the stability of trained machine learning models. To this end, the performances of modern network architectures as well as unsupervised pattern detection algorithms are investigated across different datasets. Overall, this is intended to examine the generalization of machine learning results of single-site ECG studies.
CITATION STYLE
Plagwitz, L., Vogelsang, T., Doldi, F., Bickmann, L., Fujarski, M., Eckardt, L., & Varghese, J. (2023). The Necessity of Multiple Data Sources for ECG-Based Machine Learning Models. In Studies in Health Technology and Informatics (Vol. 302, pp. 33–37). IOS Press BV. https://doi.org/10.3233/SHTI230059
Mendeley helps you to discover research relevant for your work.