Abstract
Deep Learning applications have become an important solution for analyzing and making predictions with massive amounts of data in recent years. However, this type of application introduces significant input/output (I/O) loads on computer systems. Moreover, when executed on distributed systems or parallel distributed memory systems, they handle much information that must be read during training. This persistent and continuous access to files can overwhelm file systems and negatively impact application performance. A file format defines how information is stored, and the choice of a format depends on the use case. Therefore, it is important to analyze how the file format influences the training stage when loading and reading the dataset, as opening and reading many small files could affect application performance. Thus, this paper will analyze the I/O pattern of different file formats used in deep learning applications to characterize their behavior.
Author supplied keywords
Cite
CITATION STYLE
Leon, B., Parraga, E., Mendez, S., Rexachs, D., Suppi, R., & Luque, E. (2025). Analyzing the Influence of File Formats on I/O Patterns in Deep Learning. In Communications in Computer and Information Science (Vol. 2256 CCIS, pp. 130–136). Springer Science and Business Media Deutschland GmbH. https://doi.org/10.1007/978-3-031-85638-9_10
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.