Using approximate matching to reduce the volume of digital data

Frank Breitinger; Christian Winter; York Yannikos; Tobias Fink; Michael Seefried

Conference ProceedingsOPEN ACCESS

Using approximate matching to reduce the volume of digital data

IFIP Advances in Information and Communication Technology (2014) 433 149-163

DOI: 10.1007/978-3-662-44952-3_11

1Citations

15Readers

Abstract

Digital forensic investigators frequently have to search for relevant files in massive digital corpora – a task often compared to finding a needle in a haystack. To address this challenge, investigators typically apply cryptographic hash functions to identify known files. However, cryptographic hashing only allows the detection of files that exactly match the known file hash values or fingerprints. This paper demonstrates the benefits of using approximate matching to locate relevant files. The experiments described in this paper used three test images of Windows XP, Windows 7 and Ubuntu 12.04 systems to evaluate fingerprint-based comparisons. The results reveal that approximate matching can improve file identification – in one case, increasing the identification rate from 1.82% to 23.76%.

Author supplied keywords

Cite

CITATION STYLE

APA

Breitinger, F., Winter, C., Yannikos, Y., Fink, T., & Seefried, M. (2014). Using approximate matching to reduce the volume of digital data. In IFIP Advances in Information and Communication Technology (Vol. 433, pp. 149–163). Springer New York LLC. https://doi.org/10.1007/978-3-662-44952-3_11

Using approximate matching to reduce the volume of digital data

Abstract

Author supplied keywords

Cite

Register to see more suggestions