Producing “one vast index”: Google Book Search as an algorithmic system

Melissa K. Chalmers; Paul N. Edwards

Journal ArticleOPEN ACCESS

Producing “one vast index”: Google Book Search as an algorithmic system

Big Data and Society (2017) 4(2)

DOI: 10.1177/2053951717716950

10Citations

21Readers

Abstract

In 2004, Google embarked on a massive book digitization project. Forty library partners and billions of scanned pages later, Google Book Search has provided searchable text access to millions of books. While many details of Google’s conversion processes remain proprietary secret, here we piece together their general outlines by closely examining Google Book Search products, Google patents, and the entanglement of libraries and computer scientists in the longer history of digitization work. We argue that far from simply “scanning” books, Google’s efforts may be characterized as algorithmic digitization, strongly shaped by an equation of digital access with full-text searchability. We explore the consequences of Google’s algorithmic digitization system for what end users ultimately do and do not see, placing these effects in the context of the multiple technical, material, and legal challenges surrounding Google Book Search. By approaching digitization primarily as a text extraction and indexing challenge—an effort to convert print books into electronically searchable data—GBS enacts one possible future for books, in which they are defined largely by their textual content.

Author supplied keywords

Cite

CITATION STYLE

APA

Chalmers, M. K., & Edwards, P. N. (2017). Producing “one vast index”: Google Book Search as an algorithmic system. Big Data and Society, 4(2). https://doi.org/10.1177/2053951717716950

Producing “one vast index”: Google Book Search as an algorithmic system

Abstract

Author supplied keywords

Cite

Register to see more suggestions