In this book chapter, we examined the portability of several different well-known text mining techniques on patent text. We test the techniques by addressing three different relation extraction applications: acronym extraction, hyponymy extraction and factoid entity relation extraction. These applications require different types of natural language processing tools, from simple regular expression matching (acronym extraction), to part of speech and phrase chunking (hyponymy extraction), to a full-blown dependency parser (factoid extraction). With the relation extraction applications presented in this chapter, we want to elucidate the requirements needed of general natural language processing tools when deployed on patent text for a specific extraction task. On the other hand, we also present language technology methods which are already portable to the patent genre with no or only moderate adaptations to the text genre.
CITATION STYLE
Andersson, L., Hanbury, A., & Rauber, A. (2017). The Portability of Three Types of Text Mining Techniques into the Patent Text Genre (pp. 241–280). https://doi.org/10.1007/978-3-662-53817-3_9
Mendeley helps you to discover research relevant for your work.