From XML to XML: The why and how of making the biodiversity literature accessible to researchers

  • Willis A
  • King D
  • Morse D
  • et al.

Abstract

We present the ABLE document collection, a set of annotated volumes of the Bulletin of the British Museum (Natural History). The collection follows from our work on automating the markup of scanned copies of the biodiversity literature in order to support working taxonomists. We describe an enhanced TEI XML markup language that serves as an intermediate stage in translating the initial XML obtained from Optical Character Recognition (OCR) into the target format, taXMLit. This intermediate representation allows additional information from external sources, such as a taxonomic thesaurus, to be incorporated before the final translation into taXMLit.
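
The three-stage translation described in the abstract (OCR XML, then an enriched intermediate form, then the target format) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the OCR input layout, the `taxon` and `rank` attributes, the `treatments` output elements, and the dictionary standing in for a taxonomic thesaurus are all hypothetical, and none of them reproduces the actual ABLE, TEI, or taXMLit schemas.

```python
import xml.etree.ElementTree as ET

# Hypothetical OCR output; element names are illustrative only.
OCR_XML = """<document>
  <page n="1">
    <line>Aus bus Smith, 1900</line>
    <line>Found in leaf litter.</line>
  </page>
</document>"""

# Hypothetical stand-in for a taxonomic thesaurus: name -> rank.
THESAURUS = {"Aus bus": "species"}


def ocr_to_intermediate(ocr_root):
    """Stage 1: copy each OCR line into a TEI-like <text><body><p> structure."""
    tei = ET.Element("TEI")
    body = ET.SubElement(ET.SubElement(tei, "text"), "body")
    for line in ocr_root.iter("line"):
        ET.SubElement(body, "p").text = line.text
    return tei


def enrich(tei, thesaurus):
    """Stage 2: annotate paragraphs that begin with a name in the thesaurus."""
    for p in tei.iter("p"):
        for name, rank in thesaurus.items():
            if p.text and p.text.startswith(name):
                # Assumed attribute names, not part of any real schema.
                p.set("taxon", name)
                p.set("rank", rank)
    return tei


def intermediate_to_target(tei):
    """Stage 3: emit a simplified target document from annotated paragraphs."""
    target = ET.Element("treatments")
    for p in tei.iter("p"):
        if "taxon" in p.attrib:
            treatment = ET.SubElement(target, "treatment", rank=p.get("rank"))
            ET.SubElement(treatment, "name").text = p.get("taxon")
    return target


def convert(ocr_xml, thesaurus):
    """Run the three stages in order on an OCR XML string."""
    tei = ocr_to_intermediate(ET.fromstring(ocr_xml))
    return intermediate_to_target(enrich(tei, thesaurus))


result = convert(OCR_XML, THESAURUS)
print(ET.tostring(result, encoding="unicode"))
```

Keeping the enrichment step separate from both translations mirrors the point made in the abstract: external information is attached to the intermediate document, so the final translation only has to read annotations that are already present.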

Find this document

  • PUI: 619603692
  • SGR: 84904663199
  • SCOPUS: 2-s2.0-84904663199
  • ISBN: 2-9517408-6-7

Authors

  • Alistair Willis
  • David King
  • David Morse
  • Anton Dil
  • Chris Lyal
  • David Roberts
