Automated information extraction from web APIs documentation

Papa Alioune Ly; Carlos Pedrinaci; John Domingue

Conference Proceedings

Automated information extraction from web APIs documentation

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2012) 7651 LNCS 497-511

DOI: 10.1007/978-3-642-35063-4_36

11Citations

19Readers

Get full text

Abstract

A fundamental characteristic of Web APIs is the fact that, de facto, providers hardly follow any standard practices while implementing, publishing, and documenting their APIs. As a consequence, the discovery and use of these services by third parties is significantly hampered. In order to achieve further automation while exploiting Web APIs we present an approach for automatically extracting relevant technical information from the Web pages documenting them. In particular we have devised two algorithms that automatically extract technical details such as operation names, operation descriptions or URI templates from the documentation of Web APIs adopting either RPC or RESTful interfaces. The algorithms devised, which exploit advanced DOM processing as well as state of the art Information Extraction and Natural Language Processing techniques, have been evaluated against a detailed dataset exhibiting a high precision and recall-around 90% for both REST and RPC APIs-outperforming state of the art information extraction algorithms. © 2012 Springer-Verlag.

Author supplied keywords

Cite

CITATION STYLE

APA

Ly, P. A., Pedrinaci, C., & Domingue, J. (2012). Automated information extraction from web APIs documentation. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7651 LNCS, pp. 497–511). https://doi.org/10.1007/978-3-642-35063-4_36

Automated information extraction from web APIs documentation

Abstract

Author supplied keywords

Cite

Register to see more suggestions