Due to the advances of information technology, new devices generate amount of data. Especially, XML is a standard format for data exchange. Therefore, processing big XML data is an important topic. We propose an efficient XML data processing mechanism, which includes a design of XMLInputFormat class, MapReduce modules, and an HBase schema. The mechanism scans an XML document to reconstruct parent-child relationships in the document. It generates deserialized paths, which are stored in HBase.
CITATION STYLE
Chen, S. Y., Chen, H. M., & Zeng, W. C. (2015). Efficient XML data processing based on mapreduce framework. In Lecture Notes in Electrical Engineering (Vol. 329, pp. 97–104). Springer Verlag. https://doi.org/10.1007/978-94-017-9558-6_12
Mendeley helps you to discover research relevant for your work.