We perform an LHC data analysis workflow using tools and data formats that are commonly used in the "Big Data" community outside High Energy Physics (HEP). These include Apache Avro for serialisation to binary files, Pig and Hadoop for mass data processing, and Python Scikit-Learn for multivariate analysis. A comparison is made with the same analysis performed with current HEP tools in ROOT. © Published under licence by IOP Publishing Ltd.
Bhimji, W., Bristow, T., & Washbrook, A. (2014). HEPDOOP: High-Energy Physics analysis using Hadoop. In Journal of Physics: Conference Series (Vol. 513). Institute of Physics Publishing. https://doi.org/10.1088/1742-6596/513/2/022004