Astronomy produces extremely large data sets from ground-based telescopes, space missions, and simulation. The volume and complexity of these rich data sets require new approaches and advanced tools to understand the information contained therein. No one can load this data on their own computer, most cannot even keep it at their institution, and worse, no platform exists that allows one to evaluate their models across the whole of the data. Simply having an extremely large volume of data available in one place is not sufficient; one must be able to make valid, rigorous, scientific comparisons across very different data sets from very different instrumentation. We propose a framework to directly address this which has the following components: a model-based computational platform, streamlined access to large volumes of data, and an educational and social platform for both researchers and the public.
Muna, D., & Huff, E. (2014). A New Framework for a Model-Based Data Science Computational Platform. ArXiv.Org, astro-ph.I. Retrieved from http://arxiv.org/abs/1402.5932v1