We introduce StarFlow, a script-centric environment for data analysis. StarFlow has four main features: (1) extraction of control and data-flow dependencies through a novel combination of static analysis, dynamic runtime analysis, and user annotations, (2) command-line tools for exploring and propagating changes through the resulting dependency network, (3) support for workflow abstractions enabling robust parallel executions of complex analysis pipelines, and (4) a seamless interface with the Python scripting language. We describe real applications of StarFlow, including automatic parallelization of complex workflows in the cloud. © 2010 Springer-Verlag Berlin Heidelberg.
CITATION STYLE
Angelino, E., Yamins, D., & Seltzer, M. (2010). StarFlow: A script-centric data analysis environment. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 6378 LNCS, pp. 236–250). https://doi.org/10.1007/978-3-642-17819-1_27
Mendeley helps you to discover research relevant for your work.