Targeted Learning in Data Science

Mark J. Van der Laan; Sherri Rose

Book

Targeted Learning in Data Science

Van der Laan M
Rose S

(2018), 640

N/ACitations

59Readers

Abstract

This textbook for graduate students in statistics, data science, and public health deals with the practical challenges that come with big, complex, and dynamic data. It presents a scientific roadmap to translate real-world data science applications into formal statistical estimation problems by using the general template of targeted maximum likelihood estimators. These targeted machine learning algorithms estimate quantities of interest while still providing valid inference. Targeted learning methods within data science area critical component for solving scientific problems in the modern age. The techniques can answer complex questions including optimal rules for assigning treatment based on longitudinal data with time-dependent confounding, as well as other estimands in dependent data structures, such as networks. Included in Targeted Learning in Data Science are demonstrations with soft ware packages and real data sets that present a case that targeted learning is crucial for the next generation of statisticians and data scientists. Th is book is a sequel to the first textbook on machine learning for causal inference, Targeted Learning, published in 2011. Mark van der Laan, PhD, is Jiann-Ping Hsu/Karl E. Peace Professor of Biostatistics and Statistics at UC Berkeley. His research interests include statistical methods in genomics, survival analysis, censored data, machine learning, semiparametric models, causal inference, and targeted learning. Dr. van der Laan received the 2004 Mortimer Spiegelman Award, the 2005 Van Dantzig Award, the 2005 COPSS Snedecor Award, the 2005 COPSS Presidential Award, and has graduated over 40 PhD students in biostatistics and statistics. Sherri Rose, PhD, is Associate Professor of Health Care Policy (Biostatistics) at Harvard Medical School. Her work is centered on developing and integrating innovative statistical approaches to advance human health. Dr. Rose's methodological research focuses on nonparametric machine learning for causal inference and prediction. She co-leads the Health Policy Data Science Lab and currently serves as an associate editor for the Journal of the American Statistical Association and Biostatistics.-- Abbreviations and Notation -- Philosophy of Targeted Learning in Data Science -- Part I: Introductory Chapters.- 1. The Statistical Estimation Problem in Complex Longitudinal Big Data.- 2. Longitudinal Causal Models.- 3. Super Learner for Longitudinal Problems.- 4. Longitudinal Targeted Maximum Likelihood Estimation (LTMLE).- 5. Understanding LTMLE.- 6. Why LTMLE?.- Part II:Additional Core Topics.- 7. One-Step TMLE.- IV: Observational Longitudinal Data.- 19. Super Learning in the ICU.- 20. Stochastic Single-Time-Point Interventions.- 21. Stochastic Multiple-Time-Point Interventions on Monitoring and Treatment.- 22. Collaborative LTMLE.- Part V: Optimal Dynamic Regimes.- 23. Targeted Adaptive Designs Learning the Optimal Dynamic Treatment.- 24. Targeted Learning of the Optimal Dynamic Treatment.- 25. Optimal Dynamic Treatments under Resource Constraints.- Part VI: Computing.- 26. ltmle() for R.- 27. Scaled Super Learner for R.- 28. Scaling CTMLE for Julia.- Part VII: Special Topics.-29. Data-Adaptive Target Parameters.- 30. Double Robust Inference for LTMLE.- 31. Higher-Order TMLE.- Appendix.- A. Online Targeted Learning Theory.- B. Computerization of the calculation of efficient influence curve.- C. TMLE applied to Capture/Recapture.- D. TMLE for High Dimensional Linear Regression.- E. TMLE of Causal Effect Based on Observing a Single Time Series.

Cite

CITATION STYLE

APA

Van der Laan, M. J., & Rose, S. (2018). Targeted Learning in Data Science. Springer (p. 640).

Targeted Learning in Data Science

Abstract

Cite

Register to see more suggestions