Veridical Data Science Veridical Data Science
Adaptive Computation and Machine Learning series

Veridical Data Science

The Practice of Responsible Data Analysis and Decision Making

    • Pedido anticipado
    • Lanzamiento previsto: 15 oct 2024
    • USD 49.99
    • Pedido anticipado
    • USD 49.99

Descripción editorial

Using real-world data case studies, this innovative and accessible textbook introduces an actionable framework for conducting trustworthy data science.

Most textbooks present data science as a linear analytic process involving a set of statistical and computational techniques without accounting for the challenges intrinsic to real-world applications. Veridical Data Science, by contrast, embraces the reality that most projects begin with an ambiguous domain question and messy data; it acknowledges that datasets are mere approximations of reality while analyses are mental constructs. 
Bin Yu and Rebecca Barter employ the innovative Predictability, Computability, and Stability (PCS) framework to assess the trustworthiness and relevance of data-driven results relative to three sources of uncertainty that arise throughout the data science life cycle: the human decisions and judgment calls made during data collection, cleaning, and modeling. By providing real-world data case studies, intuitive explanations of common statistical and machine learning techniques, and supplementary R and Python code, Veridical Data Science offers a clear and actionable guide for conducting responsible data science. Requiring little background knowledge, this lucid, self-contained textbook provides a solid foundation and principled framework for future study of advanced methods in machine learning, statistics, and data science. 

Presents the Predictability, Computability, and Stability (PCS) methodology for producing trustworthy data-driven resultsTeaches how a data science project should be conducted from beginning to end, including extensive discussion of the data scientist's decision-making processCultivates critical thinking throughout the entire data science life cycleProvides practical examples and illuminating case studies of real-world data analysis problems with associated code, exercises, and solutionsSuitable for advanced undergraduate and graduate students, domain scientists, and practitioners

GÉNERO
Informática e Internet
DISPONIBLE
2024
15 de octubre
IDIOMA
EN
Inglés
EXTENSIÓN
526
Páginas
EDITORIAL
MIT Press
VENDEDOR
Penguin Random House LLC

Más libros de Bin Yu & Rebecca L. Barter

Graphene for Post-Moore Silicon Optoelectronics Graphene for Post-Moore Silicon Optoelectronics
2023
Campus Network Architectures and Technologies Campus Network Architectures and Technologies
2021

Otros libros de esta serie

Learning Theory from First Principles Learning Theory from First Principles
2024
Foundations of Computer Vision Foundations of Computer Vision
2024
Fairness and Machine Learning Fairness and Machine Learning
2023
Probabilistic Machine Learning Probabilistic Machine Learning
2023
Introduction to Online Convex Optimization, second edition Introduction to Online Convex Optimization, second edition
2022
Machine Learning from Weak Supervision Machine Learning from Weak Supervision
2022