Deequ for Scalable Data Quality Assurance

The Complete Guide for Developers and Engineers

    • $9.500

Publisher Description

"Deequ for Scalable Data Quality Assurance"
In an era where data powers decision-making at every level, ensuring the quality of massive and fast-growing datasets poses unprecedented challenges. "Deequ for Scalable Data Quality Assurance" addresses this critical need by exploring not only the evolving standards and requirements for data quality in large-scale, modern systems but also the profound business and technical risks of neglecting it. The book begins by framing the dimensions of data quality—accuracy, completeness, consistency, timeliness, and validity—and critically evaluates traditional approaches, making a compelling case for automation and scalable, data-driven architectures.
At the heart of this work is a comprehensive exploration of Deequ, an open-source library purpose-built for automated, scalable data quality checks on distributed platforms such as Apache Spark. Through clear architectural exposition, the book demystifies Deequ’s foundational abstractions—metrics, checks, constraints, and analyzers—then guides readers in designing expressive, reusable, and parameterized validations. Advanced chapters reveal how to extend Deequ with custom metrics, orchestrate robust quality workflows in production, and integrate with CI/CD, monitoring, and audit frameworks, all while upholding security and regulatory compliance in areas such as GDPR and HIPAA.
Drawing from hands-on case studies in enterprise environments, the book illustrates the end-to-end lifecycle of data quality management—from automated detection and remediation to storytelling with actionable insights. Readers gain practical knowledge in deployment strategies, visualization, and root cause analytics while also being introduced to future trends in automated quality assurance and intelligent profiling. Whether you are a data engineer, architect, or leader, this book is an essential guide to mastering scalable data quality in the era of big data.

GENRE: Computers & Internet
RELEASED: July 24, 2025
LANGUAGE: EN (English)
LENGTH: 250 pages
PUBLISHER: HiTeX Press
SELLER: PublishDrive Inc.
SIZE: 1.9 MB
A Smaller history of Greece
1893
Recetas: Dieta Dash Para Principiantes: La Mejor Guía Para Perder Peso Y Presión Arterial Alta (Adelgazar)
2017
Hope Filled Recovery From Depression And Anxiety
2010
Elizabethan Sonnet Cycles: Idea, Fidesa and Chloris
2005
Deep Learning in JAX with Haiku
2025
ZeroTier Virtual Networks for Secure Remote Access
2025