Scaling Machine Learning with Spark Scaling Machine Learning with Spark

Scaling Machine Learning with Spark

    • $72.99
    • $72.99

Publisher Description

Learn how to build end-to-end scalable machine learning solutions with Apache Spark. With this practical guide, author Adi Polak introduces data and ML practitioners to creative solutions that supersede today's traditional methods. You'll learn a more holistic approach that takes you beyond specific requirements and organizational goals--allowing data and ML practitioners to collaborate and understand each other better.

Scaling Machine Learning with Spark examines several technologies for building end-to-end distributed ML workflows based on the Apache Spark ecosystem with Spark MLlib, MLflow, TensorFlow, and PyTorch. If you're a data scientist who works with machine learning, this book shows you when and why to use each technology.

You will:
Explore machine learning, including distributed computing concepts and terminologyManage the ML lifecycle with MLflowIngest data and perform basic preprocessing with SparkExplore feature engineering, and use Spark to extract featuresTrain a model with MLlib and build a pipeline to reproduce itBuild a data system to combine the power of Spark with deep learningGet a step-by-step example of working with distributed TensorFlowUse PyTorch to scale machine learning and its internal architecture

GENRE
Computing & Internet
RELEASED
2023
7 March
LANGUAGE
EN
English
LENGTH
294
Pages
PUBLISHER
O'Reilly Media
SELLER
O Reilly Media, Inc.
SIZE
9.1
MB
Applied Natural Language Processing in the Enterprise Applied Natural Language Processing in the Enterprise
2021
Big-Data Analytics for Cloud, IoT and Cognitive Computing Big-Data Analytics for Cloud, IoT and Cognitive Computing
2017
Big Data Big Data
2016
Practicing Trustworthy Machine Learning Practicing Trustworthy Machine Learning
2023
Big Data Analytics Big Data Analytics
2017
The Data Bonanza The Data Bonanza
2013