Learning Spark Learning Spark

Learning Spark

Jules S. Damji 및 다른 저자
    • US$59.99
    • US$59.99

출판사 설명

Data is bigger, arrives faster, and comes in a variety of formatsâ??and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark.

Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, youâ??ll be able to:
Learn Python, SQL, Scala, or Java high-level Structured APIsUnderstand Spark operations and SQL EngineInspect, tune, and debug Spark operations with Spark configurations and Spark UIConnect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or KafkaPerform analytics on batch and streaming data using Structured StreamingBuild reliable data pipelines with open source Delta Lake and SparkDevelop machine learning pipelines with MLlib and productionize models using MLflow

장르
컴퓨터 및 인터넷
출시일
2020년
7월 16일
언어
EN
영어
길이
400
페이지
출판사
O'Reilly Media
판매자
O Reilly Media, Inc.
크기
17.3
MB
Spark: The Definitive Guide Spark: The Definitive Guide
2018년
High Performance Spark High Performance Spark
2017년
Pentaho Kettle Solutions Pentaho Kettle Solutions
2010년
Data Pipelines Pocket Reference Data Pipelines Pocket Reference
2021년
IBM InfoSphere Streams: Accelerating Deployments with Analytic Accelerators IBM InfoSphere Streams: Accelerating Deployments with Analytic Accelerators
2014년
Hadoop: The Definitive Guide Hadoop: The Definitive Guide
2015년
Oyster Bay & Other Short Stories Oyster Bay & Other Short Stories
2006년
Spark. Błyskawiczna analiza danych. Wydanie II Spark. Błyskawiczna analiza danych. Wydanie II
2023년