Machine Learning with PySpark Machine Learning with PySpark

Machine Learning with PySpark

With Natural Language Processing and Recommender Systems

    • US$24.99
    • US$24.99

출판사 설명

Build machine learning models, natural language processing applications, and recommender systems with PySpark to solve various business challenges. This book starts with the fundamentals of Spark and its evolution and then covers the entire spectrum of traditional machine learning algorithms along with natural language processing and recommender systems using PySpark. 
Machine Learning with PySpark shows you how to build supervised machine learning models such as linear regression, logistic regression, decision trees, and random forest. You’ll also see unsupervised machine learning models such as K-means and hierarchical clustering. A major portion of the book focuses on feature engineering to create useful features with PySpark to train the machine learning models. The natural language processing section covers text processing, text mining, and embedding for classification. 
After reading this book,you will understand how to use PySpark’s machine learning library to build and train various machine learning models. Additionally you’ll become comfortable with related PySpark components, such as data ingestion, data processing, and data analysis, that you can use to develop data-driven intelligent applications.
You will:Build a spectrum of supervised and unsupervised machine learning algorithms
Implement machine learning algorithms with Spark MLlib libraries
Develop a recommender system with Spark MLlib libraries
Handle issues related to feature engineering, class balance, bias and variance, and cross validation for building an optimal fit model

장르
컴퓨터 및 인터넷
출시일
2018년
12월 14일
언어
EN
영어
길이
241
페이지
출판사
Apress
판매자
Springer Nature B.V.
크기
10.2
MB
Data Science for Marketing Analytics Data Science for Marketing Analytics
2021년
Advanced Analytics with PySpark Advanced Analytics with PySpark
2022년
Python Data Analysis Python Data Analysis
2021년
Data Science and Big Data Analytics Data Science and Big Data Analytics
2015년
Hands-On Machine Learning with Microsoft Excel 2019 Hands-On Machine Learning with Microsoft Excel 2019
2019년
Hands-On Data Analysis with Pandas Hands-On Data Analysis with Pandas
2021년
Learn TensorFlow 2.0 Learn TensorFlow 2.0
2019년
Deploy Machine Learning Models to Production Deploy Machine Learning Models to Production
2020년
Learn PySpark Learn PySpark
2019년
Machine Learning with PySpark Machine Learning with PySpark
2021년