Practical Hadoop Ecosystem Practical Hadoop Ecosystem

Practical Hadoop Ecosystem

A Definitive Guide to Hadoop-Related Frameworks and Tools

    • $64.99
    • $64.99

Publisher Description

This book is a practical guide on using the Apache Hadoop projects including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout and Apache Solr. From setting up the environment to running sample applications each chapter is a practical tutorial on using a Apache Hadoop ecosystem project. While several books on Apache Hadoop are available, most are based on the main projects MapReduce and HDFS and none discusses the other Apache Hadoop ecosystem projects and how these all work together as a cohesive big data development platform.
What you'll learnHow to set up environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5. 
How to run a MapReduce job
How to store data with Apache Hive, Apache HBase
How to index data in HDFS with Apache Solr
How to develop a Kafka messaging system
How to develop a Mahout User Recommender System
How to stream Logs to HDFS with Apache Flume
How to transfer data from MySQL database to Hive, HDFS and HBase with Sqoop
How create a Hive table over Apache Solr

GENRE
Computing & Internet
RELEASED
2016
30 September
LANGUAGE
EN
English
LENGTH
441
Pages
PUBLISHER
Apress
SELLER
Springer Nature B.V.
SIZE
14.9
MB

More Books by Deepak Vohra

Kubernetes Microservices with Docker Kubernetes Microservices with Docker
2016
Kubernetes Management Design Patterns Kubernetes Management Design Patterns
2017
Amazon Fargate Quick Start Guide Amazon Fargate Quick Start Guide
2018
JDBC 4.0 and Oracle JDeveloper for J2EE Development JDBC 4.0 and Oracle JDeveloper for J2EE Development
2008
EJB 3.0 Database Persistence with Oracle Fusion Middleware 11g EJB 3.0 Database Persistence with Oracle Fusion Middleware 11g
2010
Docker Management Design Patterns Docker Management Design Patterns
2017