Practical Hadoop Ecosystem

A Definitive Guide to Hadoop-Related Frameworks and Tools

    • €46.99

Publisher Description

This book is a practical guide to using the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications, each chapter is a practical tutorial on using an Apache Hadoop ecosystem project. While several books on Apache Hadoop are available, most cover only the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects or how they all work together as a cohesive big data development platform.
What you'll learn
How to set up an environment in Linux for Hadoop projects using the Cloudera Hadoop Distribution CDH 5
How to run a MapReduce job
How to store data with Apache Hive and Apache HBase
How to index data in HDFS with Apache Solr
How to develop a Kafka messaging system
How to develop a Mahout user recommender system
How to stream logs to HDFS with Apache Flume
How to transfer data from a MySQL database to Hive, HDFS, and HBase with Sqoop
How to create a Hive table over Apache Solr

Genre: Computing & Internet
Released: 30 September 2016
Language: English (EN)
Length: 441 pages
Publisher: Apress
Provider info: Springer Science & Business Media LLC
Size: 14.9 MB

Hadoop in Action (2010)
HBase in Action (2012)
Pro Spark Streaming (2016)
Apache HBase Primer (2016)
Practical MongoDB (2015)
Data-intensive Systems (2019)
Docker Management Design Patterns (2017)
Kubernetes Management Design Patterns (2017)
Kubernetes Microservices with Docker (2016)
Pro Docker (2015)
Pro MongoDB Development (2015)