Big Data for Chimps Big Data for Chimps

Big Data for Chimps

A Guide to Massive-Scale Data Processing in Practice

    • $49.99
    • $49.99

Publisher Description

Finding patterns in massive event streams can be difficult, but learning how to find them doesn’t have to be. This unique hands-on guide shows you how to solve this and many other problems in large-scale data processing with simple, fun, and elegant tools that leverage Apache Hadoop. You’ll gain a practical, actionable view of big data by working with real data and real problems.

Perfect for beginners, this book’s approach will also appeal to experienced practitioners who want to brush up on their skills. Part I explains how Hadoop and MapReduce work, while Part II covers many analytic patterns you can use to process any data. As you work through several exercises, you’ll also learn how to use Apache Pig to process data.
Learn the necessary mechanics of working with Hadoop, including how data and computation move around the clusterDive into map/reduce mechanics and build your first map/reduce job in PythonUnderstand how to run chains of map/reduce jobs in the form of Pig scriptsUse a real-world dataset—baseball performance statistics—throughout the bookWork with examples of several analytic patterns, and learn when and where you might use them

GENRE
Computing & Internet
RELEASED
2015
28 September
LANGUAGE
EN
English
LENGTH
220
Pages
PUBLISHER
O'Reilly Media
SELLER
O Reilly Media, Inc.
SIZE
5.5
MB

More Books Like This

Joe Celko's Analytics and OLAP in SQL Joe Celko's Analytics and OLAP in SQL
2010
Mastering Large Datasets with Python Mastering Large Datasets with Python
2020
MapReduce Design Patterns MapReduce Design Patterns
2012
Joe Celko’s Complete Guide to NoSQL Joe Celko’s Complete Guide to NoSQL
2013
Joe Celko's Thinking in Sets: Auxiliary, Temporal, and Virtual Tables in SQL Joe Celko's Thinking in Sets: Auxiliary, Temporal, and Virtual Tables in SQL
2008
Elasticsearch: The Definitive Guide Elasticsearch: The Definitive Guide
2015