Large-Scale Graph Processing Using Apache Giraph

Sherif Sakr and other authors
    • US$44.99

Publisher Description

This book takes its reader on a journey through Apache Giraph, a popular distributed graph processing platform designed to bring the power of big data processing to graph data. Designed as a step-by-step self-study guide for everyone interested in large-scale graph processing, it describes the fundamental abstractions of the system, its programming models and various techniques for using the system to process graph data at scale, including the implementation of several popular and advanced graph analytics algorithms.

The book is organized as follows. Chapter 1 provides general background on the big data phenomenon and an introduction to the Apache Giraph system, its abstraction, programming model and design architecture. Chapter 2 focuses on Giraph as a platform and how to use it; working from a sample job, it also explains more advanced topics such as monitoring the Giraph application lifecycle and different methods for monitoring Giraph jobs. Chapter 3 introduces Giraph programming, presents the basic Giraph graph model and explains how to write Giraph programs. Chapter 4 discusses in detail the implementation of several popular graph algorithms, including PageRank, connected components, shortest paths and triangle closing. Chapter 5 turns to advanced Giraph programming: common algorithmic optimizations, tunable configurations that determine the system’s utilization of the underlying resources, and how to write custom graph input and output formats. Lastly, Chapter 6 highlights two other systems introduced to tackle the challenge of large-scale graph processing, GraphX and GraphLab, and explains the main commonalities and differences between these systems and Apache Giraph.

This book serves as an essential reference guide for students, researchers and practitioners in the domain of large-scale graph processing. It offers step-by-step guidance, with several code examples and the complete source code available in the related GitHub repository. Students will find a comprehensive introduction to, and hands-on practice with, tackling large-scale graph processing problems using the Apache Giraph system, while researchers will discover thorough coverage of emerging and ongoing advancements in big graph processing systems.

Genre
Computers & Internet
Released
January 5, 2017
Language
English (EN)
Length
222 pages
Publisher
Springer International Publishing
Seller
Springer Nature B.V.
Size
6.4 MB

Spark GraphX in Action (2016)
Graph Data Science with Neo4j (2023)
Hadoop in Action (2010)
Web Application Development with Streamlit (2022)
Rapid Mashup Development Tools (2017)
Spark in Action (2016)
Transactions on Large-Scale Data- and Knowledge-Centered Systems XX (2015)
Transactions on Large-Scale Data- and Knowledge-Centered Systems XXXV (2017)
Handbook of Big Data Technologies (2017)
Process Analytics (2016)
Cloud Data Management (2014)