Parallel R Parallel R

Parallel R

Data Analysis in the Distributed World

    • ‏19٫99 US$
    • ‏19٫99 US$

وصف الناشر

It’s tough to argue with R as a high-quality, cross-platform, open source statistical software product—unless you’re in the business of crunching Big Data. This concise book introduces you to several strategies for using R to analyze large datasets, including three chapters on using R and Hadoop together. You’ll learn the basics of Snow, Multicore, Parallel, Segue, RHIPE, and Hadoop Streaming, including how to find them, how to use them, when they work well, and when they don’t.

With these packages, you can overcome R’s single-threaded nature by spreading work across multiple CPUs, or offloading work to multiple machines to address R’s memory barrier.
Snow: works well in a traditional cluster environmentMulticore: popular for multiprocessor and multicore computersParallel: part of the upcoming R 2.14.0 releaseR+Hadoop: provides low-level access to a popular form of cluster computingRHIPE: uses Hadoop’s power with R’s language and interactive shellSegue: lets you use Elastic MapReduce as a backend for lapply-style operations

النوع
كمبيوتر وإنترنت
تاريخ النشر
٢٠١١
٢١ أكتوبر
اللغة
EN
الإنجليزية
عدد الصفحات
١٢٦
الناشر
O'Reilly Media
البائع
O Reilly Media, Inc.
الحجم
٤٫٣
‫م.ب.‬
Parallel Processing: Questions and Answers Parallel Processing: Questions and Answers
٢٠١٨
Parallel and Concurrent Programming in Haskell Parallel and Concurrent Programming in Haskell
٢٠١٣
Practical Concurrent Haskell Practical Concurrent Haskell
٢٠١٧
Multicore and GPU Programming Multicore and GPU Programming
٢٠٢٢
Rust High Performance Rust High Performance
٢٠١٨
R Programming: Questions and Answers R Programming: Questions and Answers
٢٠١٨
Business Models for the Data Economy Business Models for the Data Economy
٢٠١٣
Managing RPM-Based Systems with Kickstart and Yum Managing RPM-Based Systems with Kickstart and Yum
٢٠٠٧
Bad Data Handbook Bad Data Handbook
٢٠١٢