Data Wrangling with R
Use R

    • €72.99

Publisher Description

This guide for practicing statisticians, data scientists, and R users and programmers teaches the essentials of data preprocessing: using the R programming language to turn noisy data into usable information quickly and easily. Data wrangling, also commonly called data munging, transformation, manipulation, or janitor work, can be a painstakingly laborious process. Roughly 80% of data analysis time is spent cleaning and preparing data; because this work is a prerequisite to the rest of the data analysis workflow (visualization, analysis, reporting), it is essential to become fluent and efficient in data wrangling techniques.

This book guides the user through the data wrangling process with a step-by-step tutorial approach and provides a solid foundation for working with data in R. The author's goal is to teach the user how to wrangle data easily, leaving more time for understanding the content of the data. By the end of the book, the user will have learned:

    • How to work with different types of data such as numerics, characters, regular expressions, factors, and dates
    • The difference between different data structures and how to create, add components to, and subset each data structure
    • How to acquire and parse data from locations previously inaccessible
    • How to develop functions and use loop control structures to reduce code redundancy
    • How to use pipe operators to simplify code and make it more readable
    • How to reshape the layout of data and manipulate, summarize, and join data sets

In essence, the user will have the data wrangling toolbox required for modern-day data analysis.

Brad Boehmke, Ph.D., is an Operations Research Analyst at Headquarters Air Force Materiel Command, Studies and Analyses Division. He is also an Assistant Professor in the Operational Sciences Department at the Air Force Institute of Technology. Dr. Boehmke's research interests are in cost analysis, economic modeling, decision analysis, and developing applied modeling applications with the R statistical language.

GENRE
Computing & Internet
RELEASED
17 November 2016
LANGUAGE
English (EN)
LENGTH
250 pages
PUBLISHER
Springer International Publishing
PROVIDER INFO
Springer Science & Business Media LLC
SIZE
2.6 MB
Advanced R
2016
R for Stata Users
2010
Introduction to Data Systems
2020
Advanced R 4 Data Programming and the Cloud
2020
A Data Scientist's Guide to Acquiring, Cleaning, and Managing Data in R
2017
R Cookbook
2019
Retirement Income Recipes in R
2020
Bayesian Cost-Effectiveness Analysis with the R package BCEA
2025
Cultural Analytics in R: A Tidy Approach
2025
An Introduction to Web Mining
2025
Heart Rate Variability Analysis with the R package RHRV
2024
Audit Analytics
2024