Introduction to Data Science Introduction to Data Science
Chapman & Hall/CRC Data Science Series

Introduction to Data Science

Data Analysis and Prediction Algorithms with R

    • $114.99
    • $114.99

Publisher Description

Introduction to Data Science: Data Analysis and Prediction Algorithms with R introduces concepts and skills that can help you tackle real-world data analysis challenges. It covers concepts from probability, statistical inference, linear regression, and machine learning. It also helps you develop skills such as R programming, data wrangling, data visualization, predictive algorithm building, file organization with UNIX/Linux shell, version control with Git and GitHub, and reproducible document preparation.

This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful. The book is divided into six parts: R, data visualization, statistics with R, data wrangling, machine learning, and productivity tools. Each part has several chapters meant to be presented as one lecture.

The author uses motivating case studies that realistically mimic a data scientist’s experience. He starts by asking specific questions and answers these through data analysis so concepts are learned as a means to answering the questions. Examples of the case studies included are: US murder rates by state, self-reported student heights, trends in world health and economics, the impact of vaccines on infectious disease rates, the financial crisis of 2007-2008, election forecasting, building a baseball team, image processing of hand-written digits, and movie recommendation systems.

The statistical concepts used to answer the case study questions are only briefly introduced, so complementing with a probability and statistics textbook is highly recommended for in-depth understanding of these concepts. If you read and understand the chapters and complete the exercises, you will be prepared to learn the more advanced concepts and skills needed to become an expert.

A complete solutions manual is available to registered instructors who require the text for a course.

GENRE
Science & Nature
RELEASED
2019
November 12
LANGUAGE
EN
English
LENGTH
743
Pages
PUBLISHER
CRC Press
SELLER
Taylor & Francis Group
SIZE
32.2
MB

More Books Like This

Statistical Data Analytics Statistical Data Analytics
2015
Data Science for Mathematicians Data Science for Mathematicians
2020
Statistical Analysis and Data Display Statistical Analysis and Data Display
2015
R For Statistics: Questions and Answers R For Statistics: Questions and Answers
2018
Introduction to Probability and Statistics for Ecosystem Managers Introduction to Probability and Statistics for Ecosystem Managers
2013
COMPSTAT 2006 - Proceedings in Computational Statistics COMPSTAT 2006 - Proceedings in Computational Statistics
2007

More Books by Rafael A. Irizarry

Other Books in This Series

Basketball Data Science Basketball Data Science
2020
Feature Engineering and Selection Feature Engineering and Selection
2019
Data Science Data Science
2022
Tree-Based Methods for Statistical Learning in R Tree-Based Methods for Statistical Learning in R
2022
Massive Graph Analytics Massive Graph Analytics
2022
Supervised Machine Learning for Text Analysis in R Supervised Machine Learning for Text Analysis in R
2021