Foundations of Statistics for Data Scientists Foundations of Statistics for Data Scientists
Chapman & Hall/CRC Texts in Statistical Science

Foundations of Statistics for Data Scientists

With R and Python

    • ¥16,800
    • ¥16,800

Publisher Description

Foundations of Statistics for Data Scientists: With R and Python is designed as a textbook for a one- or two-term introduction to mathematical statistics for students training to become data scientists. It is an in-depth presentation of the topics in statistical science with which any data scientist should be familiar, including probability distributions, descriptive and inferential statistical methods, and linear modeling. The book assumes knowledge of basic calculus, so the presentation can focus on "why it works" as well as "how to do it." Compared to traditional "mathematical statistics" textbooks, however, the book has less emphasis on probability theory and more emphasis on using software to implement statistical methods and to conduct simulations to illustrate key concepts. All statistical analyses in the book use R software, with an appendix showing the same analyses with Python.

Key Features:
Shows the elements of statistical science that are important for students who plan to become data scientists. Includes Bayesian and regularized fitting of models (e.g., showing an example using the lasso), classification and clustering, and implementing methods with modern software (R and Python). Contains nearly 500 exercises.
The book also introduces modern topics that do not normally appear in mathematical statistics texts but are highly relevant for data scientists, such as Bayesian inference, generalized linear models for non-normal responses (e.g., logistic regression and Poisson loglinear models), and regularized model fitting. The nearly 500 exercises are grouped into "Data Analysis and Applications" and "Methods and Concepts." Appendices introduce R and Python and contain solutions for odd-numbered exercises. The book's website (http://stat4ds.rwth-aachen.de/) has expanded R, Python, and Matlab appendices and all data sets from the examples and exercises.

GENRE
Business & Personal Finance
RELEASED
2021
November 29
LANGUAGE
EN
English
LENGTH
486
Pages
PUBLISHER
CRC Press
SELLER
Taylor & Francis Group
SIZE
48.4
MB
A User's Guide to Business Analytics A User's Guide to Business Analytics
2016
Understanding Regression Analysis Understanding Regression Analysis
2020
Business Statistics:using R Business Statistics:using R
2019
Essential Econometric Techniques Essential Econometric Techniques
2022
Developing Econometrics Developing Econometrics
2011
Introductory Regression Analysis Introductory Regression Analysis
2013
An Introduction to Categorical Data Analysis An Introduction to Categorical Data Analysis
2018
Foundations of Linear and Generalized Linear Models Foundations of Linear and Generalized Linear Models
2015
Categorical Data Analysis Categorical Data Analysis
2013
Analysis of Ordinal Categorical Data Analysis of Ordinal Categorical Data
2012
Randomization, Bootstrap and Monte Carlo Methods in Biology Randomization, Bootstrap and Monte Carlo Methods in Biology
2020
Statistics in Survey Sampling Statistics in Survey Sampling
2025
Exercises and Solutions in Probability and Statistics Exercises and Solutions in Probability and Statistics
2025
Stationary Stochastic Processes Stationary Stochastic Processes
2012
Exercises in Statistical Reasoning Exercises in Statistical Reasoning
2025
Linear Models with R Linear Models with R
2025