Thinking in Pandas Thinking in Pandas

Thinking in Pandas

How to Use the Python Data Analysis Library the Right Way

    • €36.99
    • €36.99

Publisher Description

Understand and implement big data analysis solutions in pandas with an emphasis on performance. This book strengthens your intuition for working with pandas, the Python data analysis library, by exploring its underlying implementation and data structures.

Thinking in Pandas introduces the topic of big data and demonstrates concepts by looking at exciting and impactful projects that pandas helped to solve. From there, you will learn to assess your own projects by size and type to see if pandas is the appropriate library for your needs. Author Hannah Stepanek explains how to load and normalize data in pandas efficiently, and reviews some of the most commonly used loaders and several of their most powerful options. You will then learn how to access and transform data efficiently, what methods to avoid, and when to employ more advanced performance techniques. You will also go over basic data access and munging in pandas and the intuitive dictionary syntax. Choosing theright DataFrame format, working with multi-level DataFrames, and how pandas might be improved upon in the future are also covered.

By the end of the book, you will have a solid understanding of how the pandas library works under the hood. Get ready to make confident decisions in your own projects by utilizing pandas—the right way.

You will:

Understand the underlying data structure of pandas and why it performs the way it does under certain circumstancesDiscover how to use pandas to extract, transform, and load data correctly with an emphasis on performanceChoose the right DataFrame so that the data analysis is simple and efficient.Improve performance of pandas operations with other Python libraries

GENRE
Computing & Internet
RELEASED
2020
5 June
LANGUAGE
EN
English
LENGTH
197
Pages
PUBLISHER
Apress
PROVIDER INFO
Springer Science & Business Media LLC
SIZE
2.9
MB
Computer Science Distilled Computer Science Distilled
2017
Python High Performance - Second Edition Python High Performance - Second Edition
2017
NumPy: Questions and Answers (2020 Edition) NumPy: Questions and Answers (2020 Edition)
2019
NumPy: Questions and Answers NumPy: Questions and Answers
2018
OCR A2 Computing F453 Advanced Computing Theory: Revision Guide OCR A2 Computing F453 Advanced Computing Theory: Revision Guide
2013
Coding for Performance: Questions and Answers Coding for Performance: Questions and Answers
2018