HOME
eBook
- eBook
- 오디오(북)
- 동영상
IT/프로그래밍
- 경제경영
- 자기계발
- 시/에세이
- 인문
- 종교
- 소설
- 국어/외국어
- 정치/사회
- 역사/문화
- 과학/공학
- IT/프로그래밍
- 건강/의학
- 가정/생활/요리
- 여행/취미
- 예술/대중문화
- 유아
- 아동
- 청소년
- 교재/수험서
- 외국도서
- 매거진
- 대학교재
- 로맨스
- 로맨스판타지
- BL
- GL
- 판타지
- 무협
- 라이트노벨
- 추리
- 미스터리
- 스릴러
- 섹슈얼로맨스
- 단행본만화
- 웹툰
- 웹소설
데이터베이스/아키텍처
- IT일반/교양
- 컴퓨터입문/활용
- 컴퓨터수험서
- 컴퓨터공학
- 데이터베이스/아키텍처
- OS/네트워크
- 코딩/프로그래밍/언어
- OA (사무 보조 프로그램)
- 웹사이트/홈페이지/블로그
- 그래픽/디자인
- 영상/미디어
- 게임
- AI/AR/VR
- 기타

Pandas 1.x Cookbook Second Edition

Practical recipes for scientific computing, time series analysis, and exploratory data analysis using Python

Matt Harrison , Theodore Petrou 지음

Packt(GCO Science)

2020년 02월 27일 출간

(개의 리뷰)

( 0% 의 구매자)

eBook 상품 정보

파일 정보 PDF (5.16MB)

ISBN 9781839218910

지원기기 교보eBook App, PC e서재, 리더기, 웹뷰어

교보eBook App 듣기(TTS) 불가능

TTS 란?

텍스트를 음성으로 읽어주는 기술입니다.

전자책의 편집 상태에 따라 본문의 흐름과 다르게 텍스트를 읽을 수 있습니다.

이미지 형태로 제작된 전자책 (예 : ZIP 파일)은 TTS 기능을 지원하지 않습니다.

PDF 필기가능 (Android, iOS)

소득공제

소장

정가 : 23,000원

쿠폰적용가 20,700원

10% 할인 | 5%P 적립

이 상품은 배송되지 않는 디지털 상품이며,
교보eBook앱이나 웹뷰어에서 바로 이용가능합니다.

카드&결제 혜택

5만원 이상 구매 시 추가 2,000P
3만원 이상 구매 시, 등급별 2~4% 추가 최대 416P
리뷰 작성 시, e교환권 추가 최대 200원

상품정보
리뷰 (0)
이용안내

작품소개

이 상품이 속한 분야

Use the power of pandas to solve most complex scientific computing problems with ease. Revised for pandas 1.x.

▶Book Description
The pandas library is massive, and it's common for frequent users to be unaware of many of its more impressive features. The official pandas documentation, while thorough, does not contain many useful examples of how to piece together multiple commands as one would do during an actual analysis. This book guides you, as if you were looking over the shoulder of an expert, through situations that you are highly likely to encounter.

This new updated and revised edition provides you with unique, idiomatic, and fun recipes for both fundamental and advanced data manipulation tasks with pandas. Some recipes focus on achieving a deeper understanding of basic principles, or comparing and contrasting two similar operations. Other recipes will dive deep into a particular dataset, uncovering new and unexpected insights along the way. Many advanced recipes combine several different features across the pandas library to generate results.

▶What You Will Learn
-Master data exploration in pandas through dozens of practice problems
-Group, aggregate, transform, reshape, and filter data
-Merge data from different sources through pandas SQL-like operations
-Create visualizations via pandas hooks to matplotlib and seaborn
-Use pandas, time series functionality to perform powerful analyses
-Import, clean, and prepare real-world datasets for machine learning
-Create workflows for processing big data that doesn't fit in memory

▶Key Features
-This is the first book on pandas 1.x
-Practical, easy to implement recipes for quick solutions to common problems in data using pandas
-Master the fundamentals of pandas to quickly begin exploring any dataset

▶Who This Book Is For
This book is for Python developers, data scientists, engineers, and analysts. Pandas is the ideal tool for manipulating structured data with Python and this book provides ample instruction and examples. Not only does it cover the basics required to be proficient, but it goes into the details of idiomatic pandas.

▶TABLE of CONTENTS
-Chapter 1: Pandas Foundations
-Chapter 2: Essential DataFrame Operations
-Chapter 3: Creating and Persisting DataFrames
-Chapter 4: Beginning Data Analysis
-Chapter 5: Exploratory Data Analysis
-Chapter 6: Selecting Subsets of Data
-Chapter 7: Filtering Rows
-Chapter 8: Index Alignment
-Chapter 9: Grouping for Aggregation, Filtration, and Transformation
-Chapter 10: Restructuring Data into a Tidy Form
-Chapter 11: Combining Pandas Objects
-Chapter 12: Time Series Analysis
-Chapter 13: Visualization with Matplotlib, Pandas, and Seaborn
-Chapter 14: Debugging and Testing Pandas

▶What this book covers
- Chapter 1, Pandas Foundations, covers the anatomy and vocabulary used to identify the components of the two main pandas data structures, the Series and the DataFrame. Each column must have exactly one type of data, and each of these data types is covered. You will learn how to unleash the power of the Series and the DataFrame by calling and chaining together their methods.

- Chapter 2, Essential DataFrame Operations, focuses on the most crucial and typical operations that you will perform during data analysis.

- Chapter 3, Creating and Persisting DataFrames, discusses the various ways to ingest data and create DataFrames.

- Chapter 4, Beginning Data Analysis, helps you develop a routine to get started after reading in your data.

- Chapter 5, Exploratory Data Analysis, covers basic analysis techniques for comparing numeric and categorical data. This chapter will also demonstrate common visualization techniques.

- Chapter 6, Selecting Subsets of Data, covers the many varied and potentially confusing ways of selecting different subsets of data.

- Chapter 7, Filtering Rows, covers the process of querying your data to select subsets of it based on Boolean conditions.

- Chapter 8, Index Alignment, targets the very important and often misunderstood index object. Misuse of the Index is responsible for lots of erroneous results, and these recipes show you how to use it correctly to deliver powerful results.

- Chapter 9, Grouping for Aggregation, Filtration, and Transformation, covers the powerful grouping capabilities that are almost always necessary during data analysis. You will build customized functions to apply to your groups.

- Chapter 10, Restructuring Data into a Tidy Form, explains what tidy data is and why it's so important, and then it shows you how to transform many different forms of messy datasets into tidy ones.

- Chapter 11, Combining Pandas Objects, covers the many available methods to combine DataFrames and Series vertically or horizontally. We will also do some web-scraping and connect to a SQL relational database.

- Chapter 12, Time Series Analysis, covers advanced and powerful time series capabilities to dissect by any dimension of time possible.

- Chapter 13, Visualization with Matplotlib, Pandas, and Seaborn, introduces the matplotlib library, which is responsible for all of the plotting in pandas. We will then shift focus to the pandas plot method and, finally, to the seaborn library, which is capable of producing aesthetically pleasing visualizations not directly available in pandas.

- Chapter 14, Debugging and Testing Pandas, explores mechanisms of testing our DataFrames and pandas code. If you are planning on deploying pandas in production, this chapter will help you have confidence in your code.

▶ Preface
pandas is a library for creating and manipulating structured data with Python. What do I mean by structured? I mean tabular data in rows and columns like what you would find in a spreadsheet or database. Data scientists, analysts, programmers, engineers, and more are leveraging it to mold their data.

pandas is limited to "small data" (data that can fit in memory on a single machine). However, the syntax and operations have been adopted or inspired other projects: PySpark, Dask, Modin, cuDF, Baloo, Dexplo, Tabel, StaticFrame, among others. These projects have different goals, but some of them will scale out to big data. So there is a value in understanding how pandas works as the features are becoming the defacto API for interacting with structured data.

I, Matt Harrison, run a company, MetaSnake, that does corporate training. My bread and butter is training large companies that want to level up on Python and data skills. As such, I've taught thousands of Python and pandas users over the years. My goal in producing the second version of this book is to highlight and help with the aspects that many find confusing when coming to pandas. For all of its benefits, there are some rough edges or confusing aspects of pandas. I intend to navigate you to these and then guide you through them, so you will be able to deal with them in the real world.

If your company is interested in such live training, feel free to reach out (matt@metasnake.com).

인물정보

저자(글) Matt Harrison

인물정보

컴퓨터공학자

Matt Harrison has been using Python since 2000. He runs MetaSnake, which provides corporate training for Python and Data Science. He is the author of Machine Learning Pocket Reference, the bestselling Illustrated Guide to Python 3, and Learning the Pandas Library, among other books.

Pandas 1.x Cookbook Second Edition

저자(글) Theodore Petrou

Theodore Petrou is the founder of Dunder Data, a training company dedicated to helping teach the Python data science ecosystem effectively to individuals and corporations. Read his tutorials and attempt his data science challenges at the Dunder Data website.

이 상품의 총서

전체선택

Klover리뷰 (0)

구매 후 리뷰 작성 시, e교환권 100원 적립

문장수집

구매 후 문장수집 작성 시, e교환권 100원 적립

소장 23,000 원