pandas pivot Explained | 极客教程

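When working with data, we often need to reshape it so that it is easier to analyze and visualize. pandas is a powerful data processing library that offers a rich set of functions and methods for reshaping data. Among them, the pivot function is one of the most important reshaping tools.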

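1. What is the pivot function

pivot is a method of the DataFrame class that converts long-format data into wide format. It uses the unique values of one column as the new row index, the unique values of another column as the new column labels, and fills the cells from the chosen value column(s), which makes the data easier to compare and read. pivot only rearranges data and does not aggregate: every combination of index and column value must appear at most once, otherwise it raises a ValueError.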

2. Basic syntax of the pivot function

The basic syntax of pivot is:

DataFrame.pivot(index=None, columns=None, values=None)

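Parameters:

index: the column (or columns) whose values become the row index of the new table. If omitted, the existing index is used.
columns: the column (or columns) whose values become the column labels of the new table.
values: the column (or columns) whose values fill the cells of the new table. If omitted, all remaining columns are used and the result has hierarchical (MultiIndex) columns.

Note that in pandas 2.0 and later these arguments must be passed by keyword, and columns is required.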
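To make the effect of the values parameter concrete, here is a minimal sketch. The sales table and its column names are hypothetical, invented only for illustration; the calls themselves are the pivot method described above.

```python
import pandas as pd

# Hypothetical long-format data, invented for illustration
sales = pd.DataFrame({
    "month": ["Jan", "Jan", "Feb", "Feb"],
    "product": ["x", "y", "x", "y"],
    "units": [10, 20, 30, 40],
    "revenue": [100.0, 400.0, 300.0, 800.0],
})

# A single values column gives flat column labels (one per product)
units_wide = sales.pivot(index="month", columns="product", values="units")

# Omitting values pivots every remaining column;
# the resulting columns form a MultiIndex of (measure, product) pairs
all_wide = sales.pivot(index="month", columns="product")

print(units_wide)
print(all_wide.columns.tolist())
```

Passing a single column name to values is usually the simplest choice when only one measure is needed, since it avoids the extra column level.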

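3. When to use the pivot function

pivot is typically a good fit in the following cases:

When data needs to be converted from long format to wide format
When the values of one column should become column headers for side-by-side comparison, and each index/column combination occurs only once (if duplicates must be aggregated, pivot_table is the appropriate tool)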
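The following sketch shows where the boundary of these use cases lies. The readings table is hypothetical and deliberately contains a repeated (date, city) pair; it assumes that averaging is an acceptable way to combine the duplicates, which may not hold for real data.

```python
import pandas as pd

# Hypothetical readings with a repeated (date, city) pair: city A appears twice
readings = pd.DataFrame({
    "date": ["2021-01-01", "2021-01-01", "2021-01-01"],
    "city": ["A", "A", "B"],
    "temperature": [30, 34, 28],
})

pivot_error = None
try:
    # pivot cannot decide which of the two values for (2021-01-01, A) to keep
    readings.pivot(index="date", columns="city", values="temperature")
except ValueError as exc:
    pivot_error = str(exc)
    print("pivot failed:", pivot_error)

# pivot_table resolves duplicates by aggregating them (here: the mean)
averaged = readings.pivot_table(index="date", columns="city",
                                values="temperature", aggfunc="mean")
print(averaged)
```

If a ValueError like this appears unexpectedly, it is worth checking the data for accidental duplicates before switching to pivot_table, because aggregation can hide data quality problems.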

4. A concrete example of the pivot function

Next, we demonstrate how pivot is used with a concrete example. Suppose we have the following dataset:

import pandas as pd

# Long format: one row per (date, city) observation
data = {
    'date': ['2021-01-01', '2021-01-01', '2021-01-02', '2021-01-02'],
    'city': ['A', 'B', 'A', 'B'],
    'temperature': [30, 28, 32, 29],
    'humidity': [60, 65, 55, 70]
}

df = pd.DataFrame(data)
print(df)

Output:

         date city  temperature  humidity
0  2021-01-01    A          30        60
1  2021-01-01    B          28        65
2  2021-01-02    A          32        55
3  2021-01-02    B          29        70

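Next, we use pivot to convert the data above from long format to wide format. Because values is given a list of two columns, the result has two-level columns, with the measure (temperature or humidity) on top and the city below: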

pivot_df = df.pivot(index='date', columns='city', values=['temperature', 'humidity'])
print(pivot_df)

Output:

           temperature       humidity     
city               A   B          A   B
date                                    
2021-01-01         30  28         60  65
2021-01-02         32  29         55  70
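Once the data is in wide format, the two-level columns can be addressed by level. The sketch below rebuilds the same pivot_df and shows a few common ways to read from it; the variable names temps, humidity_b, and long_temps are chosen here for illustration, and the melt step is just one possible way to return to long format.

```python
import pandas as pd

df = pd.DataFrame({
    'date': ['2021-01-01', '2021-01-01', '2021-01-02', '2021-01-02'],
    'city': ['A', 'B', 'A', 'B'],
    'temperature': [30, 28, 32, 29],
    'humidity': [60, 65, 55, 70],
})
pivot_df = df.pivot(index='date', columns='city',
                    values=['temperature', 'humidity'])

# Selecting a top-level label returns a sub-table with one column per city
temps = pivot_df['temperature']

# A tuple addresses a single (measure, city) column
humidity_b = pivot_df[('humidity', 'B')]

# melt reverses the reshaping for one measure, giving long format again
long_temps = temps.reset_index().melt(id_vars='date', var_name='city',
                                      value_name='temperature')

print(temps)
print(humidity_b)
print(long_temps)
```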