Pandas 如何设置 Pandas 中的第一列和第一行作为索引

在本文中，我们将介绍如何使用 Pandas 将数据集中的第一列和第一行作为索引来提高数据处理的效率。

什么是索引？

在 Pandas 中，索引是一种特殊的数据结构，它能够帮助我们更快地根据标签或位置访问数据。在数据处理过程中，对于一些复杂的数据集，通常我们需要将某一列或某一行的数据作为索引来便于查找和分析。

下面，我们将通过一个经典的数据集来演示如何设置第一列和第一行作为索引。

import pandas as pd

# 载入数据集
data = pd.read_csv('data.csv')
print(data.head())

#   id      name   score1  score2  score3
# 0  1   Michael       70      88      85
# 1  2     Peter       85      79      90
# 2  3      John       60      75      85
# 3  4      Mary       90      98      95
# 4  5  Williams       70      80      75

如何设置第一列和第一行作为索引？

接下来，通过下列代码，我们将第一列数据设置为索引：

# 使用第一列数据作为索引
data = pd.read_csv('data.csv', index_col=0)
print(data.head())

#          name   score1  score2  score3
# id                                    
# 1     Michael       70      88      85
# 2       Peter       85      79      90
# 3        John       60      75      85
# 4        Mary       90      98      95
# 5    Williams       70      80      75

可以看到，此时表格已经通过第一列数据构建出来了新的索引。

接下来，我们尝试将第一行数据设置为索引：

# 使用第一行数据作为索引
data = pd.read_csv('data.csv', index_col=1, header=0)
print(data.head())

#           score1  score2  score3
# name                           
# Michael       70      88      85
# Peter         85      79      90
# John          60      75      85
# Mary          90      98      95
# Williams      70      80      75