NumPy数组中如何查找值的索引：全面指南|极客教程

NumPy数组中如何查找值的索引：全面指南

参考：How to find the Index of value in Numpy Array

NumPy是Python中用于科学计算的核心库之一，它提供了高性能的多维数组对象和用于处理这些数组的工具。在使用NumPy数组时，我们经常需要查找特定值的索引。本文将详细介绍如何在NumPy数组中查找值的索引，包括各种方法和技巧。

1. 使用numpy.where()函数

numpy.where()是一个强大的函数，可以用来查找满足特定条件的元素的索引。它可以应用于一维和多维数组。

1.1 在一维数组中查找单个值

import numpy as np

arr = np.array([1, 2, 3, 4, 5, 3, 7, 8, 9])
indices = np.where(arr == 3)
print("numpyarray.com: Indices of value 3:", indices[0])

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，我们创建了一个一维数组，然后使用np.where()查找值为3的所有索引。np.where()返回一个元组，其中包含满足条件的索引数组。我们通过indices[0]来获取这个索引数组。

1.2 在多维数组中查找值

import numpy as np

arr_2d = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
indices = np.where(arr_2d == 5)
print("numpyarray.com: Indices of value 5:", list(zip(indices[0], indices[1])))

Output:

NumPy数组中如何查找值的索引：全面指南

对于多维数组，np.where()返回的是每个维度上的索引。在这个例子中，我们使用zip()函数将行索引和列索引组合起来，以得到值5的完整位置。

2. 使用numpy.argwhere()函数

numpy.argwhere()函数类似于numpy.where()，但它直接返回满足条件的元素的索引数组。

import numpy as np

arr = np.array([1, 2, 3, 4, 5, 3, 7, 8, 9])
indices = np.argwhere(arr == 3)
print("numpyarray.com: Indices of value 3:", indices.flatten())

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，np.argwhere()返回一个2D数组，每行包含一个满足条件的元素的索引。我们使用flatten()方法将其转换为一维数组。

3. 使用numpy.nonzero()函数

numpy.nonzero()函数返回数组中非零元素的索引。我们可以结合布尔索引来使用它。

import numpy as np

arr = np.array([1, 2, 3, 4, 5, 3, 7, 8, 9])
indices = np.nonzero(arr == 3)[0]
print("numpyarray.com: Indices of value 3:", indices)

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，arr == 3创建了一个布尔数组，然后np.nonzero()返回True值的索引。

4. 使用列表推导式

对于简单的情况，我们也可以使用列表推导式来查找索引。

import numpy as np

arr = np.array([1, 2, 3, 4, 5, 3, 7, 8, 9])
indices = [i for i, x in enumerate(arr) if x == 3]
print("numpyarray.com: Indices of value 3:", indices)

Output:

NumPy数组中如何查找值的索引：全面指南

这种方法虽然简单直观，但对于大型数组可能不如NumPy的内置函数高效。

5. 查找最小值和最大值的索引

NumPy提供了专门用于查找最小值和最大值索引的函数。

5.1 使用numpy.argmin()

import numpy as np

arr = np.array([5, 2, 8, 1, 9, 3, 7])
min_index = np.argmin(arr)
print("numpyarray.com: Index of minimum value:", min_index)

Output:

NumPy数组中如何查找值的索引：全面指南

np.argmin()返回数组中最小值的索引。

5.2 使用numpy.argmax()

import numpy as np

arr = np.array([5, 2, 8, 1, 9, 3, 7])
max_index = np.argmax(arr)
print("numpyarray.com: Index of maximum value:", max_index)

Output:

NumPy数组中如何查找值的索引：全面指南

np.argmax()返回数组中最大值的索引。

6. 查找满足条件的第一个值的索引

有时我们只需要找到满足条件的第一个值的索引。

import numpy as np

arr = np.array([1, 2, 3, 4, 5, 3, 7, 8, 9])
first_index = np.where(arr == 3)[0][0]
print("numpyarray.com: First index of value 3:", first_index)

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，我们使用np.where()找到所有值为3的索引，然后取第一个索引。

7. 在排序数组中查找值的索引

对于已排序的数组，我们可以使用二分搜索来更快地找到值的索引。

import numpy as np

arr = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9])
index = np.searchsorted(arr, 5)
print("numpyarray.com: Index of value 5 in sorted array:", index)

Output:

NumPy数组中如何查找值的索引：全面指南

np.searchsorted()使用二分搜索算法来查找值应该插入的位置，这也就是该值在数组中的索引。

8. 查找近似值的索引

有时我们需要找到最接近给定值的元素的索引。

import numpy as np

arr = np.array([1.1, 2.2, 3.3, 4.4, 5.5])
value = 3.7
index = (np.abs(arr - value)).argmin()
print("numpyarray.com: Index of value closest to 3.7:", index)

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，我们首先计算数组中每个元素与目标值的差的绝对值，然后使用argmin()找到最小差值的索引。

9. 在多维数组中查找局部最小值或最大值的索引

对于多维数组，我们可能需要找到局部最小值或最大值的索引。

import numpy as np

arr_2d = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
local_max_index = np.unravel_index(np.argmax(arr_2d), arr_2d.shape)
print("numpyarray.com: Index of local maximum:", local_max_index)

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，我们使用np.argmax()找到最大值的扁平索引，然后使用np.unravel_index()将其转换为多维索引。

10. 使用numpy.flatnonzero()函数

numpy.flatnonzero()函数返回数组展平后非零元素的索引。

import numpy as np

arr = np.array([0, 1, 2, 0, 3, 0, 4, 5])
nonzero_indices = np.flatnonzero(arr)
print("numpyarray.com: Indices of non-zero elements:", nonzero_indices)

Output:

NumPy数组中如何查找值的索引：全面指南

这个函数特别适用于稀疏数组或者我们只关心非零元素的情况。

11. 在结构化数组中查找值的索引

结构化数组是NumPy中一种特殊的数组类型，它可以包含不同类型的字段。我们可以在这种数组中查找特定字段的值。

import numpy as np

dt = np.dtype([('name', 'U10'), ('age', int)])
arr = np.array([('Alice', 25), ('Bob', 30), ('Charlie', 35), ('David', 30)], dtype=dt)
indices = np.where(arr['age'] == 30)[0]
print("numpyarray.com: Indices of people aged 30:", indices)

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，我们创建了一个包含名字和年龄的结构化数组，然后查找所有30岁人的索引。

12. 使用numpy.argpartition()函数

当我们需要找到数组中第k个最小或最大元素的索引时，可以使用numpy.argpartition()函数。

import numpy as np

arr = np.array([3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5])
k = 3
indices = np.argpartition(arr, k)[:k]
print(f"numpyarray.com: Indices of {k} smallest elements:", indices)

Output:

NumPy数组中如何查找值的索引：全面指南

这个函数将数组分区，使得第k个元素在其最终排序位置，左边的元素都小于等于它，右边的元素都大于等于它。我们取前k个索引就得到了k个最小元素的索引。

13. 在masked数组中查找值的索引

NumPy的masked数组允许我们在数组中标记某些元素为无效或缺失。我们可以在这种数组中查找有效值的索引。

import numpy as np
import numpy.ma as ma

arr = ma.array([1, 2, 3, 4, 5], mask=[0, 0, 1, 0, 1])
valid_indices = ma.nonzero(~arr.mask)[0]
print("numpyarray.com: Indices of valid elements:", valid_indices)

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，我们创建了一个masked数组，其中第3和第5个元素被标记为无效。然后我们使用ma.nonzero()找到所有有效元素的索引。

14. 使用numpy.argmin()和numpy.argmax()的axis参数

对于多维数组，我们可能需要沿着特定轴找到最小值或最大值的索引。

import numpy as np

arr_2d = np.array([[1, 2, 3], [4, 5, 6], [7, 8, 9]])
min_indices = np.argmin(arr_2d, axis=1)
print("numpyarray.com: Indices of minimum values along rows:", min_indices)

Output:

NumPy数组中如何查找值的索引：全面指南

在这个例子中，我们沿着行（axis=1）找到每行的最小值的索引。

15. 使用numpy.argsort()函数

numpy.argsort()函数返回将数组排序后的索引。这可以用来找到数组中元素的排序位置。

import numpy as np

arr = np.array([3, 1, 4, 1, 5, 9, 2, 6, 5, 3, 5])
sorted_indices = np.argsort(arr)
print("numpyarray.com: Indices that would sort the array:", sorted_indices)