Python Pandas dataframe.memory_usage()

Python是一种进行数据分析的伟大语言，主要是因为以数据为中心的Python软件包的奇妙生态系统。Pandas就是这些包中的一个，使导入和分析数据变得更加容易。
Pandas dataframe.memory_usage()函数返回每一列的内存使用量，单位是字节。内存使用量可以选择性地包括索引和对象dtype元素的贡献。这个值默认会显示在DataFrame.info中。

语法:

DataFrame.memory_usage(index=True, deep=False)

参数 :
index : 指定是否在返回的系列中包括DataFrame的索引的内存使用。如果index=True，则输出中的第一项索引的内存使用情况。
deep : 如果是True，通过询问对象dtypes的系统级内存消耗来深入反省数据，并将其包含在返回值中。
返回：一个系列，其索引为原始列名，其值为每一列的内存使用量，单位为字节。
示例#1：使用memory_usage()函数打印数据框中每一列的内存使用量以及索引的内存使用量。

# importing pandas as pd
import pandas as pd
 
# Creating the dataframe
df = pd.read_csv("nba.csv")
 
# Print the dataframe
df

Python Pandas dataframe.memory_usage()

让我们使用memory_usage()函数来查找每一列的内存使用情况。

# Function to find memory use of each
# column along with the index
# even if we do not set index = True,
# it will show the index usage as well by default.
df.memory_usage(index = True)

输出 :

Python Pandas dataframe.memory_usage()

例子2：使用memory_usage()函数查找每一列的内存使用情况，但不包括索引。

# importing pandas as pd
import pandas as pd
 
# Creating the dataframe
df = pd.read_csv("nba.csv")
 
# Function to find memory use of each
# column but not of the index
# we set index = False
df.memory_usage(index = False)