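How to Read All Excel Files in a Directory as Pandas DataFrames?
To read every Excel file in a directory, use the glob module to find the files and the pandas read_excel() method to load each one.
Suppose the directory contains the following Excel files: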
Sales1.xlsx
Sales2.xlsx
First, set the path to the directory that holds the Excel files. Then use glob to collect the names of the Excel files in it:
import glob

# raw strings keep the Windows backslashes literal
path = r"C:\Users\amit_\Desktop"
filenames = glob.glob(path + r"\*.xlsx")
print('File names:', filenames)
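The same listing step can also be written with pathlib, which avoids building the pattern by string concatenation. The following is a minimal sketch, not part of the original example: the helper name list_excel_files is hypothetical, and the filter on names starting with "~$" assumes you want to skip the temporary lock files that Excel creates while a workbook is open.

```python
from pathlib import Path


def list_excel_files(folder):
    """Return the .xlsx files in `folder`, sorted by name.

    Hypothetical helper: it skips names starting with "~$", the
    temporary lock files Excel creates while a workbook is open.
    """
    return sorted(
        p for p in Path(folder).glob("*.xlsx")
        if not p.name.startswith("~$")
    )
```

Calling list_excel_files(r"C:\Users\amit_\Desktop") would return Path objects, which read_excel() accepts directly. Sorting is used here because glob does not promise any particular order.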
Next, use a for loop to iterate over the file names and read each Excel file with read_excel():
for file in filenames:
    print("Reading file =", file)
    print(pd.read_excel(file))
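The loop above only prints each DataFrame. If the DataFrames are needed later, one option is to keep them in a dictionary keyed by file name. This is a sketch under assumptions: the function name read_excel_folder and its reader parameter are hypothetical, introduced only for illustration, and reading .xlsx files with pd.read_excel generally requires the openpyxl package to be installed.

```python
from pathlib import Path

import pandas as pd


def read_excel_folder(folder, reader=pd.read_excel):
    """Map each .xlsx file name in `folder` to the DataFrame read from it.

    `reader` is a hypothetical hook (defaulting to pd.read_excel) so that
    a different loader can be swapped in if needed.
    """
    frames = {}
    for file in sorted(Path(folder).glob("*.xlsx")):
        frames[file.name] = reader(file)
    return frames
```

With the example directory, read_excel_folder(r"C:\Users\amit_\Desktop")["Sales1.xlsx"] would then give the DataFrame for the first file.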
Example
Following is the complete code:
import pandas as pd
import glob

# directory that holds the Excel files (raw string keeps the backslashes literal)
path = r"C:\Users\amit_\Desktop"

# collect all files with the .xlsx extension, i.e. Excel workbooks
filenames = glob.glob(path + r"\*.xlsx")
print('File names:', filenames)

# iterate over all Excel files
for file in filenames:
    # read each Excel file into a DataFrame and print it
    print("Reading file =", file)
    print(pd.read_excel(file))
Output
This will produce the following output:
File names: ['C:\\Users\\amit_\\Desktop\\Sales1.xlsx', 'C:\\Users\\amit_\\Desktop\\Sales2.xlsx']
Reading file = C:\Users\amit_\Desktop\Sales1.xlsx
          Car      Place  UnitsSold
0        Audi  Bangalore         80
1     Porsche     Mumbai        110
2  RollsRoyce       Pune        100
Reading file = C:\Users\amit_\Desktop\Sales2.xlsx
          Car       Place  UnitsSold
0         BMW       Delhi         95
1    Mercedes   Hyderabad         80
2  Lamborgini  Chandigarh         80
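Because both files share the same columns, the per-file DataFrames can also be stacked into a single DataFrame with pd.concat(). The sketch below is an illustration rather than part of the original example: it rebuilds the two tables from the output above in memory instead of reading them from disk, and the SourceFile column is a hypothetical addition that records which file each row came from.

```python
import pandas as pd

# The two tables from the output above, rebuilt in memory for illustration.
sales1 = pd.DataFrame({
    "Car": ["Audi", "Porsche", "RollsRoyce"],
    "Place": ["Bangalore", "Mumbai", "Pune"],
    "UnitsSold": [80, 110, 100],
})
sales2 = pd.DataFrame({
    "Car": ["BMW", "Mercedes", "Lamborgini"],
    "Place": ["Delhi", "Hyderabad", "Chandigarh"],
    "UnitsSold": [95, 80, 80],
})

# Tag each row with its source file (hypothetical column name), then stack.
# ignore_index=True renumbers the rows 0..5 instead of repeating 0..2.
combined = pd.concat(
    [
        sales1.assign(SourceFile="Sales1.xlsx"),
        sales2.assign(SourceFile="Sales2.xlsx"),
    ],
    ignore_index=True,
)
print(combined)
```

In a real script, the list passed to pd.concat() would hold the DataFrames returned by read_excel() for each file name found by glob.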