Python ŌĆō 如何在Pandas中检查丢失的日期
首先,设置一个包含日期记录(即我们的购买日期)的列表字典来检查缺失的日期。例如:-
# dictionary of lists
d = {'Car': ['BMW', 'Lexus', 'Audi', 'Mercedes', 'Jaguar', 'Bentley'],
'Date_of_purchase': ['2020-10-10', '2020-10-12', '2020-10-17', '2020-10-16', '2020-10-19', '2020-10-22']}
现在,从上面的字典列表中创建一个数据帧(DataFrame) −
dataFrame = pd.DataFrame(d)
接下来,将其设置为索引 −
dataFrame = dataFrame.set_index('Date_of_purchase')
使用to_datetime()方法将字符串转换为DateTime对象 −
dataFrame.index = pd.to_datetime(dataFrame.index)
在范围内显示剩余的日期 −
k = pd.date_range(start="2020-10-10", end="2020-10-22").difference(dataFrame.index);
更多Pandas文章,请阅读:Pandas教程
示例
如下所示的代码:-
import pandas as pd
# dictionary of lists
d = {'Car': ['BMW', 'Lexus', 'Audi', 'Mercedes', 'Jaguar', 'Bentley'],
'Date_of_purchase': ['2020-10-10', '2020-10-12', '2020-10-17', '2020-10-16', '2020-10-19', '2020-10-22'] }
# creating dataframe from the above dictionary of lists
dataFrame = pd.DataFrame(d)
print"DataFrame...\n",dataFrame
# Date_of_purchase set as index
dataFrame = dataFrame.set_index('Date_of_purchase')
# using to_datetime() to convert string to DateTime object
dataFrame.index = pd.to_datetime(dataFrame.index)
# remaining dates displayed as output
print("\nDisplaying remaining dates from a range of dates...")
k = pd.date_range(start="2020-10-10", end="2020-10-22").difference(dataFrame.index);
print(k);
输出
这将产生以下输出:-
DataFrame...
Car Date_of_purchase
0 BMW 2020-10-10
1 Lexus 2020-10-12
2 Audi 2020-10-17
3 Mercedes 2020-10-16
4 Jaguar 2020-10-19
5 Bentley 2020-10-22
Displaying remaining dates from a range of dates...
DatetimeIndex(['2020-10-11', '2020-10-13', '2020-10-14', '2020-10-15',
'2020-10-18', '2020-10-20', '2020-10-21'],
dtype='datetime64[ns]', freq=None)