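While adding features, I found that some features can be extracted from the training set but not from the test set. The task asks for predictions on the test set, namely whether a coupon is used within 15 days of being received. During feature extraction on the training set I added redemption-related features, such as "number of times a merchant's coupons were redeemed after being received", but the test set cannot produce this same feature, because it contains only coupon-receipt records. I suspect that being unable to extract a consistent set of features is what causes the training and test features to mismatch. How should I solve this problem?
[Details of the problem follow]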
ValueError: feature_names mismatch:
['Distance', 'is_manjian', 'discount_rate', 'min_cost_of_manjian', 'null_distance', 'week', 'is_weekend', 'week_0', 'week_1', 'week_2', 'week_3', 'week_4', 'week_5', 'week_6', 'simple_User_id_receive_cnt', 'simple_User_id_Coupon_id_receive_cnt', 'simple_User_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_repeat_receive', 'simple_Merchant_id_received_cnt', 'simple_Merchant_id_hexiao_times', 'simple_Merchant_id_hexiao_rate', 'simple_Merchant_id_label_hexiao_diff_users', 'simple_Merchant_id_label_hexiao_diff_coupons']
['Distance', 'is_manjian', 'discount_rate', 'min_cost_of_manjian', 'null_distance', 'week', 'is_weekend', 'week_0', 'week_1', 'week_2', 'week_3', 'week_4', 'week_5', 'week_6', 'simple_User_id_receive_cnt', 'simple_User_id_Coupon_id_receive_cnt', 'simple_User_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_repeat_receive', 'simple_Merchant_id_received_cnt']
expected simple_Merchant_id_label_hexiao_diff_users, simple_Merchant_id_label_hexiao_diff_coupons, simple_Merchant_id_hexiao_rate, simple_Merchant_id_hexiao_times in input data
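
One possible way to keep the two column lists identical, offered as a minimal sketch rather than a definitive fix: compute the redemption ("hexiao") features from a history window that precedes each prediction window, then merge them onto both the training rows and the test rows with the same function. The sketch assumes the history data has a `Date` column that is empty when a coupon was never redeemed; that column name, the helper names `merchant_history_features` and `add_features`, and the toy data are illustrative assumptions, not taken from the post. Features that are derived from the label window's own redemption outcomes (the `..._label_hexiao_...` columns may be of this kind) cannot be built for the test set at all and would probably have to be dropped from training.

```python
import pandas as pd

HIST_COLS = ["simple_Merchant_id_hexiao_times", "simple_Merchant_id_hexiao_rate"]

def merchant_history_features(history: pd.DataFrame) -> pd.DataFrame:
    """Per-merchant redemption stats computed from a past window only."""
    # Keep rows where a coupon was actually received.
    received = history[history["Coupon_id"].notna()]
    grouped = received.groupby("Merchant_id")
    feats = pd.DataFrame({
        "hist_received_cnt": grouped.size(),
        # count() ignores NaN, so this counts redeemed coupons only.
        "simple_Merchant_id_hexiao_times": grouped["Date"].count(),
    })
    feats["simple_Merchant_id_hexiao_rate"] = (
        feats["simple_Merchant_id_hexiao_times"] / feats["hist_received_cnt"]
    )
    return feats[HIST_COLS].reset_index()

def add_features(window: pd.DataFrame, history: pd.DataFrame) -> pd.DataFrame:
    """Attach history features to the rows to be scored (train or test)."""
    out = window.merge(merchant_history_features(history), on="Merchant_id", how="left")
    # Merchants with no history get 0 instead of NaN (a modelling choice).
    out[HIST_COLS] = out[HIST_COLS].fillna(0)
    return out

# Toy data: history windows contain redemption info, prediction windows do not.
history_for_train = pd.DataFrame({
    "Merchant_id": [1, 1, 2],
    "Coupon_id": [10, 10, 20],
    "Date": [20160105, None, None],  # assumed redemption-date column
})
train_window = pd.DataFrame({"Merchant_id": [1, 2, 3]})

history_for_test = pd.DataFrame({
    "Merchant_id": [1, 2, 2],
    "Coupon_id": [10, 20, 20],
    "Date": [None, 20160301, 20160302],
})
test_window = pd.DataFrame({"Merchant_id": [2, 3]})  # receipts only

X_train = add_features(train_window, history_for_train)
X_test = add_features(test_window, history_for_test)

# Both frames now carry the same columns in the same order.
assert list(X_train.columns) == list(X_test.columns)
```

Because the same function builds both frames, any feature that exists for training also exists for prediction, which is the condition the `feature_names mismatch` check enforces.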