dpwqb 2020-01-28 16:48 采纳率: 33.3%
浏览 627
已结题

xgboost ValueError:特征名不匹配问题;问题背景:天池新人O2O优惠券使用预测

我在增加特征的过程中,发现训练集可以提取的特征但测试集不能提取,比如题目让你提交测试集的预测结果,即“领取优惠券后15天以内的使用情况”,但是训练集在提取特征过程中,我增加了与核销有关的特征,比如“商家优惠券被领取后核销次数”,而测试集去无法提取和训练集相同的这个特征,因为测试集只有优惠券的领取数据。我认为可能是因为不能提取一致的特征才导致训练集和测试集特征不匹配,那么我应该如何解决这个给问题?

[以下为问题的具体描述]

ValueError: feature_names mismatch: 
['Distance', 'is_manjian', 'discount_rate', 'min_cost_of_manjian', 'null_distance', 'week', 'is_weekend', 'week_0', 'week_1', 'week_2', 'week_3', 'week_4', 'week_5', 'week_6', 'simple_User_id_receive_cnt', 'simple_User_id_Coupon_id_receive_cnt', 'simple_User_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_repeat_receive', 'simple_Merchant_id_received_cnt', 'simple_Merchant_id_hexiao_times', 'simple_Merchant_id_hexiao_rate', 'simple_Merchant_id_label_hexiao_diff_users', 'simple_Merchant_id_label_hexiao_diff_coupons'] 

['Distance', 'is_manjian', 'discount_rate', 'min_cost_of_manjian', 'null_distance', 'week', 'is_weekend', 'week_0', 'week_1', 'week_2', 'week_3', 'week_4', 'week_5', 'week_6', 'simple_User_id_receive_cnt', 'simple_User_id_Coupon_id_receive_cnt', 'simple_User_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_receive_cnt', 'simple_User_id_Coupon_id_Date_received_repeat_receive', 'simple_Merchant_id_received_cnt']

expected 
simple_Merchant_id_label_hexiao_diff_users, simple_Merchant_id_label_hexiao_diff_coupons, simple_Merchant_id_hexiao_rate, simple_Merchant_id_hexiao_times in input data
  • 写回答

1条回答 默认 最新

  • threenewbee 2020-01-28 16:57
    关注

    你的数据集缺少 'simple_Merchant_id_hexiao_times', 'simple_Merchant_id_hexiao_rate', 'simple_Merchant_id_label_hexiao_diff_users', 'simple_Merchant_id_label_hexiao_diff_coupons' 这几列啊

    评论

报告相同问题?

悬赏问题

  • ¥15 YoloV5 第三方库的版本对照问题
  • ¥15 请完成下列相关问题!
  • ¥15 drone 推送镜像时候 purge: true 推送完毕后没有删除对应的镜像,手动拷贝到服务器执行结果正确在样才能让指令自动执行成功删除对应镜像,如何解决?
  • ¥15 求daily translation(DT)偏差订正方法的代码
  • ¥15 js调用html页面需要隐藏某个按钮
  • ¥15 ads仿真结果在圆图上是怎么读数的
  • ¥20 Cotex M3的调试和程序执行方式是什么样的?
  • ¥20 java项目连接sqlserver时报ssl相关错误
  • ¥15 一道python难题3
  • ¥15 牛顿斯科特系数表表示