巨人脚下的周小烨 2020-08-10 20:13 Acceptance rate: 0%
Views: 150
Closed

What is "standard 10-view prediction"?

I came across this term in the paper "Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition":

"3.1.2 Multi-level Pooling Improves Accuracy. In Table 2 (b) we show the results using single-size training. The training and testing sizes are both 224×224. In these networks, the convolutional layers have the same structures as the corresponding baseline models, whereas the pooling layer after the final convolutional layer is replaced with the SPP layer. For the results in Table 2, we use a 4-level pyramid. The pyramid is {6×6, 3×3, 2×2, 1×1} (totally 50 bins). For fair comparison, we still use the standard 10-view prediction with each view a 224×224 crop. Our results in Table 2 (b) show considerable improvement over the no-SPP baselines in Table 2 (a). Interestingly, the largest gain of top-1 error (1.65%) is given by the most accurate architecture. Since we are still using the same 10 cropped views as in (a), these gains are solely because of multi-level pooling."

Thanks a lot.

1 answer

  • threenewbee 2020-08-10 21:30
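    Standard 10-view prediction (the word is "prediction"; "preditio" is a typo).
    In this paper the "views" are not multiple cameras and have nothing to do with depth sensing. The term refers to the test-time procedure popularized by AlexNet (Krizhevsky et al., 2012): from each test image, take five 224×224 crops (the four corners and the center) plus the horizontal flip of each, run the network on all 10 views, and average the 10 sets of class scores. The SPP paper keeps this procedure unchanged so that the gains in Table 2 (b) can be attributed to multi-level pooling alone.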
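
The 10-view procedure can be sketched in a few lines of NumPy, assuming the usual scheme of four corner crops, a center crop, and the horizontal flip of each. The names `ten_views`, `predict_10_view`, and `dummy_model` are hypothetical, the 256×256 input size is an assumption for illustration, and `dummy_model` only stands in for a trained network; this is not the paper's code.

```python
import numpy as np


def ten_views(image, crop=224):
    """Return 10 views of an H x W x C image: 5 crops plus their horizontal flips."""
    h, w = image.shape[:2]
    if h < crop or w < crop:
        raise ValueError("image must be at least crop x crop")
    cy, cx = (h - crop) // 2, (w - crop) // 2
    # Top-left, top-right, bottom-left, bottom-right, center.
    offsets = [(0, 0), (0, w - crop), (h - crop, 0), (h - crop, w - crop), (cy, cx)]
    crops = [image[y:y + crop, x:x + crop] for y, x in offsets]
    # Reverse the width axis to mirror each crop left-to-right.
    flips = [c[:, ::-1] for c in crops]
    return np.stack(crops + flips)  # shape: (10, crop, crop, C)


def predict_10_view(image, model_fn, crop=224):
    """Average the class scores that model_fn produces for the 10 views."""
    scores = model_fn(ten_views(image, crop))  # expected shape: (10, num_classes)
    return scores.mean(axis=0)


def dummy_model(batch):
    # Placeholder for a real classifier (assumption): one "score" per channel,
    # computed as the mean pixel value of that channel.
    return batch.reshape(batch.shape[0], -1, batch.shape[-1]).mean(axis=1)


rng = np.random.default_rng(0)
image = rng.random((256, 256, 3))  # assumed 256 x 256 RGB test image
views = ten_views(image)
avg_scores = predict_10_view(image, dummy_model)
print(views.shape, avg_scores.shape)
```

With a real network, `model_fn` would return softmax probabilities or logits per view, and the `argmax` of the averaged scores would be the final prediction.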
