巨人脚下的周小烨
2020-08-10 20:13什么是standard 10-view preditio?
10I met this term in the paper "Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition":"3.1.2 Multi-level Pooling Improves Accuracy In Table 2 (b) we show the results using single- size training. The training and testing sizes are both 224×224. In these networks, the convolutional layers have the same structures as the corresponding base- line models, whereas the pooling layer after the final convolutional layer is replaced with the SPP layer. For the results in Table 2, we use a 4-level pyramid. The pyramid is {6×6, 3×3, 2×2, 1×1} (totally 50 bins). For fair comparison, we still use the standard 10- view prediction with each view a 224×224 crop. Our results in Table 2 (b) show considerable improvement over the no-SPP baselines in Table 2 (a). Interestingly, the largest gain of top-1 error (1.65%) is given by the most accurate architecture. Since we are still using the same 10 cropped views as in (a), these gains are solely because of multi-level pooling."
thanks a lot
- 点赞
- 回答
- 收藏
- 复制链接分享
1条回答
为你推荐
- Go中多词结构类型的命名标准是什么?
- struct
- 1个回答
- golang:为什么终端标准输出不打印出utf-8字符?
- vim
- 2个回答
- Go中任意精度常数的目的是什么?
- constants
- 1个回答
- 为什么Go图像包剪切+粘贴循环遍历像素?
- coding-style
- types
- standard-library
- 2个回答
- 为什么Go的bufio在后台使用恐慌?
- error-handling
- 1个回答