最近尝试deploy deeplab v2 的模型, 官方没有deploy.prototxt 所以自己从test.prototxt 改写了一个。但结果是:
如果去掉CRF层,跑出来就是一团浆糊; 如果保留CRF层,物体的轮廓很清晰,有胳膊有腿的,但是每个物体每次跑出来的颜色都不一样,也就是说每次分配的标签也不一样,也印证了前面没有CRF层的时候完全没有预测能力这个事实。。。
我直接用了 **[http://liangchiehchen.com/projects/DeepLabv2_vgg.html](http://liangchiehchen.com/projects/DeepLabv2_vgg.html)** 这个链接下载的 train_iter_20000.caffemodel,我以为这是在 VOC 2012 上 fine-tune 过的权重,因此我的 deploy.prototxt 也是按照这个假设写的。所以,求助啊!为什么这样直接 deploy 完全不行的样子?
以下是deploy.prototxt:
# VGG 16-layer network convolutional finetuning
# Network modified to have smaller receptive field (128 pixels)
# and smaller stride (8 pixels) when run in convolutional mode.
#
# In this model we also change max pooling size in the first 4 layers
# from 2 to 3 while retaining stride = 2
# which makes it easier to exactly align responses at different layers.
#
name: "233"
# Image blob: 1 x 3 x 513 x 513.
# NOTE(review): assumes BGR, mean-subtracted input like the training net — confirm preprocessing.
input: "data"
input_dim: 1
input_dim: 3
input_dim: 513
input_dim: 513
# 1 x 1 x 1 x 2 blob; presumably carries the real image (height, width) for the
# DenseCRF layer — verify against how the caller fills it.
input: "data_dim"
input_dim: 1
input_dim: 1
input_dim: 1
input_dim: 2
# MemoryData input layer carried over from test.prototxt, disabled here
# (the net uses the "input:" declarations above instead).
# NOTE(review): previously only the "layer {" and "}" lines carried a '#',
# leaving the body as stray top-level tokens that break prototxt parsing.
# The whole stanza is now commented out.
#layer {
#name: "data"
#type: "MemoryData"
#top: "data"
#top: "data_dim"
#memory_data_param {
#batch_size: 1
#channels: 3
#height: 865
#width: 1297
#}
#}
# Block 1: two 3x3/pad-1 convs (64 channels) + 3x3/stride-2/pad-1 max pool.
layer {
name: "conv1_1"
type: "Convolution"
bottom: "data"
top: "conv1_1"
convolution_param {
num_output: 64
pad: 1
kernel_size: 3
}
}
layer {
name: "relu1_1"
type: "ReLU"
bottom: "conv1_1"
top: "conv1_1"
}
layer {
name: "conv1_2"
type: "Convolution"
bottom: "conv1_1"
top: "conv1_2"
convolution_param {
num_output: 64
pad: 1
kernel_size: 3
}
}
layer {
name: "relu1_2"
type: "ReLU"
bottom: "conv1_2"
top: "conv1_2"
}
# 3x3 pooling window (instead of VGG's 2x2), stride 2 — per the header note,
# this keeps responses aligned across layers.
layer {
name: "pool1"
type: "Pooling"
bottom: "conv1_2"
top: "pool1"
pooling_param {
pool: MAX
kernel_size: 3
stride: 2
pad: 1
}
}
# Block 2: two 3x3/pad-1 convs (128 channels) + 3x3/stride-2/pad-1 max pool.
layer {
name: "conv2_1"
type: "Convolution"
bottom: "pool1"
top: "conv2_1"
convolution_param {
num_output: 128
pad: 1
kernel_size: 3
}
}
layer {
name: "relu2_1"
type: "ReLU"
bottom: "conv2_1"
top: "conv2_1"
}
layer {
name: "conv2_2"
type: "Convolution"
bottom: "conv2_1"
top: "conv2_2"
convolution_param {
num_output: 128
pad: 1
kernel_size: 3
}
}
layer {
name: "relu2_2"
type: "ReLU"
bottom: "conv2_2"
top: "conv2_2"
}
layer {
name: "pool2"
type: "Pooling"
bottom: "conv2_2"
top: "pool2"
pooling_param {
pool: MAX
kernel_size: 3
stride: 2
pad: 1
}
}
# Block 3: three 3x3/pad-1 convs (256 channels) + 3x3/stride-2/pad-1 max pool.
layer {
name: "conv3_1"
type: "Convolution"
bottom: "pool2"
top: "conv3_1"
convolution_param {
num_output: 256
pad: 1
kernel_size: 3
}
}
layer {
name: "relu3_1"
type: "ReLU"
bottom: "conv3_1"
top: "conv3_1"
}
layer {
name: "conv3_2"
type: "Convolution"
bottom: "conv3_1"
top: "conv3_2"
convolution_param {
num_output: 256
pad: 1
kernel_size: 3
}
}
layer {
name: "relu3_2"
type: "ReLU"
bottom: "conv3_2"
top: "conv3_2"
}
layer {
name: "conv3_3"
type: "Convolution"
bottom: "conv3_2"
top: "conv3_3"
convolution_param {
num_output: 256
pad: 1
kernel_size: 3
}
}
layer {
name: "relu3_3"
type: "ReLU"
bottom: "conv3_3"
top: "conv3_3"
}
layer {
name: "pool3"
type: "Pooling"
bottom: "conv3_3"
top: "pool3"
pooling_param {
pool: MAX
kernel_size: 3
stride: 2
pad: 1
}
}
# Block 4: three 3x3/pad-1 convs (512 channels).
layer {
name: "conv4_1"
type: "Convolution"
bottom: "pool3"
top: "conv4_1"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu4_1"
type: "ReLU"
bottom: "conv4_1"
top: "conv4_1"
}
layer {
name: "conv4_2"
type: "Convolution"
bottom: "conv4_1"
top: "conv4_2"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu4_2"
type: "ReLU"
bottom: "conv4_2"
top: "conv4_2"
}
layer {
name: "conv4_3"
type: "Convolution"
bottom: "conv4_2"
top: "conv4_3"
convolution_param {
num_output: 512
pad: 1
kernel_size: 3
}
}
layer {
name: "relu4_3"
type: "ReLU"
bottom: "conv4_3"
top: "conv4_3"
}
# pool4 uses stride 1 (not 2): spatial resolution is preserved from here on,
# keeping the overall network stride at 8 as stated in the header.
layer {
bottom: "conv4_3"
top: "pool4"
name: "pool4"
type: "Pooling"
pooling_param {
pool: MAX
kernel_size: 3
pad: 1
stride: 1
}
}
# Block 5: three 3x3 convs (512 channels) with dilation 2 / pad 2, compensating
# for pool4's stride-1 so the effective receptive field matches the original VGG.
layer {
name: "conv5_1"
type: "Convolution"
bottom: "pool4"
top: "conv5_1"
convolution_param {
num_output: 512
pad: 2
kernel_size: 3
dilation: 2
}
}
layer {
name: "relu5_1"
type: "ReLU"
bottom: "conv5_1"
top: "conv5_1"
}
layer {
name: "conv5_2"
type: "Convolution"
bottom: "conv5_1"
top: "conv5_2"
convolution_param {
num_output: 512
pad: 2
kernel_size: 3
dilation: 2
}
}
layer {
name: "relu5_2"
type: "ReLU"
bottom: "conv5_2"
top: "conv5_2"
}
layer {
name: "conv5_3"
type: "Convolution"
bottom: "conv5_2"
top: "conv5_3"
convolution_param {
num_output: 512
pad: 2
kernel_size: 3
dilation: 2
}
}
layer {
name: "relu5_3"
type: "ReLU"
bottom: "conv5_3"
top: "conv5_3"
}
# pool5 also keeps stride 1 — no further downsampling.
layer {
bottom: "conv5_3"
top: "pool5"
name: "pool5"
type: "Pooling"
pooling_param {
pool: MAX
kernel_size: 3
stride: 1
pad: 1
}
}
# ---- ASPP branch 1: hole (dilation) = 6 ----
# (was a bare "hole = 6" line, which is not valid prototxt syntax)
layer {
name: "fc6_1"
type: "Convolution"
bottom: "pool5"
top: "fc6_1"
convolution_param {
num_output: 1024
pad: 6
kernel_size: 3
dilation: 6
}
}
layer {
name: "relu6_1"
type: "ReLU"
bottom: "fc6_1"
top: "fc6_1"
}
# Dropout is a pass-through at test/deploy time.
layer {
name: "drop6_1"
type: "Dropout"
bottom: "fc6_1"
top: "fc6_1"
dropout_param {
dropout_ratio: 0.5
}
}
layer {
name: "fc7_1"
type: "Convolution"
bottom: "fc6_1"
top: "fc7_1"
convolution_param {
num_output: 1024
kernel_size: 1
}
}
layer {
name: "relu7_1"
type: "ReLU"
bottom: "fc7_1"
top: "fc7_1"
}
layer {
name: "drop7_1"
type: "Dropout"
bottom: "fc7_1"
top: "fc7_1"
dropout_param {
dropout_ratio: 0.5
}
}
# 21-class (VOC12) score layer. Renamed "fc8_0_1" -> "fc8_voc12_1" so Caffe's
# name-based weight copy picks up the released DeepLabv2 VGG fc8 weights; with
# a non-matching name this layer silently kept its random gaussian init, which
# is why predictions differed on every run. The top blob keeps its old name so
# the Eltwise layer below still connects unchanged.
layer {
name: "fc8_voc12_1"
type: "Convolution"
bottom: "fc7_1"
top: "fc8_0_1"
convolution_param {
num_output: 21
kernel_size: 1
weight_filler {
type: "gaussian"
std: 0.01
}
bias_filler {
type: "constant"
value: 0
}
}
}
# ---- ASPP branch 2: hole (dilation) = 12 ----
# (was a bare "hole = 12" line, which is not valid prototxt syntax)
layer {
name: "fc6_2"
type: "Convolution"
bottom: "pool5"
top: "fc6_2"
convolution_param {
num_output: 1024
pad: 12
kernel_size: 3
dilation: 12
}
}
layer {
name: "relu6_2"
type: "ReLU"
bottom: "fc6_2"
top: "fc6_2"
}
layer {
name: "drop6_2"
type: "Dropout"
bottom: "fc6_2"
top: "fc6_2"
dropout_param {
dropout_ratio: 0.5
}
}
layer {
name: "fc7_2"
type: "Convolution"
bottom: "fc6_2"
top: "fc7_2"
convolution_param {
num_output: 1024
kernel_size: 1
}
}
layer {
name: "relu7_2"
type: "ReLU"
bottom: "fc7_2"
top: "fc7_2"
}
layer {
name: "drop7_2"
type: "Dropout"
bottom: "fc7_2"
top: "fc7_2"
dropout_param {
dropout_ratio: 0.5
}
}
# Renamed "fc8_0_2" -> "fc8_voc12_2" to match the released caffemodel's layer
# name (name-based weight loading); top blob name unchanged for wiring.
layer {
name: "fc8_voc12_2"
type: "Convolution"
bottom: "fc7_2"
top: "fc8_0_2"
convolution_param {
num_output: 21
kernel_size: 1
weight_filler {
type: "gaussian"
std: 0.01
}
bias_filler {
type: "constant"
value: 0
}
}
}
# ---- ASPP branch 3: hole (dilation) = 18 ----
# (was a bare "hole = 18" line, which is not valid prototxt syntax)
layer {
name: "fc6_3"
type: "Convolution"
bottom: "pool5"
top: "fc6_3"
convolution_param {
num_output: 1024
pad: 18
kernel_size: 3
dilation: 18
}
}
layer {
name: "relu6_3"
type: "ReLU"
bottom: "fc6_3"
top: "fc6_3"
}
layer {
name: "drop6_3"
type: "Dropout"
bottom: "fc6_3"
top: "fc6_3"
dropout_param {
dropout_ratio: 0.5
}
}
layer {
name: "fc7_3"
type: "Convolution"
bottom: "fc6_3"
top: "fc7_3"
convolution_param {
num_output: 1024
kernel_size: 1
}
}
layer {
name: "relu7_3"
type: "ReLU"
bottom: "fc7_3"
top: "fc7_3"
}
layer {
name: "drop7_3"
type: "Dropout"
bottom: "fc7_3"
top: "fc7_3"
dropout_param {
dropout_ratio: 0.5
}
}
# Renamed "fc8_0_3" -> "fc8_voc12_3" to match the released caffemodel's layer
# name (name-based weight loading); top blob name unchanged for wiring.
layer {
name: "fc8_voc12_3"
type: "Convolution"
bottom: "fc7_3"
top: "fc8_0_3"
convolution_param {
num_output: 21
kernel_size: 1
weight_filler {
type: "gaussian"
std: 0.01
}
bias_filler {
type: "constant"
value: 0
}
}
}
# ---- ASPP branch 4: hole (dilation) = 24 ----
# (was a bare "hole = 24" line, which is not valid prototxt syntax)
layer {
name: "fc6_4"
type: "Convolution"
bottom: "pool5"
top: "fc6_4"
convolution_param {
num_output: 1024
pad: 24
kernel_size: 3
dilation: 24
}
}
layer {
name: "relu6_4"
type: "ReLU"
bottom: "fc6_4"
top: "fc6_4"
}
layer {
name: "drop6_4"
type: "Dropout"
bottom: "fc6_4"
top: "fc6_4"
dropout_param {
dropout_ratio: 0.5
}
}
layer {
name: "fc7_4"
type: "Convolution"
bottom: "fc6_4"
top: "fc7_4"
convolution_param {
num_output: 1024
kernel_size: 1
}
}
layer {
name: "relu7_4"
type: "ReLU"
bottom: "fc7_4"
top: "fc7_4"
}
layer {
name: "drop7_4"
type: "Dropout"
bottom: "fc7_4"
top: "fc7_4"
dropout_param {
dropout_ratio: 0.5
}
}
# Renamed "fc8_0_4" -> "fc8_voc12_4" to match the released caffemodel's layer
# name (name-based weight loading); top blob name unchanged for wiring.
layer {
name: "fc8_voc12_4"
type: "Convolution"
bottom: "fc7_4"
top: "fc8_0_4"
convolution_param {
num_output: 21
kernel_size: 1
weight_filler {
type: "gaussian"
std: 0.01
}
bias_filler {
type: "constant"
value: 0
}
}
}
# Sum the four ASPP branch score maps into the fused 21-channel score.
# (the previous "SUM the four branches" line lacked a '#' and was invalid
# prototxt syntax)
layer {
bottom: "fc8_0_1"
bottom: "fc8_0_2"
bottom: "fc8_0_3"
bottom: "fc8_0_4"
top: "fc8_0"
name: "fc8_0"
type: "Eltwise"
eltwise_param {
operation: SUM
}
}
# Upsample the fused score map 8x back toward input resolution.
# NOTE(review): "Interp" is a DeepLab-specific layer, not in stock BVLC Caffe —
# this prototxt requires the DeepLab fork of Caffe to run.
layer {
bottom: "fc8_0"
top: "fc8_interp"
name: "fc8_interp"
type: "Interp"
interp_param {
zoom_factor: 8
}
}
# DenseCRF post-processing layer, disabled for this deploy net.
# NOTE(review): previously only the "layer {" and "}" lines carried a '#',
# leaving the layer body as stray top-level tokens that break prototxt
# parsing. The whole stanza is now commented out; remove the leading '#'
# on every line (and feed "crf_inf" to ArgMax) to re-enable it.
#layer {
#bottom: "fc8_interp"
#bottom: "data_dim"
#bottom: "data"
#top: "crf_inf"
#name: "crf"
#type: "DenseCRF"
#dense_crf_param {
#max_iter: 10
#pos_w: 3
#pos_xy_std: 3
#bi_w: 6
#bi_xy_std: 50
#bi_rgb_std: 4
#}
#}
# Final per-pixel label map: argmax over the 21 class channels (axis 1).
layer {
name: "argmax"
type: "ArgMax"
bottom: "fc8_interp"
top: "argmax"
argmax_param {
axis: 1
}
}