weixin_39979159
weixin_39979159
2021-01-07 03:51

What is the meaning of the Effective Patch Size in Figure 4 in your paper?

What is the meaning of the Effective Patch Size in Figure 4 in your paper?

该提问来源于开源项目:tamarott/SinGAN

  • 点赞
  • 写回答
  • 关注问题
  • 收藏
  • 复制链接分享
  • 邀请回答

8条回答

  • weixin_39624769 weixin_39624769 4月前

    Sorry, is at the bottom of Page 5. 1 I got the point! So when the number of scales is small => the effective receptive field is smaller than the image so it can't catch the global information. on the other hand, when is number is increased the receptive field is better to see the global information. Thanks!

    点赞 评论 复制链接分享
  • weixin_39849888 weixin_39849888 4月前

    In my view, as the authors state in the paper, they make the longer side of the coarsest image to be 25px, and actually all the discriminator's receptive field is 11px*11px, and then the receptive field is biggest in the coarsest image. When the image becomes finer, the effective patch size (the receptive field relative to the image size) becomes smaller.

    点赞 评论 复制链接分享
  • weixin_39979159 weixin_39979159 4月前

    Thank you very much!

    ------------------ 原始邮件 ------------------ 发件人: "xrenaa"<notifications.com>; 发送时间: 2019年12月2日(星期一) 中午12:10 收件人: "tamarott/SinGAN"<SinGAN.github.com>; 抄送: "刘永洛"<1256994474.com>;"Author"<author.github.com>; 主题: Re: [tamarott/SinGAN] What is the meaning of the Effective Patch Size in Figure 4 in your paper? (#44)

    In my view, as the authors state in the paper, they make the longer side of the coarsest image to be 25px, and actually all the discriminator's receptive field is 11px*11px, and then the receptive field is biggest in the coarsest image. When the image becomes finer, the effective patch size (the receptive field relative to the image size) becomes smaller.

    — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.

    点赞 评论 复制链接分享
  • weixin_39753397 weixin_39753397 4月前

    Hello, I have another question! So "patch" is not actually a cropped image from the original image (e.g. X_n in the paper), namely the part of the image? I should think patch as the 11x11 convolution kernel (which is from combined 5 convs)?

    Thanks!

    点赞 评论 复制链接分享
  • weixin_39849888 weixin_39849888 4月前

    Yes. The patch is actually what a patch discriminator can see.

    点赞 评论 复制链接分享
  • weixin_39753397 weixin_39753397 4月前

    Thanks for your reply!

    点赞 评论 复制链接分享
  • weixin_39624769 weixin_39624769 4月前

    First, Thanks for your answers But I was confused about in the paper Figure 9 the author says "the effective receptive field at the coarsest level is smaller, allowing to capture only fine textures" However, according to Figure 4, the coarsest level is the biggest one. What I missed or I misunderstood something?

    Thanks a lot.

    点赞 评论 复制链接分享
  • weixin_39849888 weixin_39849888 4月前

    -ho Actually I don't see this sentence. The caption of figure 9 is "A model with a small number of scales only captures textures. As the number of scales increases, SinGAN manages to capture larger structures as well as the global arrangement of objects in the scene." In my view, the finest figure is of the same size, which is 141*250. And when the scale (layer) of the model is small, the coarsest figure is big and then the effective receptive field is big and the model can not learn about the global information.

    点赞 评论 复制链接分享

相关推荐