Paper notes: CVPR 2021, Bottom-Up Shift and Reasoning for Referring Image Segmentation

_击空明兮溯流光_

Category: graph relation; Tags: deep learning

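Copyright notice: This is an original article by the blogger, licensed under CC 4.0 BY-SA. When reposting, please include a link to the original source and this notice.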

Article link: https://blog.csdn.net/Blair_2/article/details/121204488

Task: Referring Image Segmentation (RIS)

Keywords: one-stage RIS, graph, relation reasoning

Background: comparison of methods

Vision-and-language approaches can be grouped by their design principles:

(1) multimodal fusion and representation learning

(2) language-conditioned visual reasoning

Two-stage RIS:

Pros: explicit object instances and their relationships are used to conduct visual reasoning

Cons: slow inference speed; poor generalization; the relational and spatial priors in images are lost when reasoning over the feature vectors of those object instances

One-stage RIS:

Pros: fast inference speed; contextual representations

Cons: no explicit object-level information; inferior at handling complex visual scenes and expressions because they lack sufficient visual reasoning capability

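Method:

Image encoder: DeepLab ResNet101

Language encoder: the GloVe word embedding w_t of each word l_t, plus position encoding

To further strengthen the representation of relationships between words, a self-attention mechanism is introduced.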
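To make the language-encoder description concrete, here is a minimal sketch in plain Python. It rests on several assumptions that are not in the notes: the position encoding is taken to be sinusoidal, the self-attention is a single head without learned projections, and the tiny 4-d vectors in `word_embeddings` are toy stand-ins for real GloVe vectors.

```python
import math

def positional_encoding(num_words, dim):
    """Sinusoidal position encoding (assumed; the notes only say 'position encoding')."""
    pe = []
    for pos in range(num_words):
        row = []
        for i in range(dim):
            angle = pos / (10000 ** (2 * (i // 2) / dim))
            row.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
        pe.append(row)
    return pe

def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(feats):
    """Single-head scaled dot-product self-attention, no learned projections (assumed)."""
    dim = len(feats[0])
    out = []
    for q in feats:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in feats]
        weights = softmax(scores)
        # Each output word is a weighted average of all input words.
        out.append([sum(w * v[j] for w, v in zip(weights, feats)) for j in range(dim)])
    return out

# Toy stand-ins for the GloVe embeddings w_t of three words l_t.
word_embeddings = [
    [0.1, 0.3, -0.2, 0.5],
    [0.0, -0.1, 0.4, 0.2],
    [0.7, 0.2, 0.1, -0.3],
]
pe = positional_encoding(len(word_embeddings), 4)
inputs = [[w + p for w, p in zip(wt, pt)] for wt, pt in zip(word_embeddings, pe)]
contextual = self_attention(inputs)  # one context-aware vector per word
```

Each row of `contextual` mixes information from every word in the expression, which is the sense in which self-attention strengthens inter-word relationships.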

Bottom-Up Shift:

(1) Analysis of Reasoning Steps

A graph representation is used to abstract complex reasoning into simple nodes and edges.

A language graph (a directed acyclic graph) is used: a node and a directed edge of the graph correspond to a noun phrase and a linguistic relationship, respectively.
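A language graph of this kind can be sketched as a small data structure. The class `LanguageGraph`, the example expression, and the edge direction (from the phrase being described to the phrase it depends on) are all illustrative assumptions, not taken from the paper.

```python
from dataclasses import dataclass, field

@dataclass
class LanguageGraph:
    """Hypothetical container: nodes are noun phrases, edges are linguistic relations."""
    nodes: dict = field(default_factory=dict)  # node id -> noun phrase
    edges: list = field(default_factory=list)  # (source id, relation, target id)

    def add_node(self, node_id, phrase):
        self.nodes[node_id] = phrase

    def add_edge(self, src, relation, dst):
        self.edges.append((src, relation, dst))

    def out_edges(self, node_id):
        """Relations leaving a node, as (relation, target id) pairs."""
        return [(rel, dst) for src, rel, dst in self.edges if src == node_id]

# Made-up expression: "the man holding a cup on the left of the dog"
g = LanguageGraph()
g.add_node("man", "the man")
g.add_node("cup", "a cup")
g.add_node("dog", "the dog")
g.add_edge("man", "holding", "cup")
g.add_edge("man", "left of", "dog")

print(g.out_edges("man"))
```

In this toy graph, "man" is the referent and "cup" and "dog" are leaves, so bottom-up reasoning would handle the leaves before the referent.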

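(2) Stepwise Inference:

Reasoning proceeds from the bottom up.

First, each node is fused with the image to obtain X.

Next, reasoning is carried out step by step over the relations between nodes (i.e., the edges), in traversal order, shifting each node's initial spatial location in the image to the correct location.

Likewise, suppose node o_n of the graph is the one being processed at the current step. Relational reasoning is first performed on each edge of the graph separately via PRS, and the results for node o_n over all of its connected edges are then integrated by average pooling.

Integrating the edges: for a node with initial feature map X_n and connected edges E_n, the updated feature map X_n′ is computed as follows:

PRS(3) denotes three iterations of PRS.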
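The control flow described above (bottom-up traversal, three PRS iterations per edge, average pooling over a node's connected edges) can be sketched as follows. This is only a sketch under stated assumptions: `prs_placeholder` is a made-up stand-in that blends two maps and is not the paper's PRS operator, feature maps are flattened to short lists, and the graph is given as a hypothetical `children` dictionary.

```python
def prs_placeholder(x_node, x_neighbor):
    """Stand-in for one PRS step (NOT the paper's operator): blend the two maps."""
    return [0.5 * a + 0.5 * b for a, b in zip(x_node, x_neighbor)]

def bottom_up_order(children):
    """Post-order traversal: a node appears only after every node it points to."""
    order, seen = [], set()

    def visit(n):
        if n in seen:
            return
        seen.add(n)
        for c in children.get(n, []):
            visit(c)
        order.append(n)

    for n in children:
        visit(n)
    return order

def stepwise_inference(children, X, num_iters=3):
    """Update each node's map from its connected edges, leaves first."""
    X = dict(X)  # do not modify the caller's maps
    for n in bottom_up_order(children):
        neighbors = children.get(n, [])
        if not neighbors:
            continue  # leaf node: keep its initial map
        per_edge = []
        for m in neighbors:
            x = X[n]
            for _ in range(num_iters):  # "PRS(3)": three iterations per edge
                x = prs_placeholder(x, X[m])
            per_edge.append(x)
        # Average pooling over the results of all connected edges.
        X[n] = [sum(vals) / len(per_edge) for vals in zip(*per_edge)]
    return X

children = {"man": ["cup", "dog"], "cup": [], "dog": []}
X = {
    "man": [1.0, 0.0, 0.0, 0.0],
    "cup": [0.0, 1.0, 0.0, 0.0],
    "dog": [0.0, 0.0, 1.0, 0.0],
}
updated = stepwise_inference(children, X)
print(updated["man"])  # [0.125, 0.4375, 0.4375, 0.0]
```

With the placeholder operator, the response of "man" moves away from its initial position toward the positions of "cup" and "dog", which mimics the idea of shifting a node's spatial location according to its relations.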

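(3) Pairwise Relational Shift

Bidirectional Attentive Refinement:

The outputs x4 and x5 of the previous module are merged with the shallow encoder features v2, v3 and v4 described earlier, using a top-down strategy.

Because shallow layers contain detailed information about the whole image, they may introduce irrelevant noise, so a self-attention mechanism is used.

Finally, the results are upsampled and summed.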
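The top-down merge with upsampling and addition can be illustrated with a small sketch on 2-D lists. Several things here are assumptions for illustration only: each level is taken to be exactly twice the resolution of the previous one, upsampling is nearest-neighbour, and the optional sigmoid gate is a simple stand-in for the attention that suppresses shallow-layer noise, not the module from the paper.

```python
import math

def upsample2x(fmap):
    """Nearest-neighbour 2x upsampling of a 2-D map given as a list of rows."""
    out = []
    for row in fmap:
        wide = [v for v in row for _ in range(2)]  # repeat each column twice
        out.append(wide)
        out.append(list(wide))                     # repeat each row twice
    return out

def add_maps(a, b):
    """Element-wise sum of two maps with the same shape."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def top_down_merge(pyramid, gated=False):
    """Merge maps ordered from deepest (smallest) to shallowest (largest)."""
    merged = pyramid[0]
    for shallow in pyramid[1:]:
        up = upsample2x(merged)
        if gated:
            # Assumed gate: the deep map decides how much shallow detail passes through.
            shallow = [[s * sigmoid(u) for s, u in zip(rs, ru)]
                       for rs, ru in zip(shallow, up)]
        merged = add_maps(up, shallow)
    return merged

deep = [[1.0]]                       # coarse, semantically strong map
shallow = [[1.0, 2.0], [3.0, 4.0]]   # finer, more detailed map
print(top_down_merge([deep, shallow]))  # [[2.0, 3.0], [4.0, 5.0]]
```

Passing `gated=True` scales the shallow map before the sum, which is one simple way to keep coarse semantics dominant while still adding fine detail.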
