A CHALLENGE MULTIMODAL DATASET FOR NEURAL RELATIONEXTRACTION WITH VISUAL EVIDENCE IN SOCIAL MEDIA

最新推荐文章于 2024-09-29 08:32:14 发布

辉辉小学生

最新推荐文章于 2024-09-29 08:32:14 发布

阅读量336

点赞数

分类专栏：多模态paper 文章标签：大数据人工智能多模态

版权声明：本文为博主原创文章，遵循 CC 4.0 BY-SA 版权协议，转载请附上原文出处链接和本声明。

本文链接：https://blog.csdn.net/huihuixiaoxue/article/details/125844477

版权

多模态paper 专栏收录该内容

10 篇文章 3 订阅

订阅专栏

ABSTRACT

problem: Extracting relations in social media posts is challenging when sentences lack of contexts.

solution: images related to these sentences can supplement such missing contexts and help to

identify relations precisely.

1. INTRODUCTION

introduce a new task called multimodal relation extraction and a human-annotated multimodal dataset

propose several multimodal baselines

provide an in-depth and thorough analysis for different cases

2. RELATED WORK

2.1. Relation Extraction in Social Media

2.2. Multimodal Dataset

there are fewer datasets focusing on text-intensive tasks

3. MNRE DATASET

3.1. Dataset Collection

sources: two available multimodal named entity recognition datasets - Twitter15 and Twitter17 and crawling data from Twitter.

3.2. Twitter Name Tagging

3.3. Human Annotation

3.4. Dataset Statistics

3.5. Case Analysis

4. EXPERIMENTAL RESULT AND ANALYSIS

4.1. Problem Defifinition

function to predict relations: F : ( e 1 , e 2 , S , V ) → Y

e1,e2: pre-extracted named entities

S = ( w 1 , w 2 , ..., wn): a given sentence(marked entities e 1, e 2)

V = ( v 1 , v 2 , ..., v n): visual contents

Y: the corresponding rela tion tag

4.2. Baselines of Relation Extraction

choose models from three aspects: CNN-based method;pre -trained language model based method;distantly super vised method.

Glove+CNN：classic CNN-based model for re

BertNRE: pre-trained language model

Bert+CNN: ablation model( to demon strate that the image features are more adaptive to CNN-based methods )

PCNN: distantly supervised re model(also CNN-based)

4.3. Visual Feature Extraction

three methods to incorporate visual information:

Image Labels Visual Objects Visual Attention

4.4. General Results

4.5. Error Analysis

5. CONCLUSION

辉辉小学生

关注

0
点赞
踩
1

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫

专栏目录

辉辉小学生 CSDN认证博客专家 CSDN认证企业博客

码龄4年

129: 原创

34万+: 周排名

202万+: 总排名

6万+: 访问

: 等级

1306: 积分

5: 粉丝

12: 获赞

9: 评论

34: 收藏

私信

关注

热门文章

分类专栏

最新评论

Multimodal Relation Extraction with Efficient Graph Alignment
liutian1111: 大佬你好，请问源码调通了没求分享
Multimodal Relation Extraction with Efficient Graph Alignment
辉辉小学生: 可以，加微15927433611
Multimodal Relation Extraction with Efficient Graph Alignment
发霉的馍馍: 大佬，我找不到这个论文的pdf版本，能否提供一下啊，万分感谢
每日论文阅读 2022-11-11
CSDN-Ada助手: 你好，CSDN 开始提供 #论文阅读# 的列表服务了。请看：https://blog.csdn.net/nav/advanced-technology/paper-reading 。如果你有更多需求，请来这里 https://gitcode.net/csdn/csdn-tags/-/issues/34 给我们提。
今日论文阅读2022-11-10
CSDN-Ada助手: 你好，CSDN 开始提供 #论文阅读# 的列表服务了。请看：https://blog.csdn.net/nav/advanced-technology/paper-reading 。如果你有更多需求，请来这里 https://gitcode.net/csdn/csdn-tags/-/issues/34 给我们提。

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。