Localizing moments in video with natural language

一. 基本信息

标题:Localizing moments in video with natural language

时间:2017

出版源:ICCV

领域分类:video retrieval

二. 研究背景

问题定义:effectively localizing natural language queries in videos,given a video and text description, we identify start and end points in the video which correspond to the given text description.
 
难点:
1. current video datasets do not include pairs of localized video segments and referring expressions.
2. require both language and video understanding 

相关工作:

三. 创新方法

1. propose the Moment Context Network (MCN) which relies on local and global video features.
2. collect the DistinctDescribable Moments (DiDeMo) dataset which consists of over 40,000 pairs of referring descriptions and localized moments in unedited videos.

在这里插入图片描述
四. 实验

    dataset:Distinct Describable Moments (DiDeMo) dataset(新提出)

    evaluation index :.Rank@1,Rank@5,mIoU

    baseline comparsion:

在这里插入图片描述

五. 结论

作者的总结:introduce the task of localizing moments in video with natural language

自己的评价:modeling complex (temporal) sentence structure and add some complex language model to improve the accuracy.
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值