深度学习之视频语音+视频摘要+视频显示检测+视频理解--附带源码和作者主页

WJ_MeiMei

于 2018-11-22 18:56:52 发布

阅读量7.2k

点赞数 4

分类专栏：深度学习论文源码

深度学习论文同时被 2 个专栏收录

8 篇文章

订阅专栏

7 篇文章

订阅专栏

视频语音

Vid2speech: Speech Reconstruction from Silent Video

intro: ICASSP 2017
project page: http://www.vision.huji.ac.il/vid2speech/
arxiv: https://arxiv.org/abs/1701.00495
github(official): https://github.com/arielephrat/vid2speech

视频摘要

Video summarization produces a short summary of a full-length video and ideally encapsulates its most informative parts, alleviates the problem of video browsing, editing and indexing.

Video Summarization with Long Short-term Memory

arxiv: http://arxiv.org/abs/1605.08110

DeepVideo: Video Summarization using Temporal Sequence Modelling

intro: CS231n student project report
paper: http://cs231n.stanford.edu/reports2016/216_Report.pdf

Semantic Video Trailers

arxiv: http://arxiv.org/abs/1609.01819

Video Summarization using Deep Semantic Features

inro: ACCV 2016
arxiv: http://arxiv.org/abs/1609.08758

CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization

intro: International Conference on new Trends in Computer Sciences (ICTCS), Amman-Jordan, 2017
arxiv: https://arxiv.org/abs/1708.07023

Video Summarization with Attention-Based Encoder-Decoder Networks

https://arxiv.org/abs/1708.09545

Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness Reward

intro: AAAI 2018. Chinese Academy of Sciences & Queen Mary University of London
project page: https://kaiyangzhou.github.io/project_vsumm_reinforce/index.html
arxiv: https://arxiv.org/abs/1801.00054
github: https://github.com//KaiyangZhou/vsumm-reinforce

Viewpoint-aware Video Summarization

intro: CVPR 2018
arxiv: https://arxiv.org/abs/1804.02843

DTR-GAN: Dilated Temporal Relational Adversarial Network for Video Summarization

https://arxiv.org/abs/1804.11228

Learning Video Summarization Using Unpaired Data

https://arxiv.org/abs/1805.12174

Video Summarization Using Fully Convolutional Sequence Networks

https://arxiv.org/abs/1805.10538

Video Summarisation by Classification with Deep Reinforcement Learning

intro: BMVC 2018
arxiv: https://arxiv.org/abs/1807.03089

Query-Conditioned Three-Player Adversarial Network for Video Summarization

intro: BMVC 2018
arxiv: https://arxiv.org/abs/1807.06677

视频突出显示检测

Unsupervised Extraction of Video Highlights Via Robust Recurrent Auto-encoders

intro: ICCV 2015
intro: rely on an assumption that highlights of an event category are more frequently captured in short videos than non-highlights
arxiv: http://arxiv.org/abs/1510.01442

Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization

keywords: wearable device
paper: http://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Yao_Highlight_Detection_With_CVPR_2016_paper.pdf
paper: http://research.microsoft.com/apps/pubs/default.aspx?id=264919

Using Deep Learning to Find Basketball Highlights

blog: http://public.hudl.com/bits/archives/2015/06/05/highlights/?utm_source=tuicool&utm_medium=referral

Real-Time Video Highlights for Yahoo Esports

arxiv: https://arxiv.org/abs/1611.08780

A Deep Ranking Model for Spatio-Temporal Highlight Detection from a 360 Video

intro: AAAI 2018
arxiv: https://arxiv.org/abs/1801.10312

PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation

intro: Nanyang Technological University & Google Research, Zurich
keywords: personalized highlight detection (PHD)
arxiv: https://arxiv.org/abs/1804.06604

视频理解

Scale Up Video Understandingwith Deep Learning

intro: 2016, Tsinghua University
slides: iiis.tsinghua.edu.cn/~jianli/courses/ATCS2016spring/talk_chuang.pptx

Slicing Convolutional Neural Network for Crowd Video Understanding

intro: CVPR 2016
intro: It aims at learning generic spatio-temporal features from crowd videos, especially for long-term temporal learning
project page: http://www.ee.cuhk.edu.hk/~jshao/SCNN.html
paper: http://www.ee.cuhk.edu.hk/~jshao/papers_jshao/jshao_cvpr16_scnn.pdf
github: https://github.com/amandajshao/Slicing-CNN

Rethinking Spatiotemporal Feature Learning For Video Understanding

https://arxiv.org/abs/1712.04851

Hierarchical Video Understanding

https://arxiv.org/abs/1809.03316

博客等级

码龄9年

46
原创

241
点赞

1057
收藏

141
粉丝

关注

私信

热门文章

分类专栏

生活 1篇
bug 21篇
环境配置 13篇
测试 7篇
深度学习论文 8篇
源码 7篇
快捷键 2篇
常识 16篇
经验 12篇
work 2篇

最新评论

pytorch查看通道数维数尺寸大小
剧中人: 怎么查看训练集标签的名称呢？
pytorch和torch框架对比（区别联系）
平平淡淡普普通通的一人: 想问博主为啥换赛道了呀，换成什么方向了？
pytorch和torch框架对比（区别联系）
WJ_MeiMei: 理论是可行的，解决办法不记得了，换赛道了，几年没搞深度学习了
pytorch和torch框架对比（区别联系）
JIENANYA: 博主，想请问一下在pycharm中使用torch框架是否可行？因为我在安装pytorch框架时跟着一些博主分享的步骤、命令走之后只成功得到了torch框架（而不是我想要的pytorch）。我也不太懂是哪个环节出了问题当我在Pycharm中使用了torch方法时： [code=python] import torch print(torch.__version__) print(torch.version.cuda) print(torch.backends.cudnn.version()) print(torch.cuda.is_available()) # cuda是否可用； print(torch.cuda.device_count()) # 返回gpu数量； print(torch.cuda.get_device_name(0)) # 返回gpu名字，设备索引默认从0开始； print(torch.cuda.current_device()) # 返回当前设备索引 [/code] 这串代码，都能运行成功。但是看到您这篇文章写的说是pytorch、torch两者编程语言、依赖库、模型和中间变量关系不同。所以我想问在pycharm中调用torch库会不会对普通（普遍的）项目的运行上有很多限制。（比如：联邦学习）

大家在看

最新文章

目录

评论

被折叠的条评论为什么被折叠?

到【灌水乐园】发言

查看更多评论

添加红包

成就一亿技术人!

hope_wisdom

发出的红包

实付元

使用余额支付

点击重新获取

扫码支付

钱包余额 0

抵扣说明：

1.余额是钱包充值的虚拟货币，按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载，可以购买VIP、付费专栏及课程。