ECCV2018比较有意思的paper

Double JPEG Detection in Mixed JPEG Quality Factors using Deep Convolutional Neural Network
Fighting Fake News: Image Splice Detection via Learned Self-Consistency
Face De-Spoofing: Anti-Spoofing via Noise Modeling
Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Visual Text Correction
Cross-Modal Hamming Hashing
Visual Question Answering as a Meta Learning Task
Unsupervised Hard Example Mining from Videos for Improved Object Detection
Less is More: Picking Informative Frames for Video Captioning
Cross-Modal and Hierarchical Modeling of Video and Text
Show, Tell and Discriminate: Image Captioning by Self-retrieval with Partially Labeled Data
Triplet Loss in Siamese Network for Object Tracking
Objects that Sound
Question-Guided Hybrid Convolution for Visual Question Answering
Unpaired Image Captioning by Language Pivoting
Goal-Oriented Visual Question Generation via Intermediate Rewards
An Adversarial Approach to Hard Triplet Generation
The Sound of Pixels
Rethinking the Form of Latent States in Image Captioning
Move Forward and Tell: A Progressive Generator of Video Descriptions
Attention-aware Deep Adversarial Hashing for Cross-Modal Retrieval
Deep Cross-Modal Projection Learning for Image-Text Matching
Multimodal Dual Attention Memory for Video Story Question Answering
VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions
Broadcasting Convolutional Network for Visual Relational Reasoning
Deep Attention Neural Tensor Network for Visual Question Answering
Women also Snowboard: Overcoming Bias in Captioning Models
Audio-Visual Event Localization in Unconstrained Videos
Grounding Visual Explanations
Conditional Image-Text Embedding Networks
Stacked Cross Attention for Image-Text Matching
Learning Visual Question Answering by Bootstrapping Hard Attention
Multi-modal Cycle-consistent Generalized Zero-Shot Learning
ForestHash: Semantic Hashing With Shallow Random Forests and Tiny Convolutional Networks
Constraint-Aware Deep Neural Network Compression
Recurrent Fusion Network for Image captioning
Correcting the Triplet Selection Bias for Triplet Loss
Textual Explanations for Self-Driving Vehicles
Exploring Visual Relationship for Image Captioning
Single Shot Scene Text Retrieval

  • 1
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值