【CVPR2020】计算机视觉与模式识别会议论文完全清单_Part2

378 篇文章 74 订阅
285 篇文章 55 订阅

CVPR 2020 的论文已经于6月10号放出
由于篇幅限制分两部分发出—>part1

Todo:word-cloud


Relative Interior Rule in Block-Coordinate Descent
Author: Tomas Werner,Daniel Prusa,Tomas Dlask


Learning Combinatorial Solver for Graph Matching
Author: Tao Wang,He Liu,Yidong Li,Yi Jin,Xiaohui Hou,Haibin Ling


SampleNet: Differentiable Point Cloud Sampling
Author: Itai Lang,Asaf Manor,Shai Avidan


Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning?
Author: Safa Messaoud,Maghav Kumar,Alexander G. Schwing


Quasi-Newton Solver for Robust Non-Rigid Registration
Author: Yuxin Yao,Bailin Deng,Weiwei Xu,Juyong Zhang


Rethinking Class-Balanced Methods for Long-Tailed Visual Recognition From a Domain Adaptation Perspective
Author: Muhammad Abdullah Jamal,Matthew Brown,Ming-Hsuan Yang,Liqiang Wang,Boqing Gong


Optimizing Rank-Based Metrics With Blackbox Differentiation
Author: Michal Rolinek,Vit Musil,Anselm Paulus,Marin Vlastelica,Claudio Michaelis,Georg Martius


DualSDF: Semantic Shape Manipulation Using a Two-Level Representation
Author: Zekun Hao,Hadar Averbuch-Elor,Noah Snavely,Serge Belongie


Dynamic Hierarchical Mimicking Towards Consistent Optimization Objectives
Author: Duo Li,Qifeng Chen


Deep Homography Estimation for Dynamic Scenes
Author: Hoang Le,Feng Liu,Shu Zhang,Aseem Agarwala


PF-Net: Point Fractal Network for 3D Point Cloud Completion
Author: Zitian Huang,Yikuan Yu,Jiawen Xu,Feng Ni,Xinyi Le


On the Regularization Properties of Structured Dropout
Author: Ambar Pal,Connor Lane,Rene Vidal,Benjamin D. Haeffele


Learning Oracle Attention for High-Fidelity Face Completion
Author: Tong Zhou,Changxing Ding,Shaowen Lin,Xinchao Wang,Dacheng Tao


Deep Image Spatial Transformation for Person Image Generation
Author: Yurui Ren,Xiaoming Yu,Junming Chen,Thomas H. Li,Ge Li


Learning to Optimize on SPD Manifolds
Author: Zhi Gao,Yuwei Wu,Yunde Jia,Mehrtash Harandi


Deep 3D Portrait From a Single Image
Author: Sicheng Xu,Jiaolong Yang,Dong Chen,Fang Wen,Yu Deng,Yunde Jia,Xin Tong


RDCFace: Radial Distortion Correction for Face Recognition
Author: He Zhao,Xianghua Ying,Yongjie Shi,Xin Tong,Jingsi Wen,Hongbin Zha


Global-Local GCN: Large-Scale Label Noise Cleansing for Face Recognition
Author: Yaobin Zhang,Weihong Deng,Mei Wang,Jiani Hu,Xian Li,Dongyue Zhao,Dongchao Wen


MISC: Multi-Condition Injection and Spatially-Adaptive Compositing for Conditional Person Image Synthesis
Author: Shuchen Weng,Wenbo Li,Dawei Li,Hongxia Jin,Boxin Shi


SAINT: Spatially Aware Interpolation NeTwork for Medical Slice Synthesis
Author: Cheng Peng,Wei-An Lin,Haofu Liao,Rama Chellappa,S. Kevin Zhou


Recurrent Feature Reasoning for Image Inpainting
Author: Jingyuan Li,Ning Wang,Lefei Zhang,Bo Du,Dacheng Tao


Structure-Preserving Super Resolution With Gradient Guidance
Author: Cheng Ma,Yongming Rao,Yean Cheng,Ce Chen,Jiwen Lu,Jie Zhou


Epipolar Transformers
Author: Yihui He,Rui Yan,Katerina Fragkiadaki,Shoou-I Yu


Diversified Arbitrary Style Transfer via Deep Feature Perturbation
Author: Zhizhong Wang,Lei Zhao,Haibo Chen,Lihong Qiu,Qihang Mo,Sihuan Lin,Wei Xing,Dongming Lu


MSG-GAN: Multi-Scale Gradients for Generative Adversarial Networks
Author: Animesh Karnewar,Oliver Wang


Overcoming Multi-Model Forgetting in One-Shot NAS With Diversity Maximization
Author: Miao Zhang,Huiqi Li,Shirui Pan,Xiaojun Chang,Steven Su


Select to Better Learn: Fast and Accurate Deep Learning Using Data Selection From Nonlinear Manifolds
Author: Mohsen Joneidi,Saeed Vahidian,Ashkan Esmaeili,Weijia Wang,Nazanin Rahnavard,Bill Lin,Mubarak Shah


Neural Point Cloud Rendering via Multi-Plane Projection
Author: Peng Dai,Yinda Zhang,Zhuwen Li,Shuaicheng Liu,Bing Zeng


Wish You Were Here: Context-Aware Human Generation
Author: Oran Gafni,Lior Wolf


Towards Photo-Realistic Virtual Try-On by Adaptively Generating-Preserving Image Content
Author: Han Yang,Ruimao Zhang,Xiaobao Guo,Wei Liu,Wangmeng Zuo,Ping Luo


Breaking the Cycle - Colleagues Are All You Need
Author: Ori Nizan,Ayellet Tal


Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation
Author: Hao Tang,Dan Xu,Yan Yan,Philip H.S. Torr,Nicu Sebe


ManiGAN: Text-Guided Image Manipulation
Author: Bowen Li,Xiaojuan Qi,Thomas Lukasiewicz,Philip H.S. Torr


Watch Your Up-Convolution: CNN Based Generative Deep Neural Networks Are Failing to Reproduce Spectral Distributions
Author: Ricard Durall,Margret Keuper,Janis Keuper


Belief Propagation Reloaded: Learning BP-Layers for Labeling Problems
Author: Patrick Knobelreiter,Christian Sormann,Alexander Shekhovtsov,Friedrich Fraundorfer,Thomas Pock


Barycenters of Natural Images Constrained Wasserstein Barycenters for Image Morphing
Author: Dror Simon,Aviad Aberdam


Guided Variational Autoencoder for Disentanglement Learning
Author: Zheng Ding,Yifan Xu,Weijian Xu,Gaurav Parmar,Yang Yang,Max Welling,Zhuowen Tu


Cross-Spectral Face Hallucination via Disentangling Independent Factors
Author: Boyan Duan,Chaoyou Fu,Yi Li,Xingguang Song,Ran He


Learned Image Compression With Discretized Gaussian Mixture Likelihoods and Attention Modules
Author: Zhengxue Cheng,Heming Sun,Masaru Takeuchi,Jiro Katto


C-Flow: Conditional Generative Flow Models for Images and 3D Point Clouds
Author: Albert Pumarola,Stefan Popov,Francesc Moreno-Noguer,Vittorio Ferrari


Cogradient Descent for Bilinear Optimization
Author: Li’an Zhuo,Baochang Zhang,Linlin Yang,Hanlin Chen,Qixiang Ye,David Doermann,Rongrong Ji,Guodong Guo


Instance-Aware Image Colorization
Author: Jheng-Wei Su,Hung-Kuo Chu,Jia-Bin Huang


Joint Training of Variational Auto-Encoder and Latent Energy-Based Model
Author: Tian Han,Erik Nijkamp,Linqi Zhou,Bo Pang,Song-Chun Zhu,Ying Nian Wu


Adaptive Loss-Aware Quantization for Multi-Bit Networks
Author: Zhongnan Qu,Zimu Zhou,Yun Cheng,Lothar Thiele


ScopeFlow: Dynamic Scene Scoping for Optical Flow
Author: Aviram Bar-Haim,Lior Wolf


Video Super-Resolution With Temporal Group Attention
Author: Takashi Isobe,Songjiang Li,Xu Jia,Shanxin Yuan,Gregory Slabaugh,Chunjing Xu,Ya-Li Li,Shengjin Wang,Qi Tian


Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression
Author: Yawei Li,Shuhang Gu,Christoph Mayer,Luc Van Gool,Radu Timofte


3D Photography Using Context-Aware Layered Depth Inpainting
Author: Meng-Li Shih,Shih-Yang Su,Johannes Kopf,Jia-Bin Huang


MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation
Author: Yuheng Li,Krishna Kumar Singh,Utkarsh Ojha,Yong Jae Lee


Low-Rank Compression of Neural Nets: Learning the Rank of Each Layer
Author: Yerlan Idelbayev,Miguel A. Carreira-Perpinan


Global Texture Enhancement for Fake Face Detection in the Wild
Author: Zhengzhe Liu,Xiaojuan Qi,Philip H.S. Torr


Panoptic-Based Image Synthesis
Author: Aysegul Dundar,Karan Sapra,Guilin Liu,Andrew Tao,Bryan Catanzaro


Lighthouse: Predicting Lighting Volumes for Spatially-Coherent Illumination
Author: Pratul P. Srinivasan,Ben Mildenhall,Matthew Tancik,Jonathan T. Barron,Richard Tucker,Noah Snavely


Learning to Cartoonize Using White-Box Cartoon Representations
Author: Xinrui Wang,Jinze Yu


End-to-End Learnable Geometric Vision by Backpropagating PnP Optimization
Author: Bo Chen,Alvaro Parra,Jiewei Cao,Nan Li,Tat-Jun Chin


Analyzing and Improving the Image Quality of StyleGAN
Author: Tero Karras,Samuli Laine,Miika Aittala,Janne Hellsten,Jaakko Lehtinen,Timo Aila


Fashion Editing With Adversarial Parsing Learning
Author: Haoye Dong,Xiaodan Liang,Yixuan Zhang,Xujie Zhang,Xiaohui Shen,Zhenyu Xie,Bowen Wu,Jian Yin


Augment Your Batch: Improving Generalization Through Instance Repetition
Author: Elad Hoffer,Tal Ben-Nun,Itay Hubara,Niv Giladi,Torsten Hoefler,Daniel Soudry


ARShadowGAN: Shadow Generative Adversarial Network for Augmented Reality in Single Light Scenes
Author: Daquan Liu,Chengjiang Long,Hongpan Zhang,Hanning Yu,Xinzhi Dong,Chunxia Xiao


An End-to-End Edge Aggregation Network for Moving Object Segmentation
Author: Prashant W. Patil,Kuldeep M. Biradar,Akshay Dudhane,Subrahmanyam Murala


Learning Video Stabilization Using Optical Flow
Author: Jiyang Yu,Ravi Ramamoorthi


Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation
Author: Runfa Chen,Wenbing Huang,Binghui Huang,Fuchun Sun,Bin Fang


Robust Design of Deep Neural Networks Against Adversarial Attacks Based on Lyapunov Theory
Author: Arash Rahnama,Andre T. Nguyen,Edward Raff


StarGAN v2: Diverse Image Synthesis for Multiple Domains
Author: Yunjey Choi,Youngjung Uh,Jaejun Yoo,Jung-Woo Ha


Warping Residual Based Image Stitching for Large Parallax
Author: Kyu-Yul Lee,Jae-Young Sim


A U-Net Based Discriminator for Generative Adversarial Networks
Author: Edgar Schonfeld,Bernt Schiele,Anna Khoreva


Unpaired Portrait Drawing Generation via Asymmetric Cycle Mapping
Author: Ran Yi,Yong-Jin Liu,Yu-Kun Lai,Paul L. Rosin


When to Use Convolutional Neural Networks for Inverse Problems
Author: Nathaniel Chodosh,Simon Lucey


LUVLi Face Alignment: Estimating Landmarks’ Location, Uncertainty, and Visibility Likelihood
Author: Abhinav Kumar,Tim K. Marks,Wenxuan Mou,Ye Wang,Michael Jones,Anoop Cherian,Toshiaki Koike-Akino,Xiaoming Liu,Chen Feng


Affinity Graph Supervision for Visual Recognition
Author: Chu Wang,Babak Samari,Vladimir G. Kim,Siddhartha Chaudhuri,Kaleem Siddiqi


Unsupervised Magnification of Posture Deviations Across Subjects
Author: Michael Dorkenwald,Uta Buchler,Bjorn Ommer


Accurate Estimation of Body Height From a Single Depth Image via a Four-Stage Developing Network
Author: Fukun Yin,Shizhe Zhou


Fast Soft Color Segmentation
Author: Naofumi Akimoto,Huachun Zhu,Yanghua Jin,Yoshimitsu Aoki


Global Optimality for Point Set Registration Using Semidefinite Programming
Author: Jose Pedro Iglesias,Carl Olsson,Fredrik Kahl


Image2StyleGAN++: How to Edit the Embedded Images?
Author: Rameen Abdal,Yipeng Qin,Peter Wonka


SQE: a Self Quality Evaluation Metric for Parameters Optimization in Multi-Object Tracking
Author: Yanru Huang,Feiyu Zhu,Zheni Zeng,Xi Qiu,Yuan Shen,Jianan Wu


EventSR: From Asynchronous Events to Image Reconstruction, Restoration, and Super-Resolution via End-to-End Adversarial Learning
Author: Lin Wang,Tae-Kyun Kim,Kuk-Jin Yoon


Hierarchical Pyramid Diverse Attention Networks for Face Recognition
Author: Qiangchang Wang,Tianyi Wu,He Zheng,Guodong Guo


RGBD-Dog: Predicting Canine Pose from RGBD Sensors
Author: Sinead Kearney,Wenbin Li,Martin Parsons,Kwang In Kim,Darren Cosker


Multi-Scale Progressive Fusion Network for Single Image Deraining
Author: Kui Jiang,Zhongyuan Wang,Peng Yi,Chen Chen,Baojin Huang,Yimin Luo,Jiayi Ma,Junjun Jiang


Learning a Neural 3D Texture Space From 2D Exemplars
Author: Philipp Henzler,Niloy J. Mitra,Tobias Ritschel


BachGAN: High-Resolution Image Synthesis From Salient Object Layout
Author: Yandong Li,Yu Cheng,Zhe Gan,Licheng Yu,Liqiang Wang,Jingjing Liu


Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New Strategy
Author: Jaejun Yoo,Namhyuk Ahn,Kyung-Ah Sohn


On Positive-Unlabeled Classification in GAN
Author: Tianyu Guo,Chang Xu,Jiajun Huang,Yunhe Wang,Boxin Shi,Chao Xu,Dacheng Tao


DoveNet: Deep Image Harmonization via Domain Verification
Author: Wenyan Cong,Jianfu Zhang,Li Niu,Liu Liu,Zhixin Ling,Weiyuan Li,Liqing Zhang


Noise Robust Generative Adversarial Networks
Author: Takuhiro Kaneko,Tatsuya Harada


Normalizing Flows With Multi-Scale Autoregressive Priors
Author: Apratim Bhattacharyya,Shweta Mahajan,Mario Fritz,Bernt Schiele,Stefan Roth


Robust Reference-Based Super-Resolution With Similarity-Aware Deformable Convolution
Author: Gyumin Shim,Jinsun Park,In So Kweon


Painting Many Pasts: Synthesizing Time Lapse Videos of Paintings
Author: Amy Zhao,Guha Balakrishnan,Kathleen M. Lewis,Fredo Durand,John V. Guttag,Adrian V. Dalca


GeoDA: A Geometric Framework for Black-Box Adversarial Attacks
Author: Ali Rahmati,Seyed-Mohsen Moosavi-Dezfooli,Pascal Frossard,Huaiyu Dai


GAMIN: Generative Adversarial Multiple Imputation Network for Highly Missing Data
Author: Seongwook Yoon,Sanghoon Sull


An Internal Covariate Shift Bounding Algorithm for Deep Neural Networks by Unitizing Layers’ Outputs
Author: You Huang,Yuanlong Yu


A Unified Optimization Framework for Low-Rank Inducing Penalties
Author: Marcus Valtonen Ornhag,Carl Olsson


Single-Side Domain Generalization for Face Anti-Spoofing
Author: Yunpei Jia,Jie Zhang,Shiguang Shan,Xilin Chen


The Knowledge Within: Methods for Data-Free Model Compression
Author: Matan Haroush,Itay Hubara,Elad Hoffer,Daniel Soudry


Scale-Space Flow for End-to-End Optimized Video Compression
Author: Eirikur Agustsson,David Minnen,Nick Johnston,Johannes Balle,Sung Jin Hwang,George Toderici


Dynamic Neural Relational Inference
Author: Colin Graber,Alexander G. Schwing


Real-Time Panoptic Segmentation From Dense Detections
Author: Rui Hou,Jie Li,Arjun Bhargava,Allan Raventos,Vitor Guizilini,Chao Fang,Jerome Lynch,Adrien Gaidon


Deep Snake for Real-Time Instance Segmentation
Author: Sida Peng,Wen Jiang,Huaijin Pi,Xiuli Li,Hujun Bao,Xiaowei Zhou


AdaCoSeg: Adaptive Shape Co-Segmentation With Group Consistency Loss
Author: Chenyang Zhu,Kai Xu,Siddhartha Chaudhuri,Li Yi,Leonidas J. Guibas,Hao Zhang


Learning Dynamic Routing for Semantic Segmentation
Author: Yanwei Li,Lin Song,Yukang Chen,Zeming Li,Xiangyu Zhang,Xingang Wang,Jian Sun


Boosting Semantic Human Matting With Coarse Annotations
Author: Jinlin Liu,Yuan Yao,Wendi Hou,Miaomiao Cui,Xuansong Xie,Changshui Zhang,Xian-Sheng Hua


BlendMask: Top-Down Meets Bottom-Up for Instance Segmentation
Author: Hao Chen,Kunyang Sun,Zhi Tian,Chunhua Shen,Yongming Huang,Youliang Yan


UC-Net: Uncertainty Inspired RGB-D Saliency Detection via Conditional Variational Autoencoders
Author: Jing Zhang,Deng-Ping Fan,Yuchao Dai,Saeed Anwar,Fatemeh Sadat Saleh,Tong Zhang,Nick Barnes


Deep Geometric Functional Maps: Robust Feature Learning for Shape Correspondence
Author: Nicolas Donati,Abhishek Sharma,Maks Ovsjanikov


Deep Polarization Cues for Transparent Object Segmentation
Author: Agastya Kalra,Vage Taamazyan,Supreeth Krishna Rao,Kartik Venkataraman,Ramesh Raskar,Achuta Kadambi


DualConvMesh-Net: Joint Geodesic and Euclidean Convolutions on 3D Meshes
Author: Jonas Schult,Francis Engelmann,Theodora Kontogianni,Bastian Leibe


F-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation
Author: Konstantin Sofiiuk,Ilia Petrov,Olga Barinova,Anton Konushin


Approximating shapes in images with low-complexity polygons
Author: Muxingzi Li,Florent Lafarge,Renaud Marlet


Towards Visually Explaining Variational Autoencoders
Author: Wenqian Liu,Runze Li,Meng Zheng,Srikrishna Karanam,Ziyan Wu,Bir Bhanu,Richard J. Radke,Octavia Camps


Towards Global Explanations of Convolutional Neural Networks With Concept Attribution
Author: Weibin Wu,Yuxin Su,Xixian Chen,Shenglin Zhao,Irwin King,Michael R. Lyu,Yu-Wing Tai


Interpretable and Accurate Fine-grained Recognition via Region Grouping
Author: Zixuan Huang,Yin Li


SAM: The Sensitivity of Attribution Methods to Hyperparameters
Author: Naman Bansal,Chirag Agarwal,Anh Nguyen


High-Frequency Component Helps Explain the Generalization of Convolutional Neural Networks
Author: Haohan Wang,Xindi Wu,Zeyi Huang,Eric P. Xing


CNN-Generated Images Are Surprisingly Easy to Spot… for Now
Author: Sheng-Yu Wang,Oliver Wang,Richard Zhang,Andrew Owens,Alexei A. Efros


FALCON: A Fourier Transform Based Approach for Fast and Secure Convolutional Neural Network Predictions
Author: Shaohua Li,Kaiping Xue,Bin Zhu,Chenkai Ding,Xindi Gao,David Wei,Tao Wan


Dreaming to Distill: Data-Free Knowledge Transfer via DeepInversion
Author: Hongxu Yin,Pavlo Molchanov,Jose M. Alvarez,Zhizhong Li,Arun Mallya,Derek Hoiem,Niraj K. Jha,Jan Kautz


Unsupervised Domain Adaptation via Structurally Regularized Deep Clustering
Author: Hui Tang,Ke Chen,Kui Jia


HyperSTAR: Task-Aware Hyperparameters for Deep Networks
Author: Gaurav Mittal,Chang Liu,Nikolaos Karianakis,Victor Fragoso,Mei Chen,Yun Fu


ActBERT: Learning Global-Local Video-Text Representations
Author: Linchao Zhu,Yi Yang


State-Relabeling Adversarial Active Learning
Author: Beichen Zhang,Liang Li,Shijie Yang,Shuhui Wang,Zheng-Jun Zha,Qingming Huang


Erasing Integrated Learning: A Simple Yet Effective Approach for Weakly Supervised Object Localization
Author: Jinjie Mai,Meng Yang,Wenfeng Luo


A Shared Multi-Attention Framework for Multi-Label Zero-Shot Learning
Author: Dat Huynh,Ehsan Elhamifar


Self-Supervised Learning of Interpretable Keypoints From Unlabelled Videos
Author: Tomas Jakab,Ankush Gupta,Hakan Bilen,Andrea Vedaldi


Few-Shot Open-Set Recognition Using Meta-Learning
Author: Bo Liu,Hao Kang,Haoxiang Li,Gang Hua,Nuno Vasconcelos


Few-Shot Learning via Embedding Adaptation With Set-to-Set Functions
Author: Han-Jia Ye,Hexiang Hu,De-Chuan Zhan,Fei Sha


Temporally Distributed Networks for Fast Video Semantic Segmentation
Author: Ping Hu,Fabian Caba,Oliver Wang,Zhe Lin,Stan Sclaroff,Federico Perazzi


Benchmarking the Robustness of Semantic Segmentation Models
Author: Christoph Kamann,Carsten Rother


There and Back Again: Revisiting Backpropagation Saliency Methods
Author: Sylvestre-Alvise Rebuffi,Ruth Fong,Xu Ji,Andrea Vedaldi


Deep Semantic Clustering by Partition Confidence Maximisation
Author: Jiabo Huang,Shaogang Gong,Xiatian Zhu


StructEdit: Learning Structural Shape Variations
Author: Kaichun Mo,Paul Guerrero,Li Yi,Hao Su,Peter Wonka,Niloy J. Mitra,Leonidas J. Guibas


Harmonizing Transferability and Discriminability for Adapting Object Detectors
Author: Chaoqi Chen,Zebiao Zheng,Xinghao Ding,Yue Huang,Qi Dou


Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching
Author: Xuhua Huang,Jiarui Xu,Yu-Wing Tai,Chi-Keung Tang


CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement
Author: Ho Kei Cheng,Jihoon Chung,Yu-Wing Tai,Chi-Keung Tang


Correlating Edge, Pose With Parsing
Author: Ziwei Zhang,Chi Su,Liang Zheng,Xiaodong Xie


VecRoad: Point-Based Iterative Graph Exploration for Road Graphs Extraction
Author: Yong-Qiang Tan,Shang-Hua Gao,Xuan-Yi Li,Ming-Ming Cheng,Bo Ren


Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation
Author: Zeyu Wang,Klint Qinami,Ioannis Christos Karakozis,Kyle Genova,Prem Nair,Kenji Hata,Olga Russakovsky


Hierarchical Human Parsing With Typed Part-Relation Reasoning
Author: Wenguan Wang,Hailong Zhu,Jifeng Dai,Yanwei Pang,Jianbing Shen,Ling Shao


Compositional Convolutional Neural Networks: A Deep Architecture With Innate Robustness to Partial Occlusion
Author: Adam Kortylewski,Ju He,Qing Liu,Alan L. Yuille


Spatial Pyramid Based Graph Reasoning for Semantic Segmentation
Author: Xia Li,Yibo Yang,Qijie Zhao,Tiancheng Shen,Zhouchen Lin,Hong Liu


Learning Video Object Segmentation From Unlabeled Videos
Author: Xiankai Lu,Wenguan Wang,Jianbing Shen,Yu-Wing Tai,David J. Crandall,Steven C. H. Hoi


Part-Aware Context Network for Human Parsing
Author: Xiaomei Zhang,Yingying Chen,Bingke Zhu,Jinqiao Wang,Ming Tang


SCOUT: Self-Aware Discriminant Counterfactual Explanations
Author: Pei Wang,Nuno Vasconcelos


Weakly-Supervised Semantic Segmentation via Sub-Category Exploration
Author: Yu-Ting Chang,Qiaosong Wang,Wei-Chih Hung,Robinson Piramuthu,Yi-Hsuan Tsai,Ming-Hsuan Yang


Continual Learning With Extended Kronecker-Factored Approximate Curvature
Author: Janghyeon Lee,Hyeong Gwon Hong,Donggyu Joo,Junmo Kim


Phase Consistent Ecological Domain Adaptation
Author: Yanchao Yang,Dong Lao,Ganesh Sundaramoorthi,Stefano Soatto


AD-Cluster: Augmented Discriminative Clustering for Domain Adaptive Person Re-Identification
Author: Yunpeng Zhai,Shijian Lu,Qixiang Ye,Xuebo Shan,Jie Chen,Rongrong Ji,Yonghong Tian


3D-MPA: Multi-Proposal Aggregation for 3D Semantic Instance Segmentation
Author: Francis Engelmann,Martin Bokeloh,Alireza Fathi,Bastian Leibe,Matthias Niessner


Deep Active Learning for Biased Datasets via Fisher Kernel Self-Supervision
Author: Denis Gudovskiy,Alec Hodgkinson,Takuya Yamaguchi,Sotaro Tsukizawa


Adaptive Graph Convolutional Network With Attention Graph Clustering for Co-Saliency Detection
Author: Kaihua Zhang,Tengpeng Li,Shiwen Shen,Bo Liu,Jin Chen,Qingshan Liu


A2dele: Adaptive and Attentive Depth Distiller for Efficient RGB-D Salient Object Detection
Author: Yongri Piao,Zhengkun Rong,Miao Zhang,Weisong Ren,Huchuan Lu


Deep Fair Clustering for Visual Learning
Author: Peizhao Li,Han Zhao,Hongfu Liu


Bidirectional Graph Reasoning Network for Panoptic Segmentation
Author: Yangxin Wu,Gengwei Zhang,Yiming Gao,Xiajun Deng,Ke Gong,Xiaodan Liang,Liang Lin


Exploit Clues From Views: Self-Supervised and Regularized Learning for Multiview Object Recognition
Author: Chih-Hui Ho,Bo Liu,Tz-Ying Wu,Nuno Vasconcelos


Spherical Space Domain Adaptation With Robust Pseudo-Label Loss
Author: Xiang Gu,Jian Sun,Zongben Xu


Stochastic Classifiers for Unsupervised Domain Adaptation
Author: Zhihe Lu,Yongxin Yang,Xiatian Zhu,Cong Liu,Yi-Zhe Song,Tao Xiang


Unsupervised Learning of Intrinsic Structural Representation Points
Author: Nenglun Chen,Lingjie Liu,Zhiming Cui,Runnan Chen,Duygu Ceylan,Changhe Tu,Wenping Wang


PolyTransform: Deep Polygon Transformer for Instance Segmentation
Author: Justin Liang,Namdar Homayounfar,Wei-Chiu Ma,Yuwen Xiong,Rui Hu,Raquel Urtasun


Interactive Two-Stream Decoder for Accurate and Fast Saliency Detection
Author: Huajun Zhou,Xiaohua Xie,Jian-Huang Lai,Zixuan Chen,Lingxiao Yang


Towards Better Generalization: Joint Depth-Pose Learning Without PoseNet
Author: Wang Zhao,Shaohui Liu,Yezhi Shu,Yong-Jin Liu


LT-Net: Label Transfer by Learning Reversible Voxel-Wise Correspondence for One-Shot Medical Image Segmentation
Author: Shuxin Wang,Shilei Cao,Dong Wei,Renzhen Wang,Kai Ma,Liansheng Wang,Deyu Meng,Yefeng Zheng


FGN: Fully Guided Network for Few-Shot Instance Segmentation
Author: Zhibo Fan,Jin-Gang Yu,Zhihao Liang,Jiarong Ou,Changxin Gao,Gui-Song Xia,Yuanqing Li


A Quantum Computational Approach to Correspondence Problems on Point Sets
Author: Vladislav Golyanik,Christian Theobalt


Data-Efficient Semi-Supervised Learning by Reliable Edge Mining
Author: Peibin Chen,Tao Ma,Xu Qin,Weidi Xu,Shuchang Zhou


NestedVAE: Isolating Common Factors via Weak Supervision
Author: Matthew J. Vowels,Necati Cihan Camgoz,Richard Bowden


Progressive Adversarial Networks for Fine-Grained Domain Adaptation
Author: Sinan Wang,Xinyang Chen,Yunbo Wang,Mingsheng Long,Jianmin Wang


A Disentangling Invertible Interpretation Network for Explaining Latent Representations
Author: Patrick Esser,Robin Rombach,Bjorn Ommer


Modeling the Background for Incremental Learning in Semantic Segmentation
Author: Fabio Cermelli,Massimiliano Mancini,Samuel Rota Bulo,Elisa Ricci,Barbara Caputo


Interpreting the Latent Space of GANs for Semantic Face Editing
Author: Yujun Shen,Jinjin Gu,Xiaoou Tang,Bolei Zhou


Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation
Author: Jianqiang Wan,Yang Liu,Donglai Wei,Xiang Bai,Yongchao Xu


Self-Learning With Rectification Strategy for Human Parsing
Author: Tao Li,Zhiyuan Liang,Sanyuan Zhao,Jiahao Gong,Jianbing Shen


Hyperbolic Visual Embedding Learning for Zero-Shot Recognition
Author: Shaoteng Liu,Jingjing Chen,Liangming Pan,Chong-Wah Ngo,Tat-Seng Chua,Yu-Gang Jiang


Sequential Mastery of Multiple Visual Tasks: Networks Naturally Learn to Learn and Forget to Forget
Author: Guy Davidson,Michael C. Mozer


Distilling Effective Supervision From Severe Label Noise
Author: Zizhao Zhang,Han Zhang,Sercan O. Arik,Honglak Lee,Tomas Pfister


Eternal Sunshine of the Spotless Net: Selective Forgetting in Deep Networks
Author: Aditya Golatkar,Alessandro Achille,Stefano Soatto


CenterMask: Single Shot Instance Segmentation With Point Representation
Author: Yuqing Wang,Zhaoliang Xu,Hao Shen,Baoshan Cheng,Lirong Yang


Mitigating Bias in Face Recognition Using Skewness-Aware Reinforcement Learning
Author: Mei Wang,Weihong Deng


MineGAN: Effective Knowledge Transfer From GANs to Target Domains With Few Images
Author: Yaxing Wang,Abel Gonzalez-Garcia,David Berga,Luis Herranz,Fahad Shahbaz Khan,Joost van de Weijer


DLWL: Improving Detection for Lowshot Classes With Weakly Labelled Data
Author: Vignesh Ramanathan,Rui Wang,Dhruv Mahajan


Unsupervised Deep Shape Descriptor With Point Distribution Learning
Author: Yi Shi,Mengchen Xu,Shuaihang Yuan,Yi Fang


Stylization-Based Architecture for Fast Deep Exemplar Colorization
Author: Zhongyou Xu,Tingting Wang,Faming Fang,Yun Sheng,Guixu Zhang


Cars Can’t Fly Up in the Sky: Improving Urban-Scene Segmentation via Height-Driven Attention Networks
Author: Sungha Choi,Joanne T. Kim,Jaegul Choo


State-Aware Tracker for Real-Time Video Object Segmentation
Author: Xi Chen,Zuoxin Li,Ye Yuan,Gang Yu,Jianxin Shen,Donglian Qi


Iteratively-Refined Interactive 3D Medical Image Segmentation With Multi-Agent Reinforcement Learning
Author: Xuan Liao,Wenhao Li,Qisen Xu,Xiangfeng Wang,Bo Jin,Xiaoyun Zhang,Yanfeng Wang,Ya Zhang


ENSEI: Efficient Secure Inference via Frequency-Domain Homomorphic Convolution for Privacy-Preserving Visual Recognition
Author: Song Bian,Tianchen Wang,Masayuki Hiromoto,Yiyu Shi,Takashi Sato


Multi-Scale Interactive Network for Salient Object Detection
Author: Youwei Pang,Xiaoqi Zhao,Lihe Zhang,Huchuan Lu


Interactive Multi-Label CNN Learning With Partial Labels
Author: Dat Huynh,Ehsan Elhamifar


ViewAL: Active Learning With Viewpoint Entropy for Semantic Segmentation
Author: Yawar Siddiqui,Julien Valentin,Matthias Niessner


Scene-Adaptive Video Frame Interpolation via Meta-Learning
Author: Myungsub Choi,Janghoon Choi,Sungyong Baik,Tae Hyun Kim,Kyoung Mu Lee


Action Segmentation With Joint Self-Supervised Temporal Domain Adaptation
Author: Min-Hung Chen,Baopu Li,Yingze Bao,Ghassan AlRegib,Zsolt Kira


Pixel Consensus Voting for Panoptic Segmentation
Author: Haochen Wang,Ruotian Luo,Michael Maire,Greg Shakhnarovich


Minimizing Discrete Total Curvature for Image Processing
Author: Qiuxiang Zhong,Yutong Li,Yijie Yang,Yuping Duan


Towards Robust Image Classification Using Sequential Attention Models
Author: Daniel Zoran,Mike Chrzanowski,Po-Sen Huang,Sven Gowal,Alex Mott,Pushmeet Kohli


Discovering Synchronized Subsets of Sequences: A Large Scale Solution
Author: Evangelos Sariyanidi,Casey J. Zampella,Keith G. Bartley,John D. Herrington,Theodore D. Satterthwaite,Robert T. Schultz,Birkan Tunc


Going Deeper With Lean Point Networks
Author: Eric-Tuan Le,Iasonas Kokkinos,Niloy J. Mitra


Efficient and Robust Shape Correspondence via Sparsity-Enforced Quadratic Assignment
Author: Rui Xiang,Rongjie Lai,Hongkai Zhao


Explainable Object-Induced Action Decision for Autonomous Vehicles
Author: Yiran Xu,Xiaoyin Yang,Lihang Gong,Hsuan-Chu Lin,Tz-Ying Wu,Yunsheng Li,Nuno Vasconcelos


Spatially Attentive Output Layer for Image Classification
Author: Ildoo Kim,Woonhyuk Baek,Sungwoong Kim


Attack to Explain Deep Representation
Author: Mohammad A. A. K. Jalwana,Naveed Akhtar,Mohammed Bennamoun,Ajmal Mian


Computing Valid P-Values for Image Segmentation by Selective Inference
Author: Kosuke Tanizaki,Noriaki Hashimoto,Yu Inatsu,Hidekata Hontani,Ichiro Takeuchi


Unsupervised Learning From Video With Deep Neural Embeddings
Author: Chengxu Zhuang,Tianwei She,Alex Andonian,Max Sobol Mark,Daniel Yamins


Partial Weight Adaptation for Robust DNN Inference
Author: Xiufeng Xie,Kyu-Han Kim


Probability Weighted Compact Feature for Domain Adaptive Retrieval
Author: Fuxiang Huang,Lei Zhang,Yang Yang,Xichuan Zhou


Where Does It End? - Reasoning About Hidden Surfaces by Object Intersection Constraints
Author: Michael Strecke,Jorg Stuckler


PolarNet: An Improved Grid Representation for Online LiDAR Point Clouds Semantic Segmentation
Author: Yang Zhang,Zixiang Zhou,Philip David,Xiangyu Yue,Zerong Xi,Boqing Gong,Hassan Foroosh


Pathological Retinal Region Segmentation From OCT Images Using Geometric Relation Based Augmentation
Author: Dwarikanath Mahapatra,Behzad Bozorgtabar,Ling Shao


Transferring and Regularizing Prediction for Semantic Segmentation
Author: Yiheng Zhang,Zhaofan Qiu,Ting Yao,Chong-Wah Ngo,Dong Liu,Tao Mei


PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition
Author: Kun Su,Xiulong Liu,Eli Shlizerman


Model Adaptation: Unsupervised Domain Adaptation Without Source Data
Author: Rui Li,Qianfen Jiao,Wenming Cao,Hau-San Wong,Si Wu


Evade Deep Image Retrieval by Stashing Private Images in the Hash Space
Author: Yanru Xiao,Cong Wang,Xing Gao


Advisable Learning for Self-Driving Vehicles by Internalizing Observation-to-Action Rules
Author: Jinkyu Kim,Suhong Moon,Anna Rohrbach,Trevor Darrell,John Canny


ProAlignNet: Unsupervised Learning for Progressively Aligning Noisy Contours
Author: VSR Veeravasarapu,Abhishek Goel,Deepak Mittal,Maneesh Singh


Attribution in Scale and Space
Author: Shawn Xu,Subhashini Venugopalan,Mukund Sundararajan


Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing
Author: Vedika Agarwal,Rakshith Shetty,Mario Fritz


Deep Relational Reasoning Graph Network for Arbitrary Shape Text Detection
Author: Shi-Xue Zhang,Xiaobin Zhu,Jie-Bo Hou,Chang Liu,Chun Yang,Hongfa Wang,Xu-Cheng Yin


Large-Scale Object Detection in the Wild From Imbalanced Multi-Labels
Author: Junran Peng,Xingyuan Bu,Ming Sun,Zhaoxiang Zhang,Tieniu Tan,Junjie Yan


BBN: Bilateral-Branch Network With Cumulative Learning for Long-Tailed Visual Recognition
Author: Boyan Zhou,Quan Cui,Xiu-Shen Wei,Zhao-Min Chen


Momentum Contrast for Unsupervised Visual Representation Learning
Author: Kaiming He,Haoqi Fan,Yuxin Wu,Saining Xie,Ross Girshick


Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation
Author: Gedas Bertasius,Lorenzo Torresani


Weakly Supervised Fine-Grained Image Classification via Guassian Mixture Model Oriented Discriminative Learning
Author: Zhihui Wang,Shijie Wang,Shuhui Yang,Haojie Li,Jianjun Li,Zezhou Li


Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection
Author: Shifeng Zhang,Cheng Chi,Yongqiang Yao,Zhen Lei,Stan Z. Li


Learning User Representations for Open Vocabulary Image Hashtag Prediction
Author: Thibaut Durand


Sketch Less for More: On-the-Fly Fine-Grained Sketch-Based Image Retrieval
Author: Ayan Kumar Bhunia,Yongxin Yang,Timothy M. Hospedales,Tao Xiang,Yi-Zhe Song


Few-Shot Pill Recognition
Author: Suiyi Ling,Andreas Pastor,Jing Li,Zhaohui Che,Junle Wang,Jieun Kim,Patrick Le Callet


PointRend: Image Segmentation As Rendering
Author: Alexander Kirillov,Yuxin Wu,Kaiming He,Ross Girshick


ABCNet: Real-Time Scene Text Spotting With Adaptive Bezier-Curve Network
Author: Yuliang Liu,Hao Chen,Chunhua Shen,Tong He,Lianwen Jin,Liangwei Wang


Learning Temporal Co-Attention Models for Unsupervised Video Action Localization
Author: Guoqiang Gong,Xinghan Wang,Yadong Mu,Qi Tian


Spatiotemporal Fusion in 3D CNNs: A Probabilistic View
Author: Yizhou Zhou,Xiaoyan Sun,Chong Luo,Zheng-Jun Zha,Wenjun Zeng


Uncertainty-Aware Score Distribution Learning for Action Quality Assessment
Author: Yansong Tang,Zanlin Ni,Jiahuan Zhou,Danyang Zhang,Jiwen Lu,Ying Wu,Jie Zhou


Learning Interactions and Relationships Between Movie Characters
Author: Anna Kukleva,Makarand Tapaswi,Ivan Laptev


Video Panoptic Segmentation
Author: Dahun Kim,Sanghyun Woo,Joon-Young Lee,In So Kweon


Understanding Human Hands in Contact at Internet Scale
Author: Dandan Shan,Jiaqi Geng,Michelle Shu,David F. Fouhey


End-to-End Learning of Visual Representations From Uncurated Instructional Videos
Author: Antoine Miech,Jean-Baptiste Alayrac,Lucas Smaira,Ivan Laptev,Josef Sivic,Andrew Zisserman


You2Me: Inferring Body Pose in Egocentric Video via First and Second Person Interactions
Author: Evonne Ng,Donglai Xiang,Hanbyul Joo,Kristen Grauman


Learning a Weakly-Supervised Video Actor-Action Segmentation Model With a Wise Selection
Author: Jie Chen,Zhiheng Li,Jiebo Luo,Chenliang Xu


Learning to Measure the Static Friction Coefficient in Cloth Contact
Author: Abdullah Haroon Rasheed,Victor Romero,Florence Bertails-Descoubes,Stefanie Wuhrer,Jean-Sebastien Franco,Arnaud Lazarus


SpeedNet: Learning the Speediness in Videos
Author: Sagie Benaim,Ariel Ephrat,Oran Lang,Inbar Mosseri,William T. Freeman,Michael Rubinstein,Michal Irani,Tali Dekel


Telling Left From Right: Learning Spatial Correspondence of Sight and Sound
Author: Karren Yang,Bryan Russell,Justin Salamon


Visual-Textual Capsule Routing for Text-Based Video Segmentation
Author: Bruce McIntosh,Kevin Duarte,Yogesh S Rawat,Mubarak Shah


Graph-Structured Referring Expression Reasoning in the Wild
Author: Sibei Yang,Guanbin Li,Yizhou Yu


Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs
Author: Shizhe Chen,Qin Jin,Peng Wang,Qi Wu


Hierarchical Conditional Relation Networks for Video Question Answering
Author: Thao Minh Le,Vuong Le,Svetha Venkatesh,Truyen Tran


REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments
Author: Yuankai Qi,Qi Wu,Peter Anderson,Xin Wang,William Yang Wang,Chunhua Shen,Anton van den Hengel


Iterative Answer Prediction With Pointer-Augmented Multimodal Transformers for TextVQA
Author: Ronghang Hu,Amanpreet Singh,Trevor Darrell,Marcus Rohrbach


SQuINTing at VQA Models: Introspecting VQA Models With Sub-Questions
Author: Ramprasaath R. Selvaraju,Purva Tendulkar,Devi Parikh,Eric Horvitz,Marco Tulio Ribeiro,Besmira Nushi,Ece Kamar


Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks
Author: Fengda Zhu,Yi Zhu,Xiaojun Chang,Xiaodan Liang


Sign Language Transformers: Joint End-to-End Sign Language Recognition and Translation
Author: Necati Cihan Camgoz,Oscar Koller,Simon Hadfield,Richard Bowden


Multi-Task Collaborative Network for Joint Referring Expression Comprehension and Segmentation
Author: Gen Luo,Yiyi Zhou,Xiaoshuai Sun,Liujuan Cao,Chenglin Wu,Cheng Deng,Rongrong Ji


Counterfactual Vision and Language Learning
Author: Ehsan Abbasnejad,Damien Teney,Amin Parvaneh,Javen Shi,Anton van den Hengel


Iterative Context-Aware Graph Inference for Visual Dialog
Author: Dan Guo,Hui Wang,Hanwang Zhang,Zheng-Jun Zha,Meng Wang


TA-Student VQA: Multi-Agents Training by Self-Questioning
Author: Peixi Xiong,Ying Wu


Exploring Self-Attention for Image Recognition
Author: Hengshuang Zhao,Jiaya Jia,Vladlen Koltun


Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension
Author: Zhenfang Chen,Peng Wang,Lin Ma,Kwan-Yee K. Wong,Qi Wu


Improving Convolutional Networks With Self-Calibrated Convolutions
Author: Jiang-Jiang Liu,Qibin Hou,Ming-Ming Cheng,Changhu Wang,Jiashi Feng


Modality Shifting Attention Network for Multi-Modal Video Question Answering
Author: Junyeong Kim,Minuk Ma,Trung Pham,Kyungsu Kim,Chang D. Yoo


Learning to Structure an Image With Few Colors
Author: Yunzhong Hou,Liang Zheng,Stephen Gould


On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering
Author: Xinyu Wang,Yuliang Liu,Chunhua Shen,Chun Chet Ng,Canjie Luo,Lianwen Jin,Chee Seng Chan,Anton van den Hengel,Liangwei Wang


From Paris to Berlin: Discovering Fashion Style Influences Around the World
Author: Ziad Al-Halah,Kristen Grauman


A Local-to-Global Approach to Multi-Modal Movie Scene Segmentation
Author: Anyi Rao,Linning Xu,Yu Xiong,Guodong Xu,Qingqiu Huang,Bolei Zhou,Dahua Lin


G-TAD: Sub-Graph Localization for Temporal Action Detection
Author: Mengmeng Xu,Chen Zhao,David S. Rojas,Ali Thabet,Bernard Ghanem


Detailed 2D-3D Joint Representation for Human-Object Interaction
Author: Yong-Lu Li,Xinpeng Liu,Han Lu,Shiyi Wang,Junqi Liu,Jiefeng Li,Cewu Lu


One-Shot Adversarial Attacks on Visual Tracking With Dual Attention
Author: Xuesong Chen,Xiyu Yan,Feng Zheng,Yong Jiang,Shu-Tao Xia,Yong Zhao,Rongrong Ji


Rethinking Classification and Localization for Object Detection
Author: Yue Wu,Yinpeng Chen,Lu Yuan,Zicheng Liu,Lijuan Wang,Hongzhi Li,Yun Fu


Correspondence Networks With Adaptive Neighbourhood Consensus
Author: Shuda Li,Kai Han,Theo W. Costain,Henry Howard-Jenkins,Victor Prisacariu


Multiple Anchor Learning for Visual Object Detection
Author: Wei Ke,Tianliang Zhang,Zeyi Huang,Qixiang Ye,Jianzhuang Liu,Dong Huang


PhraseCut: Language-Based Image Segmentation in the Wild
Author: Chenyun Wu,Zhe Lin,Scott Cohen,Trung Bui,Subhransu Maji


Mask Encoding for Single Shot Instance Segmentation
Author: Rufeng Zhang,Zhi Tian,Chunhua Shen,Mingyu You,Youliang Yan


Action Genome: Actions As Compositions of Spatio-Temporal Scene Graphs
Author: Jingwei Ji,Ranjay Krishna,Li Fei-Fei,Juan Carlos Niebles


Learning Unseen Concepts via Hierarchical Decomposition and Composition
Author: Muli Yang,Cheng Deng,Junchi Yan,Xianglong Liu,Dacheng Tao


Hi-CMD: Hierarchical Cross-Modality Disentanglement for Visible-Infrared Person Re-Identification
Author: Seokeon Choi,Sumin Lee,Youngeun Kim,Taekyung Kim,Changick Kim


In Defense of Grid Features for Visual Question Answering
Author: Huaizu Jiang,Ishan Misra,Marcus Rohrbach,Erik Learned-Miller,Xinlei Chen


Multi-Mutual Consistency Induced Transfer Subspace Learning for Human Motion Segmentation
Author: Tao Zhou,Huazhu Fu,Chen Gong,Jianbing Shen,Ling Shao,Fatih Porikli


Dense Regression Network for Video Grounding
Author: Runhao Zeng,Haoming Xu,Wenbing Huang,Peihao Chen,Mingkui Tan,Chuang Gan


Neural Architecture Search for Lightweight Non-Local Networks
Author: Yingwei Li,Xiaojie Jin,Jieru Mei,Xiaochen Lian,Linjie Yang,Cihang Xie,Qihang Yu,Yuyin Zhou,Song Bai,Alan L. Yuille


Learning Saliency Propagation for Semi-Supervised Instance Segmentation
Author: Yanzhao Zhou,Xin Wang,Jianbin Jiao,Trevor Darrell,Fisher Yu


Speech2Action: Cross-Modal Supervision for Action Recognition
Author: Arsha Nagrani,Chen Sun,David Ross,Rahul Sukthankar,Cordelia Schmid,Andrew Zisserman


Normalized and Geometry-Aware Self-Attention Network for Image Captioning
Author: Longteng Guo,Jing Liu,Xinxin Zhu,Peng Yao,Shichen Lu,Hanqing Lu


Memory Enhanced Global-Local Aggregation for Video Object Detection
Author: Yihong Chen,Yue Cao,Han Hu,Liwei Wang


Solving Mixed-Modal Jigsaw Puzzle for Fine-Grained Sketch-Based Image Retrieval
Author: Kaiyue Pang,Yongxin Yang,Timothy M. Hospedales,Tao Xiang,Yi-Zhe Song


LG-GAN: Label Guided Adversarial Network for Flexible Targeted Attack of Point Cloud Based Deep Networks
Author: Hang Zhou,Dongdong Chen,Jing Liao,Kejiang Chen,Xiaoyi Dong,Kunlin Liu,Weiming Zhang,Gang Hua,Nenghai Yu


Memory Aggregation Networks for Efficient Interactive Video Object Segmentation
Author: Jiaxu Miao,Yunchao Wei,Yi Yang


VQA With No Questions-Answers Training
Author: Ben-Zion Vatashsky,Shimon Ullman


Counting Out Time: Class Agnostic Video Repetition Counting in the Wild
Author: Debidatta Dwibedi,Yusuf Aytar,Jonathan Tompson,Pierre Sermanet,Andrew Zisserman


SaccadeNet: A Fast and Accurate Object Detector
Author: Shiyi Lan,Zhou Ren,Yi Wu,Larry S. Davis,Gang Hua


Multi-Granularity Reference-Aided Attentive Feature Aggregation for Video-Based Person Re-Identification
Author: Zhizheng Zhang,Cuiling Lan,Wenjun Zeng,Zhibo Chen


Video Object Grounding Using Semantic Roles in Language Description
Author: Arka Sadhu,Kan Chen,Ram Nevatia


Designing Network Design Spaces
Author: Ilija Radosavovic,Raj Prateek Kosaraju,Ross Girshick,Kaiming He,Piotr Dollar


12-in-1: Multi-Task Vision and Language Representation Learning
Author: Jiasen Lu,Vedanuj Goswami,Marcus Rohrbach,Devi Parikh,Stefan Lee


MLCVNet: Multi-Level Context VoteNet for 3D Object Detection
Author: Qian Xie,Yu-Kun Lai,Jing Wu,Zhoutao Wang,Yiming Zhang,Kai Xu,Jun Wang


Listen to Look: Action Recognition by Previewing Audio
Author: Ruohan Gao,Tae-Hyun Oh,Kristen Grauman,Lorenzo Torresani


Attention Convolutional Binary Neural Tree for Fine-Grained Visual Categorization
Author: Ruyi Ji,Longyin Wen,Libo Zhang,Dawei Du,Yanjun Wu,Chen Zhao,Xianglong Liu,Feiyue Huang


Music Gesture for Visual Sound Separation
Author: Chuang Gan,Deng Huang,Hang Zhao,Joshua B. Tenenbaum,Antonio Torralba


Referring Image Segmentation via Cross-Modal Progressive Comprehension
Author: Shaofei Huang,Tianrui Hui,Si Liu,Guanbin Li,Yunchao Wei,Jizhong Han,Luoqi Liu,Bo Li


Cloth in the Wind: A Case Study of Physical Measurement Through Simulation
Author: Tom F. H. Runia,Kirill Gavrilyuk,Cees G. M. Snoek,Arnold W. M. Smeulders


The Garden of Forking Paths: Towards Multi-Future Trajectory Prediction
Author: Junwei Liang,Lu Jiang,Kevin Murphy,Ting Yu,Alexander Hauptmann


CentripetalNet: Pursuing High-Quality Keypoint Pairs for Object Detection
Author: Zhiwei Dong,Guoxuan Li,Yue Liao,Fei Wang,Pengju Ren,Chen Qian


PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection
Author: Shaoshuai Shi,Chaoxu Guo,Li Jiang,Zhe Wang,Jianping Shi,Xiaogang Wang,Hongsheng Li


Graph Embedded Pose Clustering for Anomaly Detection
Author: Amir Markovitz,Gilad Sharir,Itamar Friedman,Lihi Zelnik-Manor,Shai Avidan


Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation
Author: Jiaming Sun,Linghao Chen,Yiming Xie,Siyu Zhang,Qinhong Jiang,Xiaowei Zhou,Hujun Bao


Deepstrip: High-Resolution Boundary Refinement
Author: Peng Zhou,Brian Price,Scott Cohen,Gregg Wilensky,Larry S. Davis


Smoothing Adversarial Domain Attack and P-Memory Reconsolidation for Cross-Domain Person Re-Identification
Author: Guangcong Wang,Jian-Huang Lai,Wenqi Liang,Guangrun Wang


Meshed-Memory Transformer for Image Captioning
Author: Marcella Cornia,Matteo Stefanini,Lorenzo Baraldi,Rita Cucchiara


Learning From Noisy Anchors for One-Stage Object Detection
Author: Hengduo Li,Zuxuan Wu,Chen Zhu,Caiming Xiong,Richard Socher,Larry S. Davis


Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection
Author: Zhongzheng Ren,Zhiding Yu,Xiaodong Yang,Ming-Yu Liu,Yong Jae Lee,Alexander G. Schwing,Jan Kautz


Density-Based Clustering for 3D Object Detection in Point Clouds
Author: Syeda Mariam Ahmed,Chee Meng Chew


Few-Shot Video Classification via Temporal Alignment
Author: Kaidi Cao,Jingwei Ji,Zhangjie Cao,Chien-Yi Chang,Juan Carlos Niebles


Densely Connected Search Space for More Flexible Neural Architecture Search
Author: Jiemin Fang,Yuzhu Sun,Qian Zhang,Yuan Li,Wenyu Liu,Xinggang Wang


Fine-Grained Video-Text Retrieval With Hierarchical Graph Reasoning
Author: Shizhe Chen,Yida Zhao,Qin Jin,Qi Wu


Warp to the Future: Joint Forecasting of Features and Feature Motion
Author: Josip Saric,Marin Orsic,Tonci Antunovic,Sacha Vrazic,Sinisa Segvic


Network Adjustment: Channel Search Guided by FLOPs Utilization Ratio
Author: Zhengsu Chen,Jianwei Niu,Lingxi Xie,Xuefeng Liu,Longhui Wei,Qi Tian


Where Does It Exist: Spatio-Temporal Video Grounding for Multi-Form Sentences
Author: Zhu Zhang,Zhou Zhao,Yang Zhao,Qi Wang,Huasheng Liu,Lianli Gao


Cross-Modal Cross-Domain Moment Alignment Network for Person Search
Author: Ya Jing,Wei Wang,Liang Wang,Tieniu Tan


Self-Training With Noisy Student Improves ImageNet Classification
Author: Qizhe Xie,Minh-Thang Luong,Eduard Hovy,Quoc V. Le


Learning Longterm Representations for Person Re-Identification Using Radio Signals
Author: Lijie Fan,Tianhong Li,Rongyao Fang,Rumen Hristov,Yuan Yuan,Dina Katabi


LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation
Author: Keunhong Park,Arsalan Mousavian,Yu Xiang,Dieter Fox


Learning Instance Occlusion for Panoptic Segmentation
Author: Justin Lazarow,Kwonjoon Lee,Kunyu Shi,Zhuowen Tu


Vision-Dialog Navigation by Exploring Cross-Modal Memory
Author: Yi Zhu,Fengda Zhu,Zhaohuan Zhan,Bingqian Lin,Jianbin Jiao,Xiaojun Chang,Xiaodan Liang


ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks
Author: Mohit Shridhar,Jesse Thomason,Daniel Gordon,Yonatan Bisk,Winson Han,Roozbeh Mottaghi,Luke Zettlemoyer,Dieter Fox


NMS by Representative Region: Towards Crowded Pedestrian Detection by Proposal Pairing
Author: Xin Huang,Zheng Ge,Zequn Jie,Osamu Yoshie


Visual Commonsense R-CNN
Author: Tan Wang,Jianqiang Huang,Hanwang Zhang,Qianru Sun


What Deep CNNs Benefit From Global Covariance Pooling: An Optimization Perspective
Author: Qilong Wang,Li Zhang,Banggu Wu,Dongwei Ren,Peihua Li,Wangmeng Zuo,Qinghua Hu


EfficientDet: Scalable and Efficient Object Detection
Author: Mingxing Tan,Ruoming Pang,Quoc V. Le


Fast Template Matching and Update for Video Object Tracking and Segmentation
Author: Mingjie Sun,Jimin Xiao,Eng Gee Lim,Bingfeng Zhang,Yao Zhao


Counterfactual Samples Synthesizing for Robust Visual Question Answering
Author: Long Chen,Xin Yan,Jun Xiao,Hanwang Zhang,Shiliang Pu,Yueting Zhuang


Local-Global Video-Text Interactions for Temporal Grounding
Author: Jonghwan Mun,Minsu Cho,Bohyung Han


Set-Constrained Viterbi for Set-Supervised Action Segmentation
Author: Jun Li,Sinisa Todorovic


Probabilistic Video Prediction From Noisy Data With a Posterior Confidence
Author: Yunbo Wang,Jiajun Wu,Mingsheng Long,Joshua B. Tenenbaum


Beyond Short-Term Snippet: Video Relation Detection With Spatio-Temporal Global Context
Author: Chenchen Liu,Yang Jin,Kehan Xu,Guoqiang Gong,Yadong Mu


Visual Grounding in Video for Unsupervised Word Translation
Author: Gunnar A. Sigurdsson,Jean-Baptiste Alayrac,Aida Nematzadeh,Lucas Smaira,Mateusz Malinowski,Joao Carreira,Phil Blunsom,Andrew Zisserman


Two Causal Principles for Improving Visual Dialog
Author: Jiaxin Qi,Yulei Niu,Jianqiang Huang,Hanwang Zhang


Spatio-Temporal Graph for Video Captioning With Knowledge Distillation
Author: Boxiao Pan,Haoye Cai,De-An Huang,Kuan-Hui Lee,Adrien Gaidon,Ehsan Adeli,Juan Carlos Niebles


A Real-Time Cross-Modality Correlation Filtering Method for Referring Expression Comprehension
Author: Yue Liao,Si Liu,Guanbin Li,Fei Wang,Yanjie Chen,Chen Qian,Bo Li


Better Captioning With Sequence-Level Exploration
Author: Jia Chen,Qin Jin


Violin: A Large-Scale Dataset for Video-and-Language Inference
Author: Jingzhou Liu,Wenhu Chen,Yu Cheng,Zhe Gan,Licheng Yu,Yiming Yang,Jingjing Liu


RiFeGAN: Rich Feature Generation for Text-to-Image Synthesis From Prior Knowledge
Author: Jun Cheng,Fuxiang Wu,Yanling Tian,Lei Wang,Dapeng Tao


Graph Structured Network for Image-Text Matching
Author: Chunxiao Liu,Zhendong Mao,Tianzhu Zhang,Hongtao Xie,Bin Wang,Yongdong Zhang


Straight to the Point: Fast-Forwarding Videos via Reinforcement Learning Using Textual Data
Author: Washington Ramos,Michel Silva,Edson Araujo,Leandro Soriano Marcolino,Erickson Nascimento


Multi-Modality Cross Attention Network for Image and Sentence Matching
Author: Xi Wei,Tianzhu Zhang,Yan Li,Yongdong Zhang,Feng Wu


Generalized ODIN: Detecting Out-of-Distribution Image Without Learning From Out-of-Distribution Data
Author: Yen-Chang Hsu,Yilin Shen,Hongxia Jin,Zsolt Kira


Learning Augmentation Network via Influence Functions
Author: Donghoon Lee,Hyunsin Park,Trung Pham,Chang D. Yoo


X-Linear Attention Networks for Image Captioning
Author: Yingwei Pan,Ting Yao,Yehao Li,Tao Mei


Unsupervised Person Re-Identification via Multi-Label Classification
Author: Dongkai Wang,Shiliang Zhang


Overcoming Classifier Imbalance for Long-Tail Object Detection With Balanced Group Softmax
Author: Yu Li,Tao Wang,Bingyi Kang,Sheng Tang,Chunfeng Wang,Jintao Li,Jiashi Feng


What You See is What You Get: Exploiting Visibility for 3D Object Detection
Author: Peiyun Hu,Jason Ziglar,David Held,Deva Ramanan


Deep Structure-Revealed Network for Texture Recognition
Author: Wei Zhai,Yang Cao,Zheng-Jun Zha,HaiYong Xie,Feng Wu


Online Knowledge Distillation via Collaborative Learning
Author: Qiushan Guo,Xinjiang Wang,Yichao Wu,Zhipeng Yu,Ding Liang,Xiaolin Hu,Ping Luo


Dynamic Convolution: Attention Over Convolution Kernels
Author: Yinpeng Chen,Xiyang Dai,Mengchen Liu,Dongdong Chen,Lu Yuan,Zicheng Liu


3DSSD: Point-Based 3D Single Stage Object Detector
Author: Zetong Yang,Yanan Sun,Shu Liu,Jiaya Jia


Deep Degradation Prior for Low-Quality Image Classification
Author: Yang Wang,Yang Cao,Zheng-Jun Zha,Jing Zhang,Zhiwei Xiong


ViBE: Dressing for Diverse Body Shapes
Author: Wei-Lin Hsiao,Kristen Grauman


Don’t Judge an Object by Its Context: Learning to Overcome Contextual Bias
Author: Krishna Kumar Singh,Dhruv Mahajan,Kristen Grauman,Yong Jae Lee,Matt Feiszli,Deepti Ghadiyaram


SESS: Self-Ensembling Semi-Supervised 3D Object Detection
Author: Na Zhao,Tat-Seng Chua,Gim Hee Lee


Combining Detection and Tracking for Human Pose Estimation in Videos
Author: Manchen Wang,Joseph Tighe,Davide Modolo


SAPIEN: A SimulAted Part-Based Interactive ENvironment
Author: Fanbo Xiang,Yuzhe Qin,Kaichun Mo,Yikuan Xia,Hao Zhu,Fangchen Liu,Minghua Liu,Hanxiao Jiang,Yifu Yuan,He Wang,Li Yi,Angel X. Chang,Leonidas J. Guibas,Hao Su


RandLA-Net: Efficient Semantic Segmentation of Large-Scale Point Clouds
Author: Qingyong Hu,Bo Yang,Linhai Xie,Stefano Rosa,Yulan Guo,Zhihua Wang,Niki Trigoni,Andrew Markham


SurfelGAN: Synthesizing Realistic Sensor Data for Autonomous Driving
Author: Zhenpei Yang,Yuning Chai,Dragomir Anguelov,Yin Zhou,Pei Sun,Dumitru Erhan,Sean Rafferty,Henrik Kretzschmar


A Programmatic and Semantic Approach to Explaining and Debugging Neural Network Based Object Detectors
Author: Edward Kim,Divya Gopinath,Corina Pasareanu,Sanjit A. Seshia


Predicting Semantic Map Representations From Images Using Pyramid Occupancy Networks
Author: Thomas Roddick,Roberto Cipolla


Efficient Derivative Computation for Cumulative B-Splines on Lie Groups
Author: Christiane Sommer,Vladyslav Usenko,David Schubert,Nikolaus Demmel,Daniel Cremers


RL-CycleGAN: Reinforcement Learning Aware Simulation-to-Real
Author: Kanishka Rao,Chris Harris,Alex Irpan,Sergey Levine,Julian Ibarz,Mohi Khansari


LiDARsim: Realistic LiDAR Simulation by Leveraging the Real World
Author: Sivabalan Manivasagam,Shenlong Wang,Kelvin Wong,Wenyuan Zeng,Mikita Sazanovich,Shuhan Tan,Bin Yang,Wei-Chiu Ma,Raquel Urtasun


Just Go With the Flow: Self-Supervised Scene Flow Estimation
Author: Himangi Mittal,Brian Okorn,David Held


TITAN: Future Forecast Using Action Priors
Author: Srikanth Malla,Behzad Dariush,Chiho Choi


Robust Learning Through Cross-Task Consistency
Author: Amir R. Zamir,Alexander Sax,Nikhil Cheerla,Rohan Suri,Zhangjie Cao,Jitendra Malik,Leonidas J. Guibas


Dynamic Refinement Network for Oriented and Densely Packed Object Detection
Author: Xingjia Pan,Yuqiang Ren,Kekai Sheng,Weiming Dong,Haolei Yuan,Xiaowei Guo,Chongyang Ma,Changsheng Xu


AOWS: Adaptive and Optimal Network Width Search With Latency Constraints
Author: Maxim Berman,Leonid Pishchulin,Ning Xu,Matthew B. Blaschko,Gerard Medioni


High-Dimensional Convolutional Networks for Geometric Pattern Recognition
Author: Christopher Choy,Junha Lee,Rene Ranftl,Jaesik Park,Vladlen Koltun


Filter Response Normalization Layer: Eliminating Batch Dependence in the Training of Deep Neural Networks
Author: Saurabh Singh,Shankar Krishnan


Deep Iterative Surface Normal Estimation
Author: Jan Eric Lenssen,Christian Osendorfer,Jonathan Masci


Dataless Model Selection With the Deep Frame Potential
Author: Calvin Murdock,Simon Lucey


UNAS: Differentiable Architecture Search Meets Reinforcement Learning
Author: Arash Vahdat,Arun Mallya,Ming-Yu Liu,Jan Kautz


Local Context Normalization: Revisiting Local Normalization
Author: Anthony Ortiz,Caleb Robinson,Dan Morris,Olac Fuentes,Christopher Kiekintveld,Md Mahmudulla Hassan,Nebojsa Jojic


ACNe: Attentive Context Normalization for Robust Permutation-Equivariant Learning
Author: Weiwei Sun,Wei Jiang,Eduard Trulls,Andrea Tagliasacchi,Kwang Moo Yi


Learning Situational Driving
Author: Eshed Ohn-Bar,Aditya Prakash,Aseem Behl,Kashyap Chitta,Andreas Geiger


From Depth What Can You See? Depth Completion via Auxiliary Image Reconstruction
Author: Kaiyue Lu,Nick Barnes,Saeed Anwar,Liang Zheng


Symmetry and Group in Attribute-Object Compositions
Author: Yong-Lu Li,Yue Xu,Xiaohan Mao,Cewu Lu


Noise-Aware Fully Webly Supervised Object Detection
Author: Yunhang Shen,Rongrong Ji,Zhiwei Chen,Xiaopeng Hong,Feng Zheng,Jianzhuang Liu,Mingliang Xu,Qi Tian


3D Part Guided Image Editing for Fine-Grained Object Understanding
Author: Zongdai Liu,Feixiang Lu,Peng Wang,Hui Miao,Liangjun Zhang,Ruigang Yang,Bin Zhou


STINet: Spatio-Temporal-Interactive Network for Pedestrian Detection and Trajectory Prediction
Author: Zhishuai Zhang,Jiyang Gao,Junhua Mao,Yukai Liu,Dragomir Anguelov,Congcong Li


Rethinking Performance Estimation in Neural Architecture Search
Author: Xiawu Zheng,Rongrong Ji,Qiang Wang,Qixiang Ye,Zhenguo Li,Yonghong Tian,Qi Tian


Feature-Metric Registration: A Fast Semi-Supervised Approach for Robust Point Cloud Registration Without Correspondences
Author: Xiaoshui Huang,Guofeng Mei,Jian Zhang


Learning Multi-View Camera Relocalization With Graph Neural Networks
Author: Fei Xue,Xin Wu,Shaojun Cai,Junqiu Wang


MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird’s Eye View Maps
Author: Pengxiang Wu,Siheng Chen,Dimitris N. Metaxas


EcoNAS: Finding Proxies for Economical Neural Architecture Search
Author: Dongzhan Zhou,Xinchi Zhou,Wenwei Zhang,Chen Change Loy,Shuai Yi,Xuesen Zhang,Wanli Ouyang


Hit-Detector: Hierarchical Trinity Architecture Search for Object Detection
Author: Jianyuan Guo,Kai Han,Yunhe Wang,Chao Zhang,Zhaohui Yang,Han Wu,Xinghao Chen,Chang Xu


Geometrically Principled Connections in Graph Neural Networks
Author: Shunwang Gong,Mehdi Bahri,Michael M. Bronstein,Stefanos Zafeiriou


On Vocabulary Reliance in Scene Text Recognition
Author: Zhaoyi Wan,Jielei Zhang,Liang Zhang,Jiebo Luo,Cong Yao


Generating Accurate Pseudo-Labels in Semi-Supervised Learning and Avoiding Overconfident Predictions via Hermite Polynomial Activations
Author: Vishnu Suresh Lokhande,Songwong Tasneeyapant,Abhay Venkatesh,Sathya N. Ravi,Vikas Singh


GraspNet-1Billion: A Large-Scale Benchmark for General Object Grasping
Author: Hao-Shu Fang,Chenxi Wang,Minghao Gou,Cewu Lu


PFRL: Pose-Free Reinforcement Learning for 6D Pose Estimation
Author: Jianzhun Shao,Yuhang Jiang,Gu Wang,Zhigang Li,Xiangyang Ji


Through Fog High-Resolution Imaging Using Millimeter Wave Radar
Author: Junfeng Guan,Sohrab Madani,Suraj Jog,Saurabh Gupta,Haitham Hassanieh


Disentangling Physical Dynamics From Unknown Factors for Unsupervised Video Prediction
Author: Vincent Le Guen,Nicolas Thome


D2Det: Towards High Quality Object Detection and Instance Segmentation
Author: Jiale Cao,Hisham Cholakkal,Rao Muhammad Anwer,Fahad Shahbaz Khan,Yanwei Pang,Ling Shao


LiDAR-Based Online 3D Video Object Detection With Graph-Based Message Passing and Spatiotemporal Transformer Attention
Author: Junbo Yin,Jianbing Shen,Chenye Guan,Dingfu Zhou,Ruigang Yang


Orthogonal Convolutional Neural Networks
Author: Jiayun Wang,Yubei Chen,Rudrasis Chakraborty,Stella X. Yu


Self-Robust 3D Point Recognition via Gather-Vector Guidance
Author: Xiaoyi Dong,Dongdong Chen,Hang Zhou,Gang Hua,Weiming Zhang,Nenghai Yu


VectorNet: Encoding HD Maps and Agent Dynamics From Vectorized Representation
Author: Jiyang Gao,Chen Sun,Hang Zhao,Yi Shen,Dragomir Anguelov,Congcong Li,Cordelia Schmid


ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks
Author: Qilong Wang,Banggu Wu,Pengfei Zhu,Peihua Li,Wangmeng Zuo,Qinghua Hu


MTL-NAS: Task-Agnostic Neural Architecture Search Towards General-Purpose Multi-Task Learning
Author: Yuan Gao,Haoping Bai,Zequn Jie,Jiayi Ma,Kui Jia,Wei Liu


PnPNet: End-to-End Perception and Prediction With Tracking in the Loop
Author: Ming Liang,Bin Yang,Wenyuan Zeng,Yun Chen,Rui Hu,Sergio Casas,Raquel Urtasun


Revisiting the Sibling Head in Object Detector
Author: Guanglu Song,Yu Liu,Xiaogang Wang


Visual Reaction: Learning to Play Catch With Your Drone
Author: Kuo-Hao Zeng,Roozbeh Mottaghi,Luca Weihs,Ali Farhadi


Prime Sample Attention in Object Detection
Author: Yuhang Cao,Kai Chen,Chen Change Loy,Dahua Lin


SpineNet: Learning Scale-Permuted Backbone for Recognition and Localization
Author: Xianzhi Du,Tsung-Yi Lin,Pengchong Jin,Golnaz Ghiasi,Mingxing Tan,Yin Cui,Quoc V. Le,Xiaodan Song


KeyPose: Multi-View 3D Labeling and Keypoint Estimation for Transparent Objects
Author: Xingyu Liu,Rico Jonschkowski,Anelia Angelova,Kurt Konolige


SegGCN: Efficient 3D Point Cloud Segmentation With Fuzzy Spherical Kernel
Author: Huan Lei,Naveed Akhtar,Ajmal Mian


nuScenes: A Multimodal Dataset for Autonomous Driving
Author: Holger Caesar,Varun Bankiti,Alex H. Lang,Sourabh Vora,Venice Erin Liong,Qiang Xu,Anush Krishnan,Yu Pan,Giancarlo Baldan,Oscar Beijbom


PVN3D: A Deep Point-Wise 3D Keypoints Voting Network for 6DoF Pose Estimation
Author: Yisheng He,Wei Sun,Haibin Huang,Jianran Liu,Haoqiang Fan,Jian Sun


Probabilistic Pixel-Adaptive Refinement Networks
Author: Anne S. Wannenwetsch,Stefan Roth


Discovering Human Interactions With Novel Objects via Zero-Shot Learning
Author: Suchen Wang,Kim-Hui Yap,Junsong Yuan,Yap-Peng Tan


Equalization Loss for Long-Tailed Object Recognition
Author: Jingru Tan,Changbao Wang,Buyu Li,Quanquan Li,Wanli Ouyang,Changqing Yin,Junjie Yan


Learning Depth-Guided Convolutions for Monocular 3D Object Detection
Author: Mingyu Ding,Yuqi Huo,Hongwei Yi,Zhe Wang,Jianping Shi,Zhiwu Lu,Ping Luo


Seeing Through Fog Without Seeing Fog: Deep Multimodal Sensor Fusion in Unseen Adverse Weather
Author: Mario Bijelic,Tobias Gruber,Fahim Mannan,Florian Kraus,Werner Ritter,Klaus Dietmayer,Felix Heide


Don’t Even Look Once: Synthesizing Features for Zero-Shot Detection
Author: Pengkai Zhu,Hanxiao Wang,Venkatesh Saligrama


EPOS: Estimating 6D Pose of Objects With Symmetries
Author: Tomas Hodan,Daniel Barath,Jiri Matas


Train in Germany, Test in the USA: Making 3D Object Detectors Generalize
Author: Yan Wang,Xiangyu Chen,Yurong You,Li Erran Li,Bharath Hariharan,Mark Campbell,Kilian Q. Weinberger,Wei-Lun Chao


Exploring Categorical Regularization for Domain Adaptive Object Detection
Author: Chang-Dong Xu,Xing-Ran Zhao,Xin Jin,Xiu-Shen Wei


Neural Implicit Embedding for Point Cloud Analysis
Author: Kent Fujiwara,Taiichi Hashimoto


Pose-Guided Visible Part Matching for Occluded Person ReID
Author: Shang Gao,Jingya Wang,Huchuan Lu,Zimo Liu


ContourNet: Taking a Further Step Toward Accurate Arbitrary-Shaped Scene Text Detection
Author: Yuxin Wang,Hongtao Xie,Zheng-Jun Zha,Mengting Xing,Zilong Fu,Yongdong Zhang


Exploring Data Aggregation in Policy Learning for Vision-Based Urban Autonomous Driving
Author: Aditya Prakash,Aseem Behl,Eshed Ohn-Bar,Kashyap Chitta,Andreas Geiger


Look-Into-Object: Self-Supervised Structure Modeling for Object Recognition
Author: Mohan Zhou,Yalong Bai,Wei Zhang,Tiejun Zhao,Tao Mei


Recognizing Objects From Any View With Object and Viewer-Centered Representations
Author: Sainan Liu,Vincent Nguyen,Isaac Rehg,Zhuowen Tu


Gated Channel Transformation for Visual Recognition
Author: Zongxin Yang,Linchao Zhu,Yu Wu,Yi Yang


Non-Local Neural Networks With Grouped Bilinear Attentional Transforms
Author: Lu Chi,Zehuan Yuan,Yadong Mu,Changhu Wang


Generative-Discriminative Feature Representations for Open-Set Recognition
Author: Pramuditha Perera,Vlad I. Morariu,Rajiv Jain,Varun Manjunatha,Curtis Wigington,Vicente Ordonez,Vishal M. Patel


RPM-Net: Robust Point Matching Using Learned Features
Author: Zi Jian Yew,Gim Hee Lee


Sideways: Depth-Parallel Training of Video Models
Author: Mateusz Malinowski,Grzegorz Swirszcz,Joao Carreira,Viorica Patraucean


Basis Prediction Networks for Effective Burst Denoising With Large Kernels
Author: Zhihao Xia,Federico Perazzi,Michael Gharbi,Kalyan Sunkavalli,Ayan Chakrabarti


Private-kNN: Practical Differential Privacy for Computer Vision
Author: Yuqing Zhu,Xiang Yu,Manmohan Chandraker,Yu-Xiang Wang


SP-NAS: Serial-to-Parallel Backbone Search for Object Detection
Author: Chenhan Jiang,Hang Xu,Wei Zhang,Xiaodan Liang,Zhenguo Li


Structure Aware Single-Stage 3D Object Detection From Point Cloud
Author: Chenhang He,Hui Zeng,Jianqiang Huang,Xian-Sheng Hua,Lei Zhang


“Looking at the Right Stuff” - Guided Semantic-Gaze for Autonomous Driving
Author: Anwesan Pal,Sayan Mondal,Henrik I. Christensen


What’s Hidden in a Randomly Weighted Neural Network?
Author: Vivek Ramanujan,Mitchell Wortsman,Aniruddha Kembhavi,Ali Farhadi,Mohammad Rastegari


Structured Multi-Hashing for Model Compression
Author: Elad Eban,Yair Movshovitz-Attias,Hao Wu,Mark Sandler,Andrew Poon,Yerlan Idelbayev,Miguel A. Carreira-Perpinan


DOPS: Learning to Detect 3D Objects and Predict Their 3D Shapes
Author: Mahyar Najibi,Guangda Lai,Abhijit Kundu,Zhichao Lu,Vivek Rathod,Thomas Funkhouser,Caroline Pantofaru,David Ross,Larry S. Davis,Alireza Fathi


AutoTrack: Towards High-Performance Visual Tracking for UAV With Automatic Spatio-Temporal Regularization
Author: Yiming Li,Changhong Fu,Fangqiang Ding,Ziyuan Huang,Geng Lu


GP-NAS: Gaussian Process Based Neural Architecture Search
Author: Zhihang Li,Teng Xi,Jiankang Deng,Gang Zhang,Shengzhao Wen,Ran He


NAS-FCOS: Fast Neural Architecture Search for Object Detection
Author: Ning Wang,Yang Gao,Hao Chen,Peng Wang,Zhi Tian,Chunhua Shen,Yanning Zhang


TCTS: A Task-Consistent Two-Stage Framework for Person Search
Author: Cheng Wang,Bingpeng Ma,Hong Chang,Shiguang Shan,Xilin Chen


SCATTER: Selective Context Attentional Scene Text Recognizer
Author: Ron Litman,Oron Anschel,Shahar Tsiper,Roee Litman,Shai Mazor,R. Manmatha


Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation
Author: Dengsheng Chen,Jun Li,Zheng Wang,Kai Xu


Hierarchical Scene Coordinate Classification and Regression for Visual Localization
Author: Xiaotian Li,Shuzhe Wang,Yi Zhao,Jakob Verbeek,Juho Kannala


MiLeNAS: Efficient Neural Architecture Search via Mixed-Level Reformulation
Author: Chaoyang He,Haishan Ye,Li Shen,Tong Zhang


Scalable Uncertainty for Computer Vision With Functional Variational Inference
Author: Eduardo D. C. Carvalho,Ronald Clark,Andrea Nicastro,Paul H. J. Kelly


Uncertainty-Aware CNNs for Depth Completion: Uncertainty from Beginning to End
Author: Abdelrahman Eldesokey,Michael Felsberg,Karl Holmquist,Michael Persson


Butterfly Transform: An Efficient FFT Based Neural Architecture Design
Author: Keivan Alizadeh vahid,Anish Prabhu,Ali Farhadi,Mohammad Rastegari


A Certifiably Globally Optimal Solution to Generalized Essential Matrix Estimation
Author: Ji Zhao,Wanting Xu,Laurent Kneip


MUXConv: Information Multiplexing in Convolutional Neural Networks
Author: Zhichao Lu,Kalyanmoy Deb,Vishnu Naresh Boddeti


PointGMM: A Neural GMM Network for Point Clouds
Author: Amir Hertz,Rana Hanocka,Raja Giryes,Daniel Cohen-Or


Noisier2Noise: Learning to Denoise From Unpaired Noisy Data
Author: Nick Moran,Dan Schmidt,Yu Zhong,Patrick Coady


TRPLP - Trifocal Relative Pose From Lines at Points
Author: Ricardo Fabbri,Timothy Duff,Hongyi Fan,Margaret H. Regan,David da Costa de Pinho,Elias Tsigaridas,Charles W. Wampler,Jonathan D. Hauenstein,Peter J. Giblin,Benjamin Kimia,Anton Leykin,Tomas Pajdla


DSNAS: Direct Neural Architecture Search Without Parameter Retraining
Author: Shoukang Hu,Sirui Xie,Hehui Zheng,Chunxiao Liu,Jianping Shi,Xunying Liu,Dahua Lin


MonoPair: Monocular 3D Object Detection Using Pairwise Spatial Relationships
Author: Yongjian Chen,Lei Tai,Kai Sun,Mingyang Li


Regularization on Spatio-Temporally Smoothed Feature for Action Recognition
Author: Jinhyung Kim,Seunghwan Cha,Dongyoon Wee,Soonmin Bae,Junmo Kim


Towards Accurate Scene Text Recognition With Semantic Reasoning Networks
Author: Deli Yu,Xuan Li,Chengquan Zhang,Tao Liu,Junyu Han,Jingtuo Liu,Errui Ding


Unsupervised Reinforcement Learning of Transferable Meta-Skills for Embodied Navigation
Author: Juncheng Li,Xin Wang,Siliang Tang,Haizhou Shi,Fei Wu,Yueting Zhuang,William Yang Wang


Inferring Attention Shift Ranks of Objects for Image Saliency
Author: Avishek Siris,Jianbo Jiao,Gary K.L. Tam,Xianghua Xie,Rynson W.H. Lau


Camera On-Boarding for Person Re-Identification Using Hypothesis Transfer Learning
Author: Sk Miraj Ahmed,Aske R. Lejbolle,Rameswar Panda,Amit K. Roy-Chowdhury


Joint Graph-Based Depth Refinement and Normal Estimation
Author: Mattia Rossi,Mireille El Gheche,Andreas Kuhn,Pascal Frossard


DR Loss: Improving Object Detection by Distributional Ranking
Author: Qi Qian,Lei Chen,Hao Li,Rong Jin


Self-Trained Deep Ordinal Regression for End-to-End Video Anomaly Detection
Author: Guansong Pang,Cheng Yan,Chunhua Shen,Anton van den Hengel,Xiao Bai


Few-Shot Class-Incremental Learning
Author: Xiaoyu Tao,Xiaopeng Hong,Xinyuan Chang,Songlin Dong,Xing Wei,Yihong Gong


PolarMask: Single Shot Instance Segmentation With Polar Representation
Author: Enze Xie,Peize Sun,Xiaoge Song,Wenhai Wang,Xuebo Liu,Ding Liang,Chunhua Shen,Ping Luo


DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover’s Distance and Structured Classifiers
Author: Chi Zhang,Yujun Cai,Guosheng Lin,Chunhua Shen


Detection in Crowded Scenes: One Proposal, Multiple Predictions
Author: Xuangeng Chu,Anlin Zheng,Xiangyu Zhang,Jian Sun


Autolabeling 3D Objects With Differentiable Rendering of SDF Shape Priors
Author: Sergey Zakharov,Wadim Kehl,Arjun Bhargava,Adrien Gaidon


Interactive Object Segmentation With Inside-Outside Guidance
Author: Shiyin Zhang,Jun Hao Liew,Yunchao Wei,Shikui Wei,Yao Zhao


Mnemonics Training: Multi-Class Incremental Learning Without Forgetting
Author: Yaoyao Liu,Yuting Su,An-An Liu,Bernt Schiele,Qianru Sun


Learning to Segment 3D Point Clouds in 2D Image Space
Author: Yecheng Lyu,Xinming Huang,Ziming Zhang


Smooth Shells: Multi-Scale Shape Registration With Functional Maps
Author: Marvin Eisenberger,Zorah Lahner,Daniel Cremers


Self-Supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation
Author: Yude Wang,Jie Zhang,Meina Kan,Shiguang Shan,Xilin Chen


Efficient Neural Vision Systems Based on Convolutional Image Acquisition
Author: Pedram Pad,Simon Narduzzi,Clement Kundig,Engin Turetken,Siavash A. Bigdeli,L. Andrea Dunbar


Visual Chirality
Author: Zhiqiu Lin,Jin Sun,Abe Davis,Noah Snavely


What Machines See Is Not What They Get: Fooling Scene Text Recognition Models With Adversarial Text Images
Author: Xing Xu,Jiefu Chen,Jinhui Xiao,Lianli Gao,Fumin Shen,Heng Tao Shen


Dynamic Traffic Modeling From Overhead Imagery
Author: Scott Workman,Nathan Jacobs


Satellite Image Time Series Classification With Pixel-Set Encoders and Temporal Self-Attention
Author: Vivien Sainte Fare Garnot,Loic Landrieu,Sebastien Giordano,Nesrine Chehata


DAVD-Net: Deep Audio-Aided Video Decompression of Talking Heads
Author: Xi Zhang,Xiaolin Wu,Xinliang Zhai,Xianye Ben,Chengjie Tu


Learning When and Where to Zoom With Deep Reinforcement Learning
Author: Burak Uzkent,Stefano Ermon


Cross-Domain Detection via Graph-Induced Prototype Alignment
Author: Minghao Xu,Hang Wang,Bingbing Ni,Qi Tian,Wenjun Zhang


Meta-Learning of Neural Architectures for Few-Shot Learning
Author: Thomas Elsken,Benedikt Staffler,Jan Hendrik Metzen,Frank Hutter


Towards Inheritable Models for Open-Set Domain Adaptation
Author: Jogendra Nath Kundu,Naveen Venkat,Ambareesh Revanur,Rahul M V,R. Venkatesh Babu


Learning From Synthetic Animals
Author: Jiteng Mu,Weichao Qiu,Gregory D. Hager,Alan L. Yuille


Distilling Cross-Task Knowledge via Relationship Matching
Author: Han-Jia Ye,Su Lu,De-Chuan Zhan


Open Compound Domain Adaptation
Author: Ziwei Liu,Zhongqi Miao,Xingang Pan,Xiaohang Zhan,Dahua Lin,Stella X. Yu,Boqing Gong


Context Prior for Scene Segmentation
Author: Changqian Yu,Jingbo Wang,Changxin Gao,Gang Yu,Chunhua Shen,Nong Sang


Tangent Images for Mitigating Spherical Distortion
Author: Marc Eder,Mykhailo Shvets,John Lim,Jan-Michael Frahm


Learning a Dynamic Map of Visual Appearance
Author: Tawfiq Salem,Scott Workman,Nathan Jacobs


Webly Supervised Knowledge Embedding Model for Visual Reasoning
Author: Wenbo Zheng,Lan Yan,Chao Gou,Fei-Yue Wang


Gradually Vanishing Bridge for Adversarial Domain Adaptation
Author: Shuhao Cui,Shuhui Wang,Junbao Zhuo,Chi Su,Qingming Huang,Qi Tian


Active Speakers in Context
Author: Juan Leon Alcazar,Fabian Caba,Long Mai,Federico Perazzi,Joon-Young Lee,Pablo Arbelaez,Bernard Ghanem


Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation
Author: Bowen Cheng,Maxwell D. Collins,Yukun Zhu,Ting Liu,Thomas S. Huang,Hartwig Adam,Liang-Chieh Chen


Inter-Region Affinity Distillation for Road Marking Segmentation
Author: Yuenan Hou,Zheng Ma,Chunxiao Liu,Tak-Wai Hui,Chen Change Loy


Unified Dynamic Convolutional Network for Super-Resolution With Variational Degradations
Author: Yu-Syuan Xu,Shou-Yao Roy Tseng,Yu Tseng,Hsien-Kai Kuo,Yi-Min Tsai


Making Better Mistakes: Leveraging Class Hierarchies With Deep Networks
Author: Luca Bertinetto,Romain Mueller,Konstantinos Tertikas,Sina Samangooei,Nicholas A. Lord


Data-Free Knowledge Amalgamation via Group-Stack Dual-GAN
Author: Jingwen Ye,Yixin Ji,Xinchao Wang,Xin Gao,Mingli Song


Screencast Tutorial Video Understanding
Author: Kunpeng Li,Chen Fang,Zhaowen Wang,Seokhwan Kim,Hailin Jin,Yun Fu


DSGN: Deep Stereo Geometry Network for 3D Object Detection
Author: Yilun Chen,Shu Liu,Xiaoyong Shen,Jiaya Jia


Weakly-Supervised Salient Object Detection via Scribble Annotations
Author: Jing Zhang,Xin Yu,Aixuan Li,Peipei Song,Bowen Liu,Yuchao Dai


Learning to Learn Single Domain Generalization
Author: Fengchun Qiao,Long Zhao,Xi Peng


Severity-Aware Semantic Segmentation With Reinforced Wasserstein Training
Author: Xiaofeng Liu,Wenxuan Ji,Jane You,Georges El Fakhri,Jonghye Woo


Boosting Few-Shot Learning With Adaptive Margin Loss
Author: Aoxue Li,Weiran Huang,Xu Lan,Jiashi Feng,Zhenguo Li,Liwei Wang


JA-POLS: A Moving-Camera Background Model via Joint Alignment and Partially-Overlapping Local Subspaces
Author: Irit Chelly,Vlad Winter,Dor Litvak,David Rosen,Oren Freifeld


AugFPN: Improving Multi-Scale Feature Learning for Object Detection
Author: Chaoxu Guo,Bin Fan,Qian Zhang,Shiming Xiang,Chunhong Pan


xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
Author: Maximilian Jaritz,Tuan-Hung Vu,Raoul de Charette,Emilie Wirbel,Patrick Perez


Norm-Aware Embedding for Efficient Person Search
Author: Di Chen,Shanshan Zhang,Jian Yang,Bernt Schiele


Intelligent Home 3D: Automatic 3D-House Design From Linguistic Descriptions Only
Author: Qi Chen,Qi Wu,Rui Tang,Yuhan Wang,Shuai Wang,Mingkui Tan


Differential Treatment for Stuff and Things: A Simple Unsupervised Domain Adaptation Method for Semantic Segmentation
Author: Zhonghao Wang,Mo Yu,Yunchao Wei,Rogerio Feris,Jinjun Xiong,Wen-mei Hwu,Thomas S. Huang,Honghui Shi


Robust Object Detection Under Occlusion With Context-Aware CompositionalNets
Author: Angtian Wang,Yihong Sun,Adam Kortylewski,Alan L. Yuille


IMRAM: Iterative Matching With Recurrent Attention Memory for Cross-Modal Image-Text Retrieval
Author: Hui Chen,Guiguang Ding,Xudong Liu,Zijia Lin,Ji Liu,Jungong Han


Domain-Aware Visual Bias Eliminating for Generalized Zero-Shot Learning
Author: Shaobo Min,Hantao Yao,Hongtao Xie,Chaoqun Wang,Zheng-Jun Zha,Yongdong Zhang


Semi-Supervised Semantic Segmentation With Cross-Consistency Training
Author: Yassine Ouali,Celine Hudelot,Myriam Tami


Learning to Learn Cropping Models for Different Aspect Ratio Requirements
Author: Debang Li,Junge Zhang,Kaiqi Huang


What Makes Training Multi-Modal Classification Networks Hard?
Author: Weiyao Wang,Du Tran,Matt Feiszli


Selective Transfer With Reinforced Transfer Network for Partial Domain Adaptation
Author: Zhihong Chen,Chao Chen,Zhaowei Cheng,Boyuan Jiang,Ke Fang,Xinyu Jin


Semi-Supervised Semantic Image Segmentation With Self-Correcting Networks
Author: Mostafa S. Ibrahim,Arash Vahdat,Mani Ranjbar,William G. Macready


Exemplar Normalization for Learning Deep Representation
Author: Ruimao Zhang,Zhanglin Peng,Lingyun Wu,Zhen Li,Ping Luo


Imitative Non-Autoregressive Modeling for Trajectory Forecasting and Imputation
Author: Mengshi Qi,Jie Qin,Yu Wu,Yi Yang


Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text
Author: Difei Gao,Ke Li,Ruiping Wang,Shiguang Shan,Xilin Chen


StereoGAN: Bridging Synthetic-to-Real Domain Gap by Joint Optimization of Domain Translation and Stereo Matching
Author: Rui Liu,Chengxi Yang,Wenxiu Sun,Xiaogang Wang,Hongsheng Li


Self-Supervised Domain-Aware Generative Network for Generalized Zero-Shot Learning
Author: Jiamin Wu,Tianzhu Zhang,Zheng-Jun Zha,Jiebo Luo,Yongdong Zhang,Feng Wu


Sparse Layered Graphs for Multi-Object Segmentation
Author: Niels Jeppesen,Anders N. Christensen,Vedrana A. Dahl,Anders B. Dahl


Visual-Semantic Matching by Exploring High-Order Attention and Distraction
Author: Yongzhi Li,Duo Zhang,Yadong Mu


End-to-End 3D Point Cloud Instance Segmentation Without Detection
Author: Haiyong Jiang,Feilong Yan,Jianfei Cai,Jianmin Zheng,Jun Xiao


Deep Adversarial Decomposition: A Unified Framework for Separating Superimposed Images
Author: Zhengxia Zou,Sen Lei,Tianyang Shi,Zhenwei Shi,Jieping Ye


Differentiable Adaptive Computation Time for Visual Reasoning
Author: Cristobal Eyzaguirre,Alvaro Soto


DeepLPF: Deep Local Parametric Filters for Image Enhancement
Author: Sean Moran,Pierre Marza,Steven McDonagh,Sarah Parisot,Gregory Slabaugh


Instance Credibility Inference for Few-Shot Learning
Author: Yikai Wang,Chengming Xu,Chen Liu,Li Zhang,Yanwei Fu


Learning From Web Data With Self-Organizing Memory Module
Author: Yi Tu,Li Niu,Junjie Chen,Dawei Cheng,Liqing Zhang


TransMatch: A Transfer-Learning Scheme for Semi-Supervised Few-Shot Learning
Author: Zhongjie Yu,Lin Chen,Zhongwei Cheng,Jiebo Luo


Learning the Redundancy-Free Features for Generalized Zero-Shot Object Recognition
Author: Zongyan Han,Zhenyong Fu,Jian Yang


Neural Topological SLAM for Visual Navigation
Author: Devendra Singh Chaplot,Ruslan Salakhutdinov,Abhinav Gupta,Saurabh Gupta


WaveletStereo: Learning Wavelet Coefficients of Disparity Map in Stereo Matching
Author: Menglong Yang,Fangrui Wu,Wei Li


Robust Superpixel-Guided Attentional Adversarial Attack
Author: Xiaoyi Dong,Jiangfan Han,Dongdong Chen,Jiayang Liu,Huanyu Bian,Zehua Ma,Hongsheng Li,Xiaogang Wang,Weiming Zhang,Nenghai Yu


BEDSR-Net: A Deep Shadow Removal Network From a Single Document Image
Author: Yun-Hsuan Lin,Wen-Chin Chen,Yung-Yu Chuang


Cross-Domain Document Object Detection: Benchmark Suite and Method
Author: Kai Li,Curtis Wigington,Chris Tensmeyer,Handong Zhao,Nikolaos Barmpalios,Vlad I. Morariu,Varun Manjunatha,Tong Sun,Yun Fu


Explaining Knowledge Distillation by Quantifying the Knowledge
Author: Xu Cheng,Zhefan Rao,Yilan Chen,Quanshi Zhang


Exploring Bottom-Up and Top-Down Cues With Attentive Learning for Webly Supervised Object Detection
Author: Zhonghua Wu,Qingyi Tao,Guosheng Lin,Jianfei Cai


Enhancing Generic Segmentation With Learned Region Representations
Author: Or Isaacs,Oran Shayer,Michael Lindenbaum


Adaptive Hierarchical Down-Sampling for Point Cloud Classification
Author: Ehsan Nezhadarya,Ehsan Taghavi,Ryan Razani,Bingbing Liu,Jun Luo


FBNetV2: Differentiable Neural Architecture Search for Spatial and Channel Dimensions
Author: Alvin Wan,Xiaoliang Dai,Peizhao Zhang,Zijian He,Yuandong Tian,Saining Xie,Bichen Wu,Matthew Yu,Tao Xu,Kan Chen,Peter Vajda,Joseph E. Gonzalez


Learning Texture Invariant Representation for Domain Adaptation of Semantic Segmentation
Author: Myeongjin Kim,Hyeran Byun


Putting Visual Object Recognition in Context
Author: Mengmi Zhang,Claire Tseng,Gabriel Kreiman


SLV: Spatial Likelihood Voting for Weakly Supervised Object Detection
Author: Ze Chen,Zhihang Fu,Rongxin Jiang,Yaowu Chen,Xian-Sheng Hua


Universal Weighting Metric Learning for Cross-Modal Matching
Author: Jiwei Wei,Xing Xu,Yang Yang,Yanli Ji,Zheng Wang,Heng Tao Shen


IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous Driving
Author: Wanli Peng,Hao Pan,He Liu,Yi Sun


Label Decoupling Framework for Salient Object Detection
Author: Jun Wei,Shuhui Wang,Zhe Wu,Chi Su,Qingming Huang,Qi Tian


Transform and Tell: Entity-Aware News Image Captioning
Author: Alasdair Tran,Alexander Mathews,Lexing Xie


HAMBox: Delving Into Mining High-Quality Anchors on Face Detection
Author: Yang Liu,Xu Tang,Junyu Han,Jingtuo Liu,Dinger Rui,Xiang Wu


Hierarchical Feature Embedding for Attribute Recognition
Author: Jie Yang,Jiarou Fan,Yiru Wang,Yige Wang,Weihao Gan,Lin Liu,Wei Wu


Squeeze-and-Attention Networks for Semantic Segmentation
Author: Zilong Zhong,Zhong Qiu Lin,Rene Bidart,Xiaodan Hu,Ibrahim Ben Daya,Zhifeng Li,Wei-Shi Zheng,Jonathan Li,Alexander Wong


Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection
Author: Sara Beery,Guanhang Wu,Vivek Rathod,Ronny Votel,Jonathan Huang


Mixture Dense Regression for Object Detection and Human Pose Estimation
Author: Ali Varamesh,Tinne Tuytelaars


Syntax-Aware Action Targeting for Video Captioning
Author: Qi Zheng,Chaoyue Wang,Dacheng Tao


Learning Visual Emotion Representations From Web Data
Author: Zijun Wei,Jianming Zhang,Zhe Lin,Joon-Young Lee,Niranjan Balasubramanian,Minh Hoai,Dimitris Samaras


The Edge of Depth: Explicit Constraints Between Segmentation and Depth
Author: Shengjie Zhu,Garrick Brazil,Xiaoming Liu


A Context-Aware Loss Function for Action Spotting in Soccer Videos
Author: Anthony Cioppa,Adrien Deliege,Silvio Giancola,Bernard Ghanem,Marc Van Droogenbroeck,Rikke Gade,Thomas B. Moeslund


Towards Learning a Generic Agent for Vision-and-Language Navigation via Pre-Training
Author: Weituo Hao,Chunyuan Li,Xiujun Li,Lawrence Carin,Jianfeng Gao


Video Instance Segmentation Tracking With a Modified VAE Architecture
Author: Chung-Ching Lin,Ying Hung,Rogerio Feris,Linglin He


Deformation-Aware Unpaired Image Translation for Pose Estimation on Laboratory Animals
Author: Siyuan Li,Semih Gunel,Mirela Ostrek,Pavan Ramdya,Pascal Fua,Helge Rhodin


ZeroQ: A Novel Zero Shot Quantization Framework
Author: Yaohui Cai,Zhewei Yao,Zhen Dong,Amir Gholami,Michael W. Mahoney,Kurt Keutzer


Disparity-Aware Domain Adaptation in Stereo Image Restoration
Author: Bo Yan,Chenxi Ma,Bahetiyaer Bare,Weimin Tan,Steven C. H. Hoi


Offset Bin Classification Network for Accurate Object Detection
Author: Heqian Qiu,Hongliang Li,Qingbo Wu,Hengcan Shi


TBT: Targeted Neural Network Attack With Bit Trojan
Author: Adnan Siraj Rakin,Zhezhi He,Deliang Fan


Maintaining Discrimination and Fairness in Class Incremental Learning
Author: Bowen Zhao,Xi Xiao,Guojun Gan,Bin Zhang,Shu-Tao Xia


Background Data Resampling for Outlier-Aware Classification
Author: Yi Li,Nuno Vasconcelos


STEFANN: Scene Text Editor Using Font Adaptive Neural Network
Author: Prasun Roy,Saumik Bhattacharya,Subhankar Ghosh,Umapada Pal


Geometry and Learning Co-Supported Normal Estimation for Unstructured Point Cloud
Author: Haoran Zhou,Honghua Chen,Yidan Feng,Qiong Wang,Jing Qin,Haoran Xie,Fu Lee Wang,Mingqiang Wei,Jun Wang


Sequential Motif Profiles and Topological Plots for Offline Signature Verification
Author: Elias N. Zois,Evangelos Zervas,Dimitrios Tsourounis,George Economou


Optical Flow in Dense Foggy Scenes Using Semi-Supervised Learning
Author: Wending Yan,Aashish Sharma,Robby T. Tan


A Spatial RNN Codec for End-to-End Image Compression
Author: Chaoyi Lin,Jiabao Yao,Fangdong Chen,Li Wang


Object Relational Graph With Teacher-Recommended Learning for Video Captioning
Author: Ziqi Zhang,Yaya Shi,Chunfeng Yuan,Bing Li,Peijin Wang,Weiming Hu,Zheng-Jun Zha


MMTM: Multimodal Transfer Module for CNN Fusion
Author: Hamid Reza Vaezi Joze,Amirreza Shaban,Michael L. Iuzzolino,Kazuhito Koishida


Generalized Zero-Shot Learning via Over-Complete Distribution
Author: Rohit Keshari,Richa Singh,Mayank Vatsa


Gait Recognition via Semi-supervised Disentangled Representation Learning to Identity and Covariate Features
Author: Xiang Li,Yasushi Makihara,Chi Xu,Yasushi Yagi,Mingwu Ren


Unifying Training and Inference for Panoptic Segmentation
Author: Qizhu Li,Xiaojuan Qi,Philip H.S. Torr


Associate-3Ddet: Perceptual-to-Conceptual Association for 3D Point Cloud Object Detection
Author: Liang Du,Xiaoqing Ye,Xiao Tan,Jianfeng Feng,Zhenbo Xu,Errui Ding,Shilei Wen


Interactive Image Segmentation With First Click Attention
Author: Zheng Lin,Zhao Zhang,Lin-Zhuo Chen,Ming-Ming Cheng,Shao-Ping Lu


NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection
Author: Yazhao Li,Yanwei Pang,Jianbing Shen,Jiale Cao,Ling Shao


Scale-Equalizing Pyramid Convolution for Object Detection
Author: Xinjiang Wang,Shilong Zhang,Zhuoran Yu,Litong Feng,Wayne Zhang


Learning to Cluster Faces via Confidence and Connectivity Estimation
Author: Lei Yang,Dapeng Chen,Xiaohang Zhan,Rui Zhao,Chen Change Loy,Dahua Lin


Cross-Modality Person Re-Identification With Shared-Specific Feature Transfer
Author: Yan Lu,Yue Wu,Bin Liu,Tianzhu Zhang,Baopu Li,Qi Chu,Nenghai Yu


DPGN: Distribution Propagation Graph Network for Few-Shot Learning
Author: Ling Yang,Liangliang Li,Zilun Zhang,Xinyu Zhou,Erjin Zhou,Yu Liu


Density-Aware Graph for Deep Semi-Supervised Visual Recognition
Author: Suichan Li,Bin Liu,Dongdong Chen,Qi Chu,Lu Yuan,Nenghai Yu


Unsupervised Multi-Modal Image Registration via Geometry Preserving Image-to-Image Translation
Author: Moab Arar,Yiftach Ginger,Dov Danon,Amit H. Bermano,Daniel Cohen-Or


Binarizing MobileNet via Evolution-Based Searching
Author: Hai Phan,Zechun Liu,Dang Huynh,Marios Savvides,Kwang-Ting Cheng,Zhiqiang Shen


Temporal-Context Enhanced Detection of Heavily Occluded Pedestrians
Author: Jialian Wu,Chunluan Zhou,Ming Yang,Qian Zhang,Yuan Li,Junsong Yuan


Orderless Recurrent Models for Multi-Label Classification
Author: Vacit Oguz Yazici,Abel Gonzalez-Garcia,Arnau Ramisa,Bartlomiej Twardowski,Joost van de Weijer


Gold Seeker: Information Gain From Policy Distributions for Goal-Oriented Vision-and-Langauge Reasoning
Author: Ehsan Abbasnejad,Iman Abbasnejad,Qi Wu,Javen Shi,Anton van den Hengel


Rethinking the Route Towards Weakly Supervised Object Localization
Author: Chen-Lin Zhang,Yun-Hao Cao,Jianxin Wu


Adversarial Feature Hallucination Networks for Few-Shot Learning
Author: Kai Li,Yulun Zhang,Kunpeng Li,Yun Fu


Conditional Gaussian Distribution Learning for Open Set Recognition
Author: Xin Sun,Zhenning Yang,Chi Zhang,Keck-Voon Ling,Guohao Peng


Connect-and-Slice: An Hybrid Approach for Reconstructing 3D Objects
Author: Hao Fang,Florent Lafarge


Attentive Weights Generation for Few Shot Learning via Information Maximization
Author: Yiluan Guo,Ngai-Man Cheung


Assessing Eye Aesthetics for Automatic Multi-Reference Eye In-Painting
Author: Bo Yan,Qing Lin,Weimin Tan,Shili Zhou


PuppeteerGAN: Arbitrary Portrait Animation With Semantic-Aware Appearance Transformation
Author: Zhuo Chen,Chaoyue Wang,Bo Yuan,Dacheng Tao


SEED: Semantics Enhanced Encoder-Decoder Framework for Scene Text Recognition
Author: Zhi Qiao,Yu Zhou,Dongbao Yang,Yucan Zhou,Weiping Wang


Texture and Shape Biased Two-Stream Networks for Clothing Classification and Attribute Recognition
Author: Yuwei Zhang,Peng Zhang,Chun Yuan,Zhi Wang


Distortion Agnostic Deep Watermarking
Author: Xiyang Luo,Ruohan Zhan,Huiwen Chang,Feng Yang,Peyman Milanfar


RMP-SNN: Residual Membrane Potential Neuron for Enabling Deeper High-Accuracy and Low-Latency Spiking Neural Network
Author: Bing Han,Gopalakrishnan Srinivasan,Kaushik Roy


BFBox: Searching Face-Appropriate Backbone and Feature Pyramid Network for Face Detector
Author: Yang Liu,Xu Tang


PFCNN: Convolutional Neural Networks on 3D Surfaces Using Parallel Frames
Author: Yuqi Yang,Shilin Liu,Hao Pan,Yang Liu,Xin Tong


iTAML: An Incremental Task-Agnostic Meta-learning Approach
Author: Jathushan Rajasegaran,Salman Khan,Munawar Hayat,Fahad Shahbaz Khan,Mubarak Shah


Optimal least-squares solution to the hand-eye calibration problem
Author: Amit Dekel,Linus Harenstam-Nielsen,Sergio Caccamo


MnasFPN: Learning Latency-Aware Pyramid Architecture for Object Detection on Mobile Devices
Author: Bo Chen,Golnaz Ghiasi,Hanxiao Liu,Tsung-Yi Lin,Dmitry Kalenichenko,Hartwig Adam,Quoc V. Le


VSGNet: Spatial Attention Network for Detecting Human Object Interactions Using Graph Convolutions
Author: Oytun Ulutan,A S M Iftekhar,B. S. Manjunath


End-to-End Camera Calibration for Broadcast Videos
Author: Long Sha,Jennifer Hobbs,Panna Felsen,Xinyu Wei,Patrick Lucey,Sujoy Ganguly


Regularizing CNN Transfer Learning With Randomised Regression
Author: Yang Zhong,Atsuto Maki


KeypointNet: A Large-Scale 3D Keypoint Dataset Aggregated From Numerous Human Annotations
Author: Yang You,Yujing Lou,Chengkun Li,Zhoujun Cheng,Liangwei Li,Lizhuang Ma,Cewu Lu,Weiming Wang


Hierarchical Clustering With Hard-Batch Triplet Loss for Person Re-Identification
Author: Kaiwei Zeng,Munan Ning,Yaohua Wang,Yang Guo


Joint Semantic Segmentation and Boundary Detection Using Iterative Pyramid Contexts
Author: Mingmin Zhen,Jinglu Wang,Lei Zhou,Shiwei Li,Tianwei Shen,Jiaxiang Shang,Tian Fang,Long Quan


Attention-Guided Hierarchical Structure Aggregation for Image Matting
Author: Yu Qiao,Yuhao Liu,Xin Yang,Dongsheng Zhou,Mingliang Xu,Qiang Zhang,Xiaopeng Wei


MetaFuse: A Pre-trained Fusion Model for Human Pose Estimation
Author: Rongchang Xie,Chunyu Wang,Yizhou Wang


Prior Guided GAN Based Semantic Inpainting
Author: Avisek Lahiri,Arnav Kumar Jain,Sanskar Agrawal,Pabitra Mitra,Prabir Kumar Biswas


Weakly Supervised Semantic Point Cloud Segmentation: Towards 10x Fewer Labels
Author: Xun Xu,Gim Hee Lee


Physically Realizable Adversarial Examples for LiDAR Object Detection
Author: James Tu,Mengye Ren,Sivabalan Manivasagam,Ming Liang,Bin Yang,Richard Du,Frank Cheng,Raquel Urtasun


Combating Noisy Labels by Agreement: A Joint Training Method with Co-Regularization
Author: Hongxin Wei,Lei Feng,Xiangyu Chen,Bo An


Light-weight Calibrator: A Separable Component for Unsupervised Domain Adaptation
Author: Shaokai Ye,Kailu Wu,Mu Zhou,Yunfei Yang,Sia Huat Tan,Kaidi Xu,Jiebo Song,Chenglong Bao,Kaisheng Ma


Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition
Author: Canjie Luo,Yuanzhi Zhu,Lianwen Jin,Yongpan Wang


Learning Selective Self-Mutual Attention for RGB-D Saliency Detection
Author: Nian Liu,Ni Zhang,Junwei Han


Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation
Author: Yangtao Zheng,Di Huang,Songtao Liu,Yunhong Wang


Estimating Low-Rank Region Likelihood Maps
Author: Gabriela Csurka,Zoltan Kato,Andor Juhasz,Martin Humenberger


Neural Head Reenactment with Latent Pose Descriptors
Author: Egor Burkov,Igor Pasechnik,Artur Grigorev,Victor Lempitsky


Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis
Author: K R Prajwal,Rudrabha Mukhopadhyay,Vinay P. Namboodiri,C.V. Jawahar


Self-Supervised Learning of Video-Induced Visual Invariances
Author: Michael Tschannen,Josip Djolonga,Marvin Ritter,Aravindh Mahendran,Neil Houlsby,Sylvain Gelly,Mario Lucic


Two-Stage Peer-Regularized Feature Recombination for Arbitrary Image Style Transfer
Author: Jan Svoboda,Asha Anoosheh,Christian Osendorfer,Jonathan Masci


MINA: Convex Mixed-Integer Programming for Non-Rigid Shape Alignment
Author: Florian Bernard,Zeeshan Khan Suri,Christian Theobalt


Improving One-Shot NAS by Suppressing the Posterior Fading
Author: Xiang Li,Chen Lin,Chuming Li,Ming Sun,Wei Wu,Junjie Yan,Wanli Ouyang


Incremental Few-Shot Object Detection
Author: Juan-Manuel Perez-Rua,Xiatian Zhu,Timothy M. Hospedales,Tao Xiang


Synthetic Learning: Learn From Distributed Asynchronized Discriminator GAN Without Sharing Medical Image Data
Author: Qi Chang,Hui Qu,Yikai Zhang,Mert Sabuncu,Chao Chen,Tong Zhang,Dimitris N. Metaxas


Exploring Category-Agnostic Clusters for Open-Set Domain Adaptation
Author: Yingwei Pan,Ting Yao,Yehao Li,Chong-Wah Ngo,Tao Mei


Regularizing Class-Wise Predictions via Self-Knowledge Distillation
Author: Sukmin Yun,Jongjin Park,Kimin Lee,Jinwoo Shin


Hierarchical Graph Attention Network for Visual Relationship Detection
Author: Li Mi,Zhenzhong Chen


M2m: Imbalanced Classification via Major-to-Minor Translation
Author: Jaehyung Kim,Jongheon Jeong,Jinwoo Shin


CenterMask: Real-Time Anchor-Free Instance Segmentation
Author: Youngwan Lee,Jongyoul Park


Multi-Path Learning for Object Pose Estimation Across Domains
Author: Martin Sundermeyer,Maximilian Durner,En Yen Puang,Zoltan-Csaba Marton,Narunas Vaskevicius,Kai O. Arras,Rudolph Triebel


Incremental Learning in Online Scenario
Author: Jiangpeng He,Runyu Mao,Zeman Shao,Fengqing Zhu


Enhanced Transport Distance for Unsupervised Domain Adaptation
Author: Mengxue Li,Yi-Ming Zhai,You-Wei Luo,Peng-Fei Ge,Chuan-Xian Ren


TESA: Tensor Element Self-Attention via Matricization
Author: Francesca Babiloni,Ioannis Marras,Gregory Slabaugh,Stefanos Zafeiriou


Training a Steerable CNN for Guidewire Detection
Author: Donghang Li,Adrian Barbu


Superpixel Segmentation With Fully Convolutional Networks
Author: Fengting Yang,Qian Sun,Hailin Jin,Zihan Zhou


SharinGAN: Combining Synthetic and Real Data for Unsupervised Geometry Estimation
Author: Koutilya PNVR,Hao Zhou,David Jacobs


Label Distribution Learning on Auxiliary Label Space Graphs for Facial Expression Recognition
Author: Shikai Chen,Jianfeng Wang,Yuedong Chen,Zhongchao Shi,Xin Geng,Yong Rui


Deep Residual Flow for Out of Distribution Detection
Author: Ev Zisselman,Aviv Tamar


FeatureFlow: Robust Video Interpolation via Structure-to-Texture Generation
Author: Shurui Gui,Chaoyue Wang,Qihua Chen,Dacheng Tao


Learning Nanoscale Motion Patterns of Vesicles in Living Cells
Author: Arif Ahmed Sekh,Ida Sundvor Opstad,Asa Birna Birgisdottir,Truls Myrmel,Balpreet Singh Ahluwalia,Krishna Agarwal,Dilip K. Prasad


Improving Action Segmentation via Graph-Based Temporal Reasoning
Author: Yifei Huang,Yusuke Sugano,Yoichi Sato


Episode-Based Prototype Generating Network for Zero-Shot Learning
Author: Yunlong Yu,Zhong Ji,Jungong Han,Zhongfei Zhang


Learning to Segment the Tail
Author: Xinting Hu,Yi Jiang,Kaihua Tang,Jingyuan Chen,Chunyan Miao,Hanwang Zhang


Learning to Evaluate Perception Models Using Planner-Centric Metrics
Author: Jonah Philion,Amlan Kar,Sanja Fidler


Where, What, Whether: Multi-Modal Learning Meets Pedestrian Detection
Author: Yan Luo,Chongyang Zhang,Muming Zhao,Hao Zhou,Jun Sun


CoverNet: Multimodal Behavior Prediction Using Trajectory Sets
Author: Tung Phan-Minh,Elena Corina Grigore,Freddy A. Boulton,Oscar Beijbom,Eric M. Wolff


Real-World Person Re-Identification via Degradation Invariance Learning
Author: Yukun Huang,Zheng-Jun Zha,Xueyang Fu,Richang Hong,Liang Li


Defending and Harnessing the Bit-Flip Based Adversarial Weight Attack
Author: Zhezhi He,Adnan Siraj Rakin,Jingtao Li,Chaitali Chakrabarti,Deliang Fan


Adversarial Latent Autoencoders
Author: Stanislav Pidhorskyi,Donald A. Adjeroh,Gianfranco Doretto


Adaptive Fractional Dilated Convolution Network for Image Aesthetics Assessment
Author: Qiuyu Chen,Wei Zhang,Ning Zhou,Peng Lei,Yi Xu,Yu Zheng,Jianping Fan


Deep Generative Model for Robust Imbalance Classification
Author: Xinyue Wang,Yilin Lyu,Liping Jing


Learning Deep Network for Detecting 3D Object Keypoints and 6D Poses
Author: Wanqing Zhao,Shaobo Zhang,Ziyu Guan,Wei Zhao,Jinye Peng,Jianping Fan


MetaIQA: Deep Meta-Learning for No-Reference Image Quality Assessment
Author: Hancheng Zhu,Leida Li,Jinjian Wu,Weisheng Dong,Guangming Shi


Sketchformer: Transformer-Based Representation for Sketched Structure
Author: Leo Sampaio Ferraz Ribeiro,Tu Bui,John Collomosse,Moacir Ponti


Cylindrical Convolutional Networks for Joint Object Detection and Viewpoint Estimation
Author: Sunghun Joung,Seungryong Kim,Hanjae Kim,Minsu Kim,Ig-Jae Kim,Junghyun Cho,Kwanghoon Sohn


Learning a Unified Sample Weighting Network for Object Detection
Author: Qi Cai,Yingwei Pan,Yu Wang,Jingen Liu,Ting Yao,Tao Mei


Old Is Gold: Redefining the Adversarially Learned One-Class Classifier Training Paradigm
Author: Muhammad Zaigham Zaheer,Jin-Ha Lee,Marcella Astrid,Seung-Ik Lee


An Adaptive Neural Network for Unsupervised Mosaic Consistency Analysis in Image Forensics
Author: Quentin Bammey,Rafael Grompone von Gioi,Jean-Michel Morel


McFlow: Monte Carlo Flow Models for Data Imputation
Author: Trevor W. Richardson,Wencheng Wu,Lei Lin,Beilei Xu,Edgar A. Bernal


Learning to See Through Obstructions
Author: Yu-Lun Liu,Wei-Sheng Lai,Ming-Hsuan Yang,Yung-Yu Chuang,Jia-Bin Huang


GaitPart: Temporal Part-Based Model for Gait Recognition
Author: Chao Fan,Yunjie Peng,Chunshui Cao,Xu Liu,Saihui Hou,Jiannan Chi,Yongzhen Huang,Qing Li,Zhiqiang He


EmotiCon: Context-Aware Multimodal Emotion Recognition Using Frege’s Principle
Author: Trisha Mittal,Pooja Guhan,Uttaran Bhattacharya,Rohan Chandra,Aniket Bera,Dinesh Manocha


Can Deep Learning Recognize Subtle Human Activities?
Author: Vincent Jacquot,Zhuofan Ying,Gabriel Kreiman


PhysGAN: Generating Physical-World-Resilient Adversarial Examples for Autonomous Driving
Author: Zelun Kong,Junfeng Guo,Ang Li,Cong Liu


ILFO: Adversarial Attack on Adaptive Neural Networks
Author: Mirazul Haque,Anki Chauhan,Cong Liu,Wei Yang


On Translation Invariance in CNNs: Convolutional Layers Can Exploit Absolute Spatial Location
Author: Osman Semih Kayhan,Jan C. van Gemert


Diverse Image Generation via Self-Conditioned GANs
Author: Steven Liu,Tongzhou Wang,David Bau,Jun-Yan Zhu,Antonio Torralba


Inducing Hierarchical Compositional Model by Sparsifying Generator Network
Author: Xianglei Xing,Tianfu Wu,Song-Chun Zhu,Ying Nian Wu


CARP: Compression Through Adaptive Recursive Partitioning for Multi-Dimensional Images
Author: Rongjie Liu,Meng Li,Li Ma


GrappaNet: Combining Parallel Imaging With Deep Learning for Multi-Coil MRI Reconstruction
Author: Anuroop Sriram,Jure Zbontar,Tullie Murrell,C. Lawrence Zitnick,Aaron Defazio,Daniel K. Sodickson


Can Weight Sharing Outperform Random Architecture Search? An Investigation With TuNAS
Author: Gabriel Bender,Hanxiao Liu,Bo Chen,Grace Chu,Shuyang Cheng,Pieter-Jan Kindermans,Quoc V. Le


Context Aware Graph Convolution for Skeleton-Based Action Recognition
Author: Xikun Zhang,Chang Xu,Dacheng Tao


Fast(er) Reconstruction of Shredded Text Documents via Self-Supervised Deep Asymmetric Metric Learning
Author: Thiago M. Paixao,Rodrigo F. Berriel,Maria C. S. Boeres,Alessandro L. Koerich,Claudine Badue,Alberto F. De Souza,Thiago Oliveira-Santos


Revisiting Pose-Normalization for Fine-Grained Few-Shot Recognition
Author: Luming Tang,Davis Wertheimer,Bharath Hariharan


RankMI: A Mutual Information Maximizing Ranking Loss
Author: Mete Kemertas,Leila Pishdad,Konstantinos G. Derpanis,Afsaneh Fazly


Learning Memory-Guided Normality for Anomaly Detection
Author: Hyunjong Park,Jongyoun Noh,Bumsub Ham


Appearance Shock Grammar for Fast Medial Axis Extraction From Real Images
Author: Charles-Olivier Dufresne Camaro,Morteza Rezanejad,Stavros Tsogkas,Kaleem Siddiqi,Sven Dickinson


Generalizing Hand Segmentation in Egocentric Videos With Uncertainty-Guided Model Adaptation
Author: Minjie Cai,Feng Lu,Yoichi Sato


DeFeat-Net: General Monocular Depth via Simultaneous Unsupervised Representation Learning
Author: Jaime Spencer,Richard Bowden,Simon Hadfield


Learning Visual Motion Segmentation Using Event Surfaces
Author: Anton Mitrokhin,Zhiyuan Hua,Cornelia Fermuller,Yiannis Aloimonos


Social-STGCNN: A Social Spatio-Temporal Graph Convolutional Neural Network for Human Trajectory Prediction
Author: Abduallah Mohamed,Kun Qian,Mohamed Elhoseiny,Christian Claudel


Discriminative Multi-Modality Speech Recognition
Author: Bo Xu,Cheng Lu,Yandong Guo,Jacob Wang


Clean-Label Backdoor Attacks on Video Recognition Models
Author: Shihao Zhao,Xingjun Ma,Xiang Zheng,James Bailey,Jingjing Chen,Yu-Gang Jiang


Detecting Adversarial Samples Using Influence Functions and Nearest Neighbors
Author: Gilad Cohen,Guillermo Sapiro,Raja Giryes


Unsupervised Model Personalization While Preserving Privacy and Scalability: An Open Problem
Author: Matthias De Lange,Xu Jia,Sarah Parisot,Ales Leonardis,Gregory Slabaugh,Tinne Tuytelaars


GIFnets: Differentiable GIF Encoding Framework
Author: Innfarn Yoo,Xiyang Luo,Yilin Wang,Feng Yang,Peyman Milanfar


Learning Invariant Representation for Unsupervised Image Restoration
Author: Wenchao Du,Hu Chen,Hongyu Yang


Improved Few-Shot Visual Classification
Author: Peyman Bateni,Raghav Goyal,Vaden Masrani,Frank Wood,Leonid Sigal


Learning Weighted Submanifolds With Variational Autoencoders and Riemannian Variational Autoencoders
Author: Author: Nina Miolane,Susan Holmes


Learning Geocentric Object Pose in Oblique Monocular Images
Author: Gordon Christie,Rodrigo Rene Rai Munoz Abujder,Kevin Foster,Shea Hagstrom,Gregory D. Hager,Myron Z. Brown


Understanding Adversarial Examples From the Mutual Influence of Images and Perturbations
Author: Chaoning Zhang,Philipp Benz,Tooba Imtiaz,In So Kweon


Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models
Author: Giannis Daras,Augustus Odena,Han Zhang,Alexandros G. Dimakis


MoreFusion: Multi-object Reasoning for 6D Pose Estimation from Volumetric Fusion
Author: Kentaro Wada,Edgar Sucar,Stephen James,Daniel Lenton,Andrew J. Davison


HCNAF: Hyper-Conditioned Neural Autoregressive Flow and its Application for Probabilistic Occupancy Map Forecasting
Author: Geunseob Oh,Jean-Sebastien Valois


Detail-recovery Image Deraining via Context Aggregation Networks
Author: Sen Deng,Mingqiang Wei,Jun Wang,Yidan Feng,Luming Liang,Haoran Xie,Fu Lee Wang,Meng Wang


MCEN: Bridging Cross-Modal Gap between Cooking Recipes and Dish Images with Latent Variable Model
Author: Han Fu,Rui Wu,Chenghao Liu,Jianling Sun


Hypergraph Attention Networks for Multimodal Learning
Author: Eun-Sol Kim,Woo Young Kang,Kyoung-Woon On,Yu-Jung Heo,Byoung-Tak Zhang


Moving in the Right Direction: A Regularization for Deep Metric Learning
Author: Deen Dayal Mohan,Nishant Sankaran,Dennis Fedorishin,Srirangaraj Setlur,Venu Govindaraju


Rethinking Depthwise Separable Convolutions: How Intra-Kernel Correlations Lead to Improved MobileNets
Author: Daniel Haase,Manuel Amthor


Seeing without Looking: Contextual Rescoring of Object Detections for AP Maximization
Author: Lourenco V. Pato,Renato Negrinho,Pedro M. Q. Aguiar


End-to-End Adversarial-Attention Network for Multi-Modal Clustering
Author: Runwu Zhou,Yi-Dong Shen


Fast Sparse ConvNets
Author: Erich Elsen,Marat Dukhan,Trevor Gale,Karen Simonyan


Few Sample Knowledge Distillation for Efficient Network Compression
Author: Tianhong Li,Jianguo Li,Zhuang Liu,Changshui Zhang


Predicting Sharp and Accurate Occlusion Boundaries in Monocular Depth Estimation Using Displacement Fields
Author: Michael Ramamonjisoa,Yuming Du,Vincent Lepetit


Shape correspondence using anisotropic Chebyshev spectral CNNs
Author: Qinsong Li,Shengjun Liu,Ling Hu,Xinru Liu


RetinaTrack: Online Single Stage Joint Detection and Tracking
Author: Zhichao Lu,Vivek Rathod,Ronny Votel,Jonathan Huang


Multimodal Categorization of Crisis Events in Social Media
Author: Mahdi Abavisani,Liwei Wu,Shengli Hu,Joel Tetreault,Alejandro Jaimes


SPARE3D: A Dataset for SPAtial REasoning on Three-View Line Drawings
Author: Wenyu Han,Siyuan Xiang,Chenhui Liu,Ruoyu Wang,Chen Feng


SwapText: Image Based Texts Transfer in Scenes
Author: Qiangpeng Yang,Jun Huang,Wei Lin


OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold
Author: Mohamed Yousef,Tom E. Bishop


FroDO: From Detections to 3D Objects
Author: Martin Runz,Kejie Li,Meng Tang,Lingni Ma,Chen Kong,Tanner Schmidt,Ian Reid,Lourdes Agapito,Julian Straub,Steven Lovegrove,Richard Newcombe

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值