[CVPR2021] Paper Roundup, Part 1

- This post collects the CVPR2021 papers that have been released on arXiv so far; it is updated continuously.

  • Best wishes! ☔ ❤ 🍜 👉 part2 👈
  • Checkpoint: 2021-05-01, data from arxiv.org

Action Unit Memory Network for Weakly Supervised Temporal Action Localization
AuthorsWang Luo, Tianzhu Zhang, Wenfei Yang, Jingen Liu, Tao Mei, Feng Wu, Yongdong Zhang
Practical Wide-Angle Portraits Correction with Deep Structured Models
AuthorsJing Tan, Shan Zhao, Pengfei Xiong, Jiangyu Liu, Haoqiang Fan, Shuaicheng Liu
Deep Lucas-Kanade Homography for Multimodal Image Alignment
AuthorsYiming Zhao, Xinming Huang, Ziming Zhang
Network Space Search for Pareto-Efficient Spaces
AuthorsMin-Fong Hong, Hao-Yun Chen, Min-Hung Chen, Yu-Syuan Xu, Hsien-Kai Kuo, Yi-Min Tsai, Hung-Jen Chen, Kevin Jou
Distilling Audio-Visual Knowledge by Compositional Contrastive Learning
AuthorsYanbei Chen, Yongqin Xian, A. Sophia Koepke, Ying Shan, Zeynep Akata
Guided Interactive Video Object Segmentation Using Reliability-Based Attention Maps
AuthorsYuk Heo, Yeong Jun Koh, Chang-Su Kim
EarthNet2021: A large-scale dataset and challenge for Earth surface forecasting as a guided video prediction task
AuthorsChristian Requena-Mesa, Vitus Benson, Markus Reichstein, Jakob Runge, Joachim Denzler
Shadow Neural Radiance Fields for Multi-view Satellite Photogrammetry
AuthorsDawa Derksen, Dario Izzo
Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation
AuthorsJichang Li, Guanbin Li, Yemin Shi, Yizhou Yu
Camera Calibration and Player Localization in SoccerNet-v2 and Investigation of their Representations for Action Spotting
AuthorsAnthony Cioppa, Adrien Deliège, Floriane Magera, Silvio Giancola, Olivier Barnich, Bernard Ghanem, Marc Van Droogenbroeck
Plants Don't Walk on the Street: Common-Sense Reasoning for Reliable Semantic Segmentation
AuthorsLinara Adilova, Elena Schulz, Maram Akila, Sebastian Houben, Jan David Schneider, Fabian Hueger, Tim Wirtz
Self-Supervised Pillar Motion Learning for Autonomous Driving
AuthorsChenxu Luo, Xiaodong Yang, Alan Yuille
I Only Have Eyes for You: The Impact of Masks On Convolutional-Based Facial Expression Recognition
AuthorsPablo Barros, Alessandra Sciutti
Audio-Driven Emotional Video Portraits
AuthorsXinya Ji, Hang Zhou, Kaisiyuan Wang, Wayne Wu, Chen Change Loy, Xun Cao, Feng Xu
A Hyperbolic-to-Hyperbolic Graph Convolutional Network
AuthorsJindou Dai, Yuwei Wu, Zhi Gao, Yunde Jia
Harmonious Semantic Line Detection via Maximal Weight Clique Selection
AuthorsDongkwon Jin, Wonhui Park, Seong-Gyun Jeong, Chang-Su Kim
Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds
AuthorsBowen Cheng, Lu Sheng, Shaoshuai Shi, Ming Yang, Dong Xu
IMAGINE: Image Synthesis by Image-Guided Model Inversion
AuthorsPei Wang, Yijun Li, Krishna Kumar Singh, Jingwan Lu, Nuno Vasconcelos
Semantic Segmentation with Generative Models: Semi-Supervised Learning and Strong Out-of-Domain Generalization
AuthorsDaiqing Li, Junlin Yang, Karsten Kreis, Antonio Torralba, Sanja Fidler
View-Guided Point Cloud Completion
AuthorsXuancheng Zhang, Yutong Feng, Siqi Li, Changqing Zou, Hai Wan, Xibin Zhao, Yandong Guo, Yue Gao
Rethinking and Improving the Robustness of Image Style Transfer
AuthorsPei Wang, Yijun Li, Nuno Vasconcelos
Where and What? Examining Interpretable Disentangled Representations
AuthorsXinqi Zhu, Chang Xu, Dacheng Tao
Spatial Feature Calibration and Temporal Fusion for Effective One-stage Video Instance Segmentation
AuthorsMinghan Li, Shuai Li, Lida Li, Lei Zhang
StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision
AuthorsYang Hong, Juyong Zhang, Boyi Jiang, Yudong Guo, Ligang Liu, Hujun Bao
Glance and Gaze: Inferring Action-aware Points for One-Stage Human-Object Interaction Detection
AuthorsXubin Zhong, Xian Qu, Changxing Ding, Dacheng Tao
All Labels Are Not Created Equal: Enhancing Semi-supervision via Label Grouping and Co-training
AuthorsIslam Nassar, Samitha Herath, Ehsan Abbasnejad, Wray Buntine, Gholamreza Haffari
Neural Camera Simulators
AuthorsHao Ouyang, Zifan Shi, Chenyang Lei, Ka Lung Law, Qifeng Chen
CFNet: Cascade and Fused Cost Volume for Robust Stereo Matching
AuthorsZhelun Shen, Yuchao Dai, Zhibo Rao
Combined Depth Space based Architecture Search For Person Re-identification
AuthorsHanjun Li, Gaojie Wu, Wei-Shi Zheng
Progressive Temporal Feature Alignment Network for Video Inpainting
AuthorsXueyan Zou, Linjie Yang, Ding Liu, Yong Jae Lee
Riggable 3D Face Reconstruction via In-Network Optimization
AuthorsZiqian Bai, Zhaopeng Cui, Xiaoming Liu, Ping Tan
Seeing Out of tHe bOx: End-to-End Pre-training for Vision-Language Representation Learning
AuthorsZhicheng Huang, Zhaoyang Zeng, Yupan Huang, Bei Liu, Dongmei Fu, Jianlong Fu
Everything's Talkin': Pareidolia Face Reenactment
AuthorsLinsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He
Few-Shot Incremental Learning with Continually Evolved Classifiers
AuthorsChi Zhang, Nan Song, Guosheng Lin, Yun Zheng, Pan Pan, Yinghui Xu
Affordance Transfer Learning for Human-Object Interaction Detection
AuthorsZhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao
Learning Triadic Belief Dynamics in Nonverbal Communication from Videos
AuthorsLifeng Fan, Shuwen Qiu, Zilong Zheng, Tao Gao, Song-Chun Zhu, Yixin Zhu
Localizing Visual Sounds the Hard Way
AuthorsHonglie Chen, Weidi Xie, Triantafyllos Afouras, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman
Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression
AuthorsZigang Geng, Ke Sun, Bin Xiao, Zhaoxiang Zhang, Jingdong Wang
Content-Aware GAN Compression
AuthorsYuchen Liu, Zhixin Shu, Yijun Li, Zhe Lin, Federico Perazzi, S. Y. Kung
Adaptive Prototype Learning and Allocation for Few-Shot Segmentation
AuthorsGen Li, Varun Jampani, Laura Sevilla-Lara, Deqing Sun, Jonghyun Kim, Joongkyu Kim
SGCN: Sparse Graph Convolution Network for Pedestrian Trajectory Prediction
AuthorsLiushuai Shi, Le Wang, Chengjiang Long, Sanping Zhou, Mo Zhou, Zhenxing Niu, Gang Hua
UAV-Human: A Large Benchmark for Human Behavior Understanding with Unmanned Aerial Vehicles
AuthorsTianjiao Li, Jun Liu, Wei Zhang, Yun Ni, Wenqian Wang, Zhiheng Li
Adaptive Class Suppression Loss for Long-Tail Object Detection
AuthorsTong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang
S2R-DepthNet: Learning a Generalizable Depth-specific Structural Representation
AuthorsXiaotian Chen, Yuwang Wang, Xuejin Chen, Wenjun Zeng
Self-supervised Video Representation Learning by Context and Motion Decoupling
AuthorsLianghua Huang, Yu Liu, Bin Wang, Pan Pan, Yinghui Xu, Rong Jin
Bipartite Graph Network with Adaptive Message Passing for Unbiased Scene Graph Generation
AuthorsRongjie Li, Songyang Zhang, Bo Wan, Xuming He
Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes
AuthorsDmytro Kotovenko, Matthias Wright, Arthur Heimbrecht, Björn Ommer
Dense Relation Distillation with Context-aware Aggregation for Few-Shot Object Detection
AuthorsHanzhe Hu, Shuai Bai, Aoxue Li, Jinshi Cui, Liwei Wang
Learning Camera Localization via Dense Scene Matching
AuthorsShitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu, Ping Tan
DER: Dynamically Expandable Representation for Class Incremental Learning
AuthorsShipeng Yan, Jiangwei Xie, Xuming He
Unsupervised Disentanglement of Linear-Encoded Facial Semantics
AuthorsYutong Zheng, Yu-Kai Huang, Ran Tao, Zhiqiang Shen, Marios Savvides
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding
AuthorsShengheng Deng, Xun Xu, Chaozheng Wu, Ke Chen, Kui Jia
Complementary Relation Contrastive Distillation
AuthorsJinguo Zhu, Shixiang Tang, Dapeng Chen, Shijie Yu, Yakun Liu, Aijun Yang, Mingzhe Rong, Xiaohua Wang
Learning monocular 3D reconstruction of articulated categories from motion
AuthorsFilippos Kokkinos, Iasonas Kokkinos
Learning Parallel Dense Correspondence from Spatio-Temporal Descriptors for Efficient and Robust 4D Reconstruction
AuthorsJiapeng Tang, Dan Xu, Kui Jia, Lei Zhang
Contrastive Embedding for Generalized Zero-Shot Learning
AuthorsZongyan Han, Zhenyong Fu, Shuo Chen, Jian Yang
Kaleido-BERT: Vision-Language Pre-training on Fashion Domain
AuthorsMingchen Zhuge, Dehong Gao, Deng-Ping Fan, Linbo Jin, Ben Chen, Haoming Zhou, Minghui Qiu, Ling Shao
Progressive Domain Expansion Network for Single Domain Generalization
AuthorsLei Li, Ke Gao, Juan Cao, Ziyao Huang, Yepeng Weng, Xiaoyue Mi, Zhengze Yu, Xiaoya Li, Boyang Xia
Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow
AuthorsShangrong Yang, Chunyu Lin, Kang Liao, Chunjie Zhang, Yao Zhao
TransFill: Reference-guided Image Inpainting by Merging Multiple Color and Spatial Transformations
AuthorsYuqian Zhou, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi
Flow-based Kernel Prior with Application to Blind Super-Resolution
AuthorsJingyun Liang, Kai Zhang, Shuhang Gu, Luc Van Gool, Radu Timofte
DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation
AuthorsYufan He, Dong Yang, Holger Roth, Can Zhao, Daguang Xu
Transformer Tracking
AuthorsXin Chen, Bin Yan, Jiawen Zhu, Dong Wang, Xiaoyun Yang, Huchuan Lu
Self-Supervised Visibility Learning for Novel View Synthesis
AuthorsYujiao Shi, Hongdong Li, Xin Yu
Meta-Mining Discriminative Samples for Kinship Verification
AuthorsWanhua Li, Shiwei Wang, Jiwen Lu, Jianjiang Feng, Jie Zhou
Picasso: A CUDA-based Library for Deep Learning over 3D Meshes
AuthorsHuan Lei, Naveed Akhtar, Ajmal Mian
Invertible Image Signal Processing
AuthorsYazhou Xing, Zian Qian, Qifeng Chen
IoU Attack: Towards Temporally Coherent Black-Box Adversarial Attack for Visual Object Tracking
AuthorsShuai Jia, Yibing Song, Chao Ma, Xiaokang Yang
From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation
AuthorsChen Li, Gim Hee Lee
PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds
AuthorsMutian Xu, Runyu Ding, Hengshuang Zhao, Xiaojuan Qi
Rethinking Graph Neural Network Search from Message-passing
AuthorsShaofei Cai, Liang Li, Jincan Deng, Beichen Zhang, Zheng-Jun Zha, Li Su, Qingming Huang
Confluent Vessel Trees with Accurate Bifurcations
AuthorsZhongwen Zhang, Dmitrii Marin, Maria Drangova, Yuri Boykov
OTA: Optimal Transport Assignment for Object Detection
AuthorsZheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun
MagDR: Mask-guided Detection and Reconstruction for Defending Deepfakes
AuthorsZhikai Chen, Lingxi Xie, Shanmin Pang, Yong He, Bo Zhang
Equivariant Point Network for 3D Point Cloud Analysis
AuthorsHaiwei Chen, Shichen Liu, Weikai Chen, Hao Li
OTCE: A Transferability Metric for Cross-Domain Cross-Task Representations
AuthorsYang Tan, Yang Li, Shao-Lun Huang
Dynamic Weighted Learning for Unsupervised Domain Adaptation
AuthorsNi Xiao, Lei Zhang
Learning Probabilistic Ordinal Embeddings for Uncertainty-Aware Regression
AuthorsWanhua Li, Xiaoke Huang, Jiwen Lu, Jianjiang Feng, Jie Zhou
Learning Dynamic Alignment via Meta-filter for Few-shot Learning
AuthorsChengming Xu, Chen Liu, Li Zhang, Chengjie Wang, Jilin Li, Feiyue Huang, Xiangyang Xue, Yanwei Fu
MetaAlign: Coordinating Domain Alignment and Classification for Unsupervised Domain Adaptation
AuthorsGuoqiang Wei, Cuiling Lan, Wenjun Zeng, Zhibo Chen
The Blessings of Unlabeled Background in Untrimmed Videos
AuthorsYuan Liu, Jingyuan Chen, Zhenfang Chen, Bing Deng, Jianqiang Huang, Hanwang Zhang
Temporal Context Aggregation Network for Temporal Action Proposal Refinement
AuthorsZhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization
AuthorsChuming Lin, Chengming Xu, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu
Coarse-to-Fine Domain Adaptive Semantic Segmentation with Photometric Alignment and Category-Center Regularization
AuthorsHaoyu Ma, Xiangru Lin, Zifeng Wu, Yizhou Yu
From Shadow Generation to Shadow Removal
AuthorsZhihao Liu, Hui Yin, Xinyi Wu, Zhenyao Wu, Yang Mi, Song Wang
Relation-aware Instance Refinement for Weakly Supervised Visual Grounding
AuthorsYongfei Liu, Bo Wan, Lin Ma, Xuming He
DeFLOCNet: Deep Image Editing via Flexible Low-level Controls
AuthorsHongyu Liu, Ziyu Wan, Wei Huang, Yibing Song, Xintong Han, Jing Liao, Bing Jiang, Wei Liu
Lifelong Person Re-Identification via Adaptive Knowledge Accumulation
AuthorsNan Pu, Wei Chen, Yu Liu, Erwin M. Bakker, Michael S. Lew
Co-Grounding Networks with Semantic Attention for Referring Expression Comprehension in Videos
AuthorsSijie Song, Xudong Lin, Jiaying Liu, Zongming Guo, Shih-Fu Chang
Deep Occlusion-Aware Instance Segmentation with Overlapping BiLayers
AuthorsLei Ke, Yu-Wing Tai, Chi-Keung Tang
Prioritized Architecture Sampling with Monto-Carlo Tree Search
AuthorsXiu Su, Tao Huang, Yanxi Li, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu
Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion
AuthorsPeng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li
Intra-Inter Camera Similarity for Unsupervised Person Re-Identification
AuthorsShiyu Xuan, Shiliang Zhang
Multimodal Motion Prediction with Stacked Transformers
AuthorsYicheng Liu, Jinghuai Zhang, Liangji Fang, Qinhong Jiang, Bolei Zhou
Brain Image Synthesis with Unsupervised Multivariate Canonical CSC$\ell_4$Net
AuthorsYawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao
Cross-Dataset Collaborative Learning for Semantic Segmentation
AuthorsLi Wang, Dong Li, Yousong Zhu, Lu Tian, Yi Shan
PGT: A Progressive Method for Training Models on Long Videos
AuthorsBo Pang, Gao Peng, Yizhuo Li, Cewu Lu
Learning the Superpixel in a Non-iterative and Lifelong Manner
AuthorsLei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, Jie Hu
XProtoNet: Diagnosis in Chest Radiography with Global and Local Explanations
AuthorsEunji Kim, Siwon Kim, Minji Seo, Sungroh Yoon
Disentangled Cycle Consistency for Highly-realistic Virtual Try-On
AuthorsChongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo
You Only Look One-level Feature
AuthorsQiang Chen, Yingming Wang, Tong Yang, Xiangyu Zhang, Jian Cheng, Jian Sun
Skeleton Aware Multi-modal Sign Language Recognition
AuthorsSongyao Jiang, Bin Sun, Lichen Wang, Yue Bai, Kunpeng Li, Yun Fu
Learning Compositional Representation for 4D Captures with Neural ODE
AuthorsBoyan Jiang, Yinda Zhang, Xingkui Wei, Xiangyang Xue, Yanwei Fu
Detecting Human-Object Interaction via Fabricated Compositional Learning
AuthorsZhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao
DivCo: Diverse Conditional Image Synthesis via Contrastive Generative Adversarial Network
AuthorsRui Liu, Yixiao Ge, Ching Lam Choi, Xiaogang Wang, Hongsheng Li
ReDet: A Rotation-equivariant Detector for Aerial Object Detection
AuthorsJiaming Han, Jian Ding, Nan Xue, Gui-Song Xia
PLADE-Net: Towards Pixel-Level Accuracy for Self-Supervised Single-View Depth Estimation with Neural Positional Encoding and Distilled Matting Loss
AuthorsJuan Luis Gonzalez Bello, Munchurl Kim
Training Networks in Null Space of Feature Covariance for Continual Learning
AuthorsShipeng Wang, Xiaorong Li, Jian Sun, Zongben Xu
Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion
AuthorsShi Qiu, Saeed Anwar, Nick Barnes
FS-Net: Fast Shape-based Network for Category-Level 6D Object Pose Estimation with Decoupled Rotation Mechanism
AuthorsWei Chen, Xi Jia, Hyung Jin Chang, Jinming Duan, Linlin Shen, Ales Leonardis
PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation
AuthorsXiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin
Adversarial Laser Beam: Effective Physical-World Attack to DNNs in a Blink
AuthorsRanjie Duan, Xiaofeng Mao, A. K. Qin, Yun Yang, Yuefeng Chen, Shaokai Ye, Yuan He
Capturing Omni-Range Context for Omnidirectional Segmentation
AuthorsKailun Yang, Jiaming Zhang, Simon Reiß, Xinxin Hu, Rainer Stiefelhagen
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis
AuthorsChaoyi Zhang, Jianhui Yu, Yang Song, Weidong Cai
QPIC: Query-Based Pairwise Human-Object Interaction Detection with Image-Wide Contextual Information
AuthorsMasato Tamura, Hiroki Ohashi, Tomoaki Yoshinaga
ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection
AuthorsJihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi
Multi-Source Domain Adaptation with Collaborative Learning for Semantic Segmentation
AuthorsJianzhong He, Xu Jia, Shuaijun Chen, Jianzhuang Liu
Semi-supervised Domain Adaptation based on Dual-level Domain Mixing for Semantic Segmentation
AuthorsShuaijun Chen, Xu Jia, Jianzhong He, Yongjie Shi, Jianzhuang Liu
Deep Gradient Projection Networks for Pan-sharpening
AuthorsShuang Xu, Jiangshe Zhang, Zixiang Zhao, Kai Sun, Junmin Liu, Chunxia Zhang
Parser-Free Virtual Try-on via Distilling Appearance Flows
AuthorsYuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo
Unveiling the Potential of Structure Preserving for Weakly Supervised Object Localization
AuthorsXingjia Pan, Yingguo Gao, Zhiwen Lin, Fan Tang, Weiming Dong, Haolei Yuan, Feiyue Huang, Changsheng Xu
End-to-End Human Object Interaction Detection with HOI Transformer
AuthorsCheng Zou, Bohan Wang, Yue Hu, Junqi Liu, Qian Wu, Yu Zhao, Boxun Li, Chenguang Zhang, Chi Zhang, Yichen Wei, Jian Sun
Watching You: Global-guided Reciprocal Learning for Video-based Person Re-identification
AuthorsXuehu Liu, Pingping Zhang, Chenyang Yu, Huchuan Lu, Xiaoyun Yang
Robust Reflection Removal with Reflection-free Flash-only Cues
AuthorsChenyang Lei, Qifeng Chen
WebFace260M: A Benchmark Unveiling the Power of Million-Scale Deep Face Recognition
AuthorsZheng Zhu, Guan Huang, Jiankang Deng, Yun Ye, Junjie Huang, Xinze Chen, Jiagang Zhu, Tian Yang, Jiwen Lu, Dalong Du, Jie Zhou
ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic
AuthorsXiangtao Kong, Hengyuan Zhao, Yu Qiao, Chao Dong
PISE: Person Image Synthesis and Editing with Decoupled GAN
AuthorsJinsong Zhang, Kun Li, Yu-Kun Lai, Jingyu Yang
Structured Scene Memory for Vision-Language Navigation
AuthorsHanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen
Goal-Oriented Gaze Estimation for Zero-Shot Learning
AuthorsYang Liu, Lei Zhou, Xiao Bai, Yifei Huang, Lin Gu, Jun Zhou, Tatsuya Harada
Coordinate Attention for Efficient Mobile Network Design
AuthorsQibin Hou, Daquan Zhou, Jiashi Feng
Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration
AuthorsXingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng
Multi-attentional Deepfake Detection
AuthorsHanqing Zhao, Wenbo Zhou, Dongdong Chen, Tianyi Wei, Weiming Zhang, Nenghai Yu
FSDR: Frequency Space Domain Randomization for Domain Generalization
AuthorsJiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu
ID-Unet: Iterative Soft and Hard Deformation for View Synthesis
AuthorsMingyu Yin, Li Sun, Qingli Li
FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation
AuthorsYisheng He, Haibin Huang, Haoqiang Fan, Qifeng Chen, Jian Sun
PML: Progressive Margin Loss for Long-tailed Age Classification
AuthorsZongyong Deng, Hao Liu, Yaoxing Wang, Chenyang Wang, Zekuan Yu, Xuehong Sun
MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing
AuthorsZhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan
Hierarchical and Partially Observable Goal-driven Policy Learning with Goals Relational Graph
AuthorsXin Ye, Yezhou Yang
Auto-Exposure Fusion for Single-Image Shadow Removal
AuthorsLan Fu, Changqing Zhou, Qing Guo, Felix Juefei-Xu, Hongkai Yu, Wei Feng, Yang Liu, Song Wang
Cross Modal Focal Loss for RGBD Face Anti-Spoofing
AuthorsAnjith George, Sebastien Marcel
DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes with Biharmonic Coordinates
AuthorsMinghua Liu, Minhyuk Sung, Radomir Mech, Hao Su
Protecting Intellectual Property of Generative Adversarial Networks from Ambiguity Attack
AuthorsDing Sheng Ong, Chee Seng Chan, Kam Woh Ng, Lixin Fan, Qiang Yang
Regularization Strategy for Point Cloud via Rigidly Mixed Sample
AuthorsDogyoon Lee, Jaeha Lee, Junhyeop Lee, Hyeongmin Lee, Minhyeok Lee, Sungmin Woo, Sangyoun Lee
Neighbor2Neighbor: Self-Supervised Denoising from Single Noisy Images
AuthorsTao Huang, Songjiang Li, Xu Jia, Huchuan Lu, Jianzhuang Liu
Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation
AuthorsZhengxiong Luo, Zhicheng Wang, Yan Huang, Tieniu Tan, Erjin Zhou
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation
AuthorsHyojin Park, Jayeon Yoo, Seohyeong Jeong, Ganesh Venkatesh, Nojun Kwak
Populating 3D Scenes by Learning Human-Scene Interaction
AuthorsMohamed Hassan, Partha Ghosh, Joachim Tesch, Dimitrios Tzionas, Michael J. Black
Image Translation via Fine-grained Knowledge Transfer
AuthorsXuanhong Chen, Ziang Liu, Ting Qiu, Bingbing Ni, Naiyuan Liu, Xiwei Hu, Yuhan Li
Improving Unsupervised Image Clustering With Robust Learning
AuthorsSungwon Park, Sungwon Han, Sundong Kim, Danu Kim, Sungkyu Park, Seunghoon Hong, Meeyoung Cha
TDN: Temporal Difference Networks for Efficient Action Recognition
AuthorsLimin Wang, Zhan Tong, Bin Ji, Gangshan Wu
Understanding the Behaviour of Contrastive Loss
AuthorsFeng Wang, Huaping Liu
RainNet: A Large-Scale Dataset for Spatial Precipitation Downscaling
AuthorsXuanhong Chen, Kairui Feng, Naiyuan Liu, Yifan Lu, Zhengyan Tong, Bingbing Ni, Ziang Liu, Ning Lin
Equalization Loss v2: A New Gradient Balance Approach for Long-tailed Object Detection
AuthorsJingru Tan, Xin Lu, Gang Zhang, Changqing Yin, Quanquan Li
Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection
AuthorsAlexandros Haliassos, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic
Removing Class Imbalance using Polarity-GAN: An Uncertainty Sampling Approach
AuthorsKumari Deepshikha, Anugunj Naman
Deep Lesion Tracker: Monitoring Lesions in 4D Longitudinal Imaging Studies
AuthorsJinzheng Cai, Youbao Tang, Ke Yan, Adam P. Harrison, Jing Xiao, Gigin Lin, Le Lu
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting
AuthorsLingbo Liu, Jiaqi Chen, Hefeng Wu, Guanbin Li, Chenglong Li, Liang Lin
End-to-End Object Detection with Fully Convolutional Network
AuthorsJianfeng Wang, Lin Song, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng
Learning to Fuse Asymmetric Feature Maps in Siamese Trackers
AuthorsWencheng Han, Xingping Dong, Fahad Shahbaz Khan, Ling Shao, Jianbing Shen
Patch2Pix: Epipolar-Guided Pixel-Level Correspondences
AuthorsQunjie Zhou, Torsten Sattler, Laura Leal-Taixe
Fully Convolutional Networks for Panoptic Segmentation
AuthorsYanwei Li, Hengshuang Zhao, Xiaojuan Qi, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia
Unpaired Image-to-Image Translation via Latent Energy Transport
AuthorsYang Zhao, Changyou Chen
Point2Skeleton: Learning Skeletal Representations from Point Clouds
AuthorsCheng Lin, Changjian Li, Yuan Liu, Nenglun Chen, Yi-King Choi, Wenping Wang
HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation
AuthorsJiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu
End-to-End Video Instance Segmentation with Transformers
AuthorsYuqing Wang, Zhaoliang Xu, Xinlong Wang, Chunhua Shen, Baoshan Cheng, Hao Shen, Huaxia Xia
SoccerNet-v2: A Dataset and Benchmarks for Holistic Understanding of Broadcast Soccer Videos
AuthorsAdrien Deliège, Anthony Cioppa, Silvio Giancola, Meisam J. Seikavandi, Jacob V. Dueholm, Kamal Nasrollahi, Bernard Ghanem, Thomas B. Moeslund, Marc Van Droogenbroeck
Privacy-preserving Collaborative Learning with Automatic Transformation Search
AuthorsWei Gao, Shangwei Guo, Tianwei Zhang, Han Qiu, Yonggang Wen, Yang Liu
Recurrent Multi-view Alignment Network for Unsupervised Surface Registration
AuthorsWanquan Feng, Juyong Zhang, Hongrui Cai, Haofei Xu, Junhui Hou, Hujun Bao
Propagate Yourself: Exploring Pixel-Level Consistency for Unsupervised Visual Representation Learning
AuthorsZhenda Xie, Yutong Lin, Zheng Zhang, Yue Cao, Stephen Lin, Han Hu
MUST-GAN: Multi-level Statistics Transfer for Self-driven Person Image Generation
AuthorsTianxiang Ma, Bo Peng, Wei Wang, Jing Dong
Your "Flamingo" is My "Bird": Fine-Grained, or Not
AuthorsDongliang Chang, Kaiyue Pang, Yixiao Zheng, Zhanyu Ma, Yi-Zhe Song, Jun Guo
Intentonomy: a Dataset and Study towards Human Intent Understanding
AuthorsMenglin Jia, Zuxuan Wu, Austin Reiter, Claire Cardie, Serge Belongie, Ser-Nam Lim
Regularizing Neural Networks via Adversarial Model Perturbation
AuthorsYaowei Zheng, Richong Zhang, Yongyi Mao
Progressive Semantic-Aware Style Transformation for Blind Face Restoration
AuthorsChaofeng Chen, Xiaoming Li, Lingbo Yang, Xianhui Lin, Lei Zhang, Kwan-Yee K. Wong
Removing the Background by Adding the Background: Towards Background Robust Self-supervised Video Representation Learning
AuthorsJinpeng Wang, Yuting Gao, Ke Li, Yiqi Lin, Andy J. Ma, Hao Cheng, Pai Peng, Feiyue Huang, Rongrong Ji, Xing Sun
Spatiotemporal Contrastive Video Representation Learning
AuthorsRui Qian, Tianjian Meng, Boqing Gong, Ming-Hsuan Yang, Huisheng Wang, Serge Belongie, Yin Cui
Camera Pose Matters: Improving Depth Prediction by Mitigating Pose Distribution Bias
AuthorsYunhan Zhao, Shu Kong, Charless Fowlkes
Learning Decision Trees Recurrently Through Communication
AuthorsStephan Alaniz, Diego Marcos, Bernt Schiele, Zeynep Akata
Fast Bayesian Uncertainty Estimation and Reduction of Batch Normalized Single Image Super-Resolution Network
AuthorsAupendu Kar, Prabir Kumar Biswas
Natural Adversarial Examples
AuthorsDan Hendrycks, Kevin Zhao, Steven Basart, Jacob Steinhardt, Dawn Song
How does topology influence gradient propagation and model performance of deep networks with DenseNet-type skip connections?
AuthorsKartikeya Bhardwaj, Guihong Li, Radu Marculescu
ReNAS: Relativistic Evaluation of Neural Architecture Search
AuthorsYixing Xu, Yunhe Wang, Kai Han, Yehui Tang, Shangling Jui, Chunjing Xu, Chang Xu
Deeply Shape-guided Cascade for Instance Segmentation
AuthorsHao Ding, Siyuan Qiao, Alan Yuille, Wei Shen
Unlocking the Full Potential of Small Data with Diverse Supervision
AuthorsZiqi Pang, Zhiyuan Hu, Pavel Tokmakov, Yu-Xiong Wang, Martial Hebert
Point Cloud Instance Segmentation using Probabilistic Embeddings
AuthorsBiao Zhang, Peter Wonka
Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers
AuthorsQi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff
PU-GCN: Point Cloud Upsampling using Graph Convolutional Networks
AuthorsGuocheng Qian, Abdulellah Abualshour, Guohao Li, Ali Thabet, Bernard Ghanem
Explaining Classifiers using Adversarial Perturbations on the Perceptual Ball
AuthorsAndrew Elliott, Stephen Law, Chris Russell
Ternary Feature Masks: zero-forgetting for task-incremental learning
AuthorsMarc Masana, Tinne Tuytelaars, Joost van de Weijer
Renofeation: A Simple Transfer Learning Method for Improved Adversarial Robustness
AuthorsTing-Wu Chin, Cha Zhang, Diana Marculescu
Cross-Iteration Batch Normalization
AuthorsZhuliang Yao, Yue Cao, Shuxin Zheng, Gao Huang, Stephen Lin
On Feature Normalization and Data Augmentation
AuthorsBoyi Li, Felix Wu, Ser-Nam Lim, Serge Belongie, Kilian Q. Weinberger
Single-View 3D Object Reconstruction from Shape Priors in Memory
AuthorsShuo Yang, Min Xu, Haozhe Xie, Stuart Perry, Jiahao Xia
Restore from Restored: Video Restoration with Pseudo Clean Video
AuthorsSeunghwan Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim
Image Restoration for Under-Display Camera
AuthorsYuqian Zhou, David Ren, Neil Emerton, Sehoon Lim, Timothy Large
clDice -- a Novel Topology-Preserving Loss Function for Tubular Structure Segmentation
AuthorsSuprosanna Shit, Johannes C. Paetzold, Anjany Sekuboyina, Ivan Ezhov, Alexander Unger, Andrey Zhylka, Josien P. W. Pluim, Ulrich Bauer, Bjoern H. Menze
Learning Multi-Scale Photo Exposure Correction
AuthorsMahmoud Afifi, Konstantinos G. Derpanis, Björn Ommer, Michael S. Brown
DCNAS: Densely Connected Neural Architecture Search for Semantic Image Segmentation
AuthorsXiong Zhang, Hongmin Xu, Hong Mo, Jianchao Tan, Cheng Yang, Lei Wang, Wenqi Ren
Dynamic Region-Aware Convolution
AuthorsJin Chen, Xijun Wang, Zichao Guo, Xiangyu Zhang, Jian Sun
Generative PointNet: Deep Energy-Based Learning on Unordered Point Sets for 3D Generation, Reconstruction and Classification
AuthorsJianwen Xie, Yifei Xu, Zilong Zheng, Song-Chun Zhu, Ying Nian Wu
AdaStereo: A Simple and Efficient Approach for Adaptive Stereo Matching
AuthorsXiao Song, Guorun Yang, Xinge Zhu, Hui Zhou, Zhe Wang, Jianping Shi
Orthogonal Over-Parameterized Training
AuthorsWeiyang Liu, Rongmei Lin, Zhen Liu, James M. Rehg, Liam Paull, Li Xiong, Le Song, Adrian Weller
MobileDets: Searching for Object Detection Architectures for Mobile Accelerators
AuthorsYunyang Xiong, Hanxiao Liu, Suyog Gupta, Berkin Akin, Gabriel Bender, Yongzhe Wang, Pieter-Jan Kindermans, Mingxing Tan, Vikas Singh, Bo Chen
Polygonal Building Segmentation by Frame Field Learning
AuthorsNicolas Girard, Dmitriy Smirnov, Justin Solomon, Yuliya Tarabalka
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-training
AuthorsMinheng Ni, Haoyang Huang, Lin Su, Edward Cui, Taroon Bharti, Lijuan Wang, Jianfeng Gao, Dongdong Zhang, Nan Duan
Counterfactual VQA: A Cause-Effect Look at Language Bias
AuthorsYulei Niu, Kaihua Tang, Hanwang Zhang, Zhiwu Lu, Xian-Sheng Hua, Ji-Rong Wen
Achieving robustness in classification using optimal transport with hinge regularization
AuthorsMathieu Serrurier, Franck Mamalet, Alberto González-Sanz, Thibaut Boissin, Jean-Michel Loubes, Eustasio del Barrio
Privacy-Preserving Image Features via Adversarial Affine Subspace Embeddings
AuthorsMihai Dusmanu, Johannes L. Schönberger, Sudipta N. Sinha, Marc Pollefeys
A Sliced Wasserstein Loss for Neural Texture Synthesis
AuthorsEric Heitz, Kenneth Vanhoey, Thomas Chambon, Laurent Belcour
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization
AuthorsJunting Pan, Siyu Chen, Mike Zheng Shou, Yu Liu, Jing Shao, Hongsheng Li
Exploring Sparsity in Image Super-Resolution for Efficient Inference
AuthorsLongguang Wang, Xiaoyu Dong, Yingqian Wang, Xinyi Ying, Zaiping Lin, Wei An, Yulan Guo
TearingNet: Point Cloud Autoencoder to Learn Topology-Friendly Representations
AuthorsJiahao Pang, Duanshun Li, Dong Tian
UV-Net: Learning from Boundary Representations
AuthorsPradeep Kumar Jayaraman, Aditya Sanghi, Joseph G. Lambourne, Karl D. D. Willis, Thomas Davies, Hooman Shayani, Nigel Morris
Sequential Graph Convolutional Network for Active Learning
AuthorsRazvan Caramalau, Binod Bhattarai, Tae-Kyun Kim
Few-shot 3D Point Cloud Semantic Segmentation
AuthorsNa Zhao, Tat-Seng Chua, Gim Hee Lee
Backdoor Attacks Against Deep Learning Systems in the Physical World
AuthorsEmily Wenger, Josephine Passananti, Arjun Bhagoji, Yuanshun Yao, Haitao Zheng, Ben Y. Zhao
Rethinking Channel Dimensions for Efficient Model Design
AuthorsDongyoon Han, Sangdoo Yun, Byeongho Heo, YoungJoon Yoo
Domain Adaptation with Auxiliary Target Domain-Oriented Classifier
AuthorsJian Liang, Dapeng Hu, Jiashi Feng
Closed-Form Factorization of Latent Semantics in GANs
AuthorsYujun Shen, Bolei Zhou
Co-Attention for Conditioned Image Matching
AuthorsOlivia Wiles, Sebastien Ehrhardt, Andrew Zisserman
On Robustness and Transferability of Convolutional Neural Networks
AuthorsJosip Djolonga, Jessica Yung, Michael Tschannen, Rob Romijnders, Lucas Beyer, Alexander Kolesnikov, Joan Puigcerver, Matthias Minderer, Alexander D'Amour, Dan Moldovan, Sylvain Gelly, Neil Houlsby, Xiaohua Zhai, Mario Lucic
Generative Hierarchical Features from Synthesizing Images
AuthorsYinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, Bolei Zhou
Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation
AuthorsElad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, Daniel Cohen-Or
DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping
AuthorsYanchao Yang, Brian Lai, Stefano Soatto
How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language
AuthorsAmanda Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giro-i-Nieto
Discovering Multi-Hardware Mobile Models via Architecture Search
AuthorsGrace Chu, Okan Arikan, Gabriel Bender, Weijun Wang, Achille Brighton, Pieter-Jan Kindermans, Hanxiao Liu, Berkin Akin, Suyog Gupta, Andrew Howard
VarifocalNet: An IoU-aware Dense Object Detector
AuthorsHaoyang Zhang, Ying Wang, Feras Dayoub, Niko Sünderhauf
Simulating Unknown Target Models for Query-Efficient Black-box Attacks
AuthorsChen Ma, Li Chen, Jun-Hai Yong
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges
AuthorsQingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham
Activate or Not: Learning Customized Activation
AuthorsNingning Ma, Xiangyu Zhang, Ming Liu, Jian Sun
Group Whitening: Balancing Learning Efficiency and Representational Capacity
AuthorsLei Huang, Yi Zhou, Li Liu, Fan Zhu, Ling Shao
Shot in the Dark: Few-Shot Learning with No Base-Class Labels
AuthorsZitian Chen, Subhransu Maji, Erik Learned-Miller
Learning Invariant Representations and Risks for Semi-supervised Domain Adaptation
AuthorsBo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Trevor Darrell, Kurt Keutzer, Han Zhao
Adaptive Aggregation Networks for Class-Incremental Learning
AuthorsYaoyao Liu, Bernt Schiele, Qianru Sun
Bi-GCN: Binary Graph Convolutional Network
AuthorsJunfu Wang, Yunhong Wang, Zhen Yang, Liang Yang, Yuanfang Guo
Automotive Radar Interference Mitigation with Unfolded Robust PCA based on Residual Overcomplete Auto-Encoder Blocks
AuthorsNicolae-Cătălin Ristea, Andrei Anghel, Radu Tudor Ionescu, Yonina C. Eldar
Deep Analysis of CNN-based Spatio-temporal Representations for Action Recognition
AuthorsChun-Fu Chen, Rameswar Panda, Kandan Ramakrishnan, Rogerio Feris, John Cohn, Aude Oliva, Quanfu Fan
Symmetric Parallax Attention for Stereo Image Super-Resolution
AuthorsYingqian Wang, Xinyi Ying, Longguang Wang, Jungang Yang, Wei An, Yulan Guo
Learning the Best Pooling Strategy for Visual Semantic Embedding
AuthorsJiacheng Chen, Hexiang Hu, Hao Wu, Yuning Jiang, Changhu Wang
Anomaly Detection in Video via Self-Supervised and Multi-Task Learning
AuthorsMariana-Iuliana Georgescu, Antonio Barbalau, Radu Tudor Ionescu, Fahad Shahbaz Khan, Marius Popescu, Mubarak Shah
EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation
AuthorsYang Jiao, Trac D. Tran, Guangming Shi
AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations from Self-Trained Negative Adversaries
AuthorsQianjiang Hu, Xiao Wang, Wei Hu, Guo-Jun Qi
Shared Cross-Modal Trajectory Prediction for Autonomous Driving
AuthorsChiho Choi, Joon Hee Choi, Jiachen Li, Srikanth Malla
Exploring intermediate representation for monocular vehicle pose estimation
AuthorsShichao Li, Zengqiang Yan, Hongyang Li, Kwang-Ting Cheng
Beyond Static Features for Temporally Consistent 3D Human Pose and Shape from a Video
AuthorsHongsuk Choi, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee
3D CNNs with Adaptive Temporal Feature Resolutions
AuthorsMohsen Fayyaz, Emad Bahrami, Ali Diba, Mehdi Noroozi, Ehsan Adeli, Luc Van Gool, Juergen Gall
Dual-stream Multiple Instance Learning Network for Whole Slide Image Classification with Self-supervised Contrastive Learning
AuthorsBin Li, Yin Li, Kevin W. Eliceiri
UP-DETR: Unsupervised Pre-training for Object Detection with Transformers
AuthorsZhigang Dai, Bolun Cai, Yugeng Lin, Junying Chen
Dense Contrastive Learning for Self-Supervised Visual Pre-Training
AuthorsXinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei Li
FixBi: Bridging Domain Spaces for Unsupervised Domain Adaptation
AuthorsJaemin Na, Heechul Jung, Hyung Jin Chang, Wonjun Hwang
FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation
AuthorsYair Kittenplon, Yonina C. Eldar, Dan Raviv
SLADE: A Self-Training Framework For Distance Metric Learning
AuthorsJiali Duan, Yen-Liang Lin, Son Tran, Larry S. Davis, C.-C. Jay Kuo
Open-Vocabulary Object Detection Using Captions
AuthorsAlireza Zareian, Kevin Dela Rosa, Derek Hao Hu, Shih-Fu Chang
HDR Environment Map Estimation for Real-Time Augmented Reality
AuthorsGowri Somanath, Daniel Kurz
FP-NAS: Fast Probabilistic Neural Architecture Search
AuthorsZhicheng Yan, Xiaoliang Dai, Peizhao Zhang, Yuandong Tian, Bichen Wu, Matt Feiszli
Ranking Neural Checkpoints
AuthorsYandong Li, Xuhui Jia, Ruoxin Sang, Yukun Zhu, Bradley Green, Liqiang Wang, Boqing Gong
PLOP: Learning without Forgetting for Continual Semantic Segmentation
AuthorsArthur Douillard, Yifu Chen, Arnaud Dapogny, Matthieu Cord
Rotation-Only Bundle Adjustment
AuthorsSeong Hun Lee, Javier Civera
HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms
AuthorsMahmoud Afifi, Marcus A. Brubaker, Michael S. Brown
GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields
AuthorsMichael Niemeyer, Andreas Geiger
VIGOR: Cross-View Image Geo-localization beyond One-to-one Retrieval
AuthorsSijie Zhu, Taojiannan Yang, Chen Chen
Discovering Hidden Physics Behind Transport Dynamics
AuthorsPeirong Liu, Lin Tian, Yubo Zhang, Stephen R. Aylward, Yueh Z. Lee, Marc Niethammer
PREDATOR: Registration of 3D Point Clouds with Low Overlap
AuthorsShengyu Huang, Zan Gojcic, Mikhail Usvyatsov, Andreas Wieser, Konrad Schindler
Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes
AuthorsZhengqi Li, Simon Niklaus, Noah Snavely, Oliver Wang
Lifting 2D StyleGAN for 3D-Aware Face Generation
AuthorsYichun Shi, Divyansh Aggarwal, Anil K. Jain
Transformation Driven Visual Reasoning
AuthorsXin Hong, Yanyan Lan, Liang Pang, Jiafeng Guo, Xueqi Cheng
DyCo3D: Robust Instance Segmentation of 3D Point Clouds through Dynamic Convolution
AuthorsTong He, Chunhua Shen, Anton van den Hengel
How Well Do Self-Supervised Models Transfer?
AuthorsLinus Ericsson, Henry Gouk, Timothy M. Hospedales
PCLs: Geometry-aware Neural Reconstruction of 3D Pose with Perspective Crop Layers
AuthorsFrank Yu, Mathieu Salzmann, Pascal Fua, Helge Rhodin
Task Programming: Learning Data Efficient Behavior Representations
AuthorsJennifer J. Sun, Ann Kennedy, Eric Zhan, David J. Anderson, Yisong Yue, Pietro Perona
Learning from Incomplete Features by Simultaneous Training of Neural Networks and Sparse Coding
AuthorsCesar F. Caiafa, Ziyao Wang, Jordi Solé-Casals, Qibin Zhao
Hijack-GAN: Unintended-Use of Pretrained, Black-Box GANs
AuthorsHui-Po Wang, Ning Yu, Mario Fritz
Meta Batch-Instance Normalization for Generalizable Person Re-Identification
AuthorsSeokeon Choi, Taekyung Kim, Minki Jeong, Hyoungseob Park, Changick Kim
Occlusion Guided Scene Flow Estimation on 3D Point Clouds
AuthorsBojun Ouyang, Dan Raviv
One-Shot Free-View Neural Talking-Head Synthesis for Video Conferencing
AuthorsTing-Chun Wang, Arun Mallya, Ming-Yu Liu
Unsupervised Part Discovery via Feature Alignment
AuthorsMengqi Guo, Yutong Bai, Zhishuai Zhang, Adam Kortylewski, Alan Yuille
Disentangling Label Distribution for Long-tailed Visual Recognition
AuthorsYoungkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, Buru Chang
Learning to Generalize Unseen Domains via Memory-based Multi-Source Meta-Learning for Person Re-Identification
AuthorsYuyang Zhao, Zhun Zhong, Fengxiang Yang, Zhiming Luo, Yaojin Lin, Shaozi Li, Nicu Sebe
DeFMO: Deblurring and Shape Recovery of Fast Moving Objects
AuthorsDenys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Jiri Matas, Marc Pollefeys
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
AuthorsHuiyu Wang, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen
Improving Accuracy of Binary Neural Networks using Unbalanced Activation Distribution
AuthorsHyungjun Kim, Jihoon Park, Changhun Lee, Jae-Joon Kim
PWCLO-Net: Deep LiDAR Odometry in 3D Point Clouds Using Hierarchical Embedding Mask Optimization
AuthorsGuangming Wang, Xinrui Wu, Zhe Liu, Hesheng Wang
How Robust are Randomized Smoothing based Defenses to Data Poisoning?
AuthorsAkshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm
Learning View-Disentangled Human Pose Representation by Contrastive Cross-View Mutual Information Maximization
AuthorsLong Zhao, Yuxiao Wang, Jiaping Zhao, Liangzhe Yuan, Jennifer J. Sun, Florian Schroff, Hartwig Adam, Xi Peng, Dimitris Metaxas, Ting Liu
Fair Attribute Classification through Latent Space De-biasing
AuthorsVikram V. Ramaswamy, Sunnie S. Y. Kim, Olga Russakovsky
Few-Shot Classification with Feature Map Reconstruction Networks
AuthorsDavis Wertheimer, Luming Tang, Bharath Hariharan
Neural Prototype Trees for Interpretable Fine-grained Image Recognition
AuthorsMeike Nauta, Ron van Bree, Christin Seifert
CoCosNet v2: Full-Resolution Correspondence Learning for Image Translation
AuthorsXingran Zhou, Bo Zhang, Ting Zhang, Pan Zhang, Jianmin Bao, Dong Chen, Zhongfei Zhang, Fang Wen
Robust Instance Segmentation through Reasoning about Multi-Object Occlusion
AuthorsXiaoding Yuan, Adam Kortylewski, Yihong Sun, Alan Yuille
BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond
AuthorsKelvin C. K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy
TediGAN: Text-Guided Diverse Face Image Generation and Manipulation
AuthorsWeihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu
PMP-Net: Point Cloud Completion by Learning Multi-step Point Moving Paths
AuthorsXin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu
Unsupervised Pre-training for Person Re-identification
AuthorsDengpan Fu, Dongdong Chen, Jianmin Bao, Hao Yang, Lu Yuan, Lei Zhang, Houqiang Li, Dong Chen
Continual Adaptation of Visual Representations via Domain Randomization and Meta-learning
AuthorsRiccardo Volpi, Diane Larlus, Grégory Rogez
3DIoUMatch: Leveraging IoU Prediction for Semi-Supervised 3D Object Detection
AuthorsHe Wang, Yezhen Cong, Or Litany, Yue Gao, Leonidas J. Guibas
Multi-Objective Interpolation Training for Robustness to Label Noise
AuthorsDiego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness
The Lottery Ticket Hypothesis for Object Recognition
AuthorsSharath Girish, Shishira R. Maiya, Kamal Gupta, Hao Chen, Larry Davis, Abhinav Shrivastava
Deep Denoising of Flash and No-Flash Pairs for Photography in Low-Light Environments
AuthorsZhihao Xia, Michaël Gharbi, Federico Perazzi, Kalyan Sunkavalli, Ayan Chakrabarti
Monocular Real-time Full Body Capture with Inter-part Correlations
AuthorsYuxiao Zhou, Marc Habermann, Ikhsanul Habibie, Ayush Tewari, Christian Theobalt, Feng Xu
Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need?
AuthorsMalik Boudiaf, Hoel Kervadec, Ziko Imtiaz Masud, Pablo Piantanida, Ismail Ben Ayed, Jose Dolz
Iso-Points: Optimizing Neural Implicit Surfaces with Hybrid Representations
AuthorsWang Yifan, Shihao Wu, Cengiz Oztireli, Olga Sorkine-Hornung
Mask Guided Matting via Progressive Refinement Network
AuthorsQihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan Yuille
Uncalibrated Neural Inverse Rendering for Photometric Stereo of General Surfaces
AuthorsBerk Kaya, Suryansh Kumar, Carlos Oliveira, Vittorio Ferrari, Luc Van Gool
Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation
AuthorsBin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang
The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models
AuthorsTianlong Chen, Jonathan Frankle, Shiyu Chang, Sijia Liu, Yang Zhang, Michael Carbin, Zhangyang Wang
Information-Theoretic Segmentation by Inpainting Error Maximization
AuthorsPedro Savarese, Sunnie S. Y. Kim, Michael Maire, Greg Shakhnarovich, David McAllester
KOALAnet: Blind Super-Resolution using Kernel-Oriented Adaptive Local Adjustment
AuthorsSoo Ye Kim, Hyeonjun Sim, Munchurl Kim
Improved Image Matting via Real-time User Clicks and Uncertainty Estimation
AuthorsTianyi Wei, Dongdong Chen, Wenbo Zhou, Jing Liao, Hanqing Zhao, Weiming Zhang, Nenghai Yu
Wasserstein Contrastive Representation Distillation
AuthorsLiqun Chen, Dong Wang, Zhe Gan, Jingjing Liu, Ricardo Henao, Lawrence Carin
Joint Generative and Contrastive Learning for Unsupervised Person Re-identification
AuthorsHao Chen, Yaohui Wang, Benoit Lagadec, Antitza Dantcheva, Francois Bremond
DECOR-GAN: 3D Shape Detailization by Conditional Refinement
AuthorsZhiqin Chen, Vladimir G. Kim, Matthew Fisher, Noam Aigerman, Hao Zhang, Siddhartha Chaudhuri
Learning Continuous Image Representation with Local Implicit Image Function
AuthorsYinbo Chen, Sifei Liu, Xiaolong Wang
Multi-shot Temporal Event Localization: a Benchmark
AuthorsXiaolong Liu, Yao Hu, Song Bai, Fei Ding, Xiang Bai, Philip H. S. Torr
End-to-End Human Pose and Mesh Reconstruction with Transformers
AuthorsKevin Lin, Lijuan Wang, Zicheng Liu
A 3D GAN for Improved Large-pose Facial Recognition
AuthorsRichard T. Marriott, Sami Romdhani, Liming Chen
From Points to Multi-Object 3D Reconstruction
AuthorsFrancis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari
A Second-Order Approach to Learning with Instance-Dependent Label Noise
AuthorsZhaowei Zhu, Tongliang Liu, Yang Liu
Generative Interventions for Causal Learning
AuthorsChengzhi Mao, Augustine Cha, Amogh Gupta, Hao Wang, Junfeng Yang, Carl Vondrick
Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder
AuthorsTal Daniel, Aviv Tamar
2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
AuthorsHengduo Li, Zuxuan Wu, Abhinav Shrivastava, Larry S. Davis
Binary Graph Neural Networks
AuthorsMehdi Bahri, Gaétan Bahl, Stefanos Zafeiriou
Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans
AuthorsSida Peng, Yuanqing Zhang, Yinghao Xu, Qianqian Wang, Qing Shuai, Hujun Bao, Xiaowei Zhou
Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
AuthorsSixiao Zheng, Jiachen Lu, Hengshuang Zhao, Xiatian Zhu, Zekun Luo, Yabiao Wang, Yanwei Fu, Jianfeng Feng, Tao Xiang, Philip H. S. Torr, Li Zhang
VinVL: Revisiting Visual Representations in Vision-Language Models
AuthorsPengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao
Style Normalization and Restitution for Domain Generalization and Adaptation
AuthorsXin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen
Bilateral Grid Learning for Stereo Matching Networks
AuthorsBin Xu, Yuhua Xu, Xiaoli Yang, Wei Jia, Yulan Guo
Learning Accurate Dense Correspondences and When to Trust Them
AuthorsPrune Truong, Martin Danelljan, Luc Van Gool, Radu Timofte
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
AuthorsRuohan Gao, Kristen Grauman
RepVGG: Making VGG-style ConvNets Great Again
AuthorsXiaohan Ding, Xiangyu Zhang, Ningning Ma, Jungong Han, Guiguang Ding, Jian Sun
WiCV 2020: The Seventh Women In Computer Vision Workshop
AuthorsHazel Doughty, Nour Karessli, Kathryn Leonard, Boyi Li, Carianne Martinez, Azadeh Mobasher, Arsha Nagrani, Srishti Yadav
Binary TTC: A Temporal Geofence for Autonomous Navigation
AuthorsAbhishek Badki, Orazio Gallo, Jan Kautz, Pradeep Sen
Temporal-Relational CrossTransformers for Few-Shot Action Recognition
AuthorsToby Perrett, Alessandro Masullo, Tilo Burghardt, Majid Mirmehdi, Dima Damen
GeoSim: Realistic Video Simulation via Geometry-Aware Composition for Self-Driving
AuthorsYun Chen, Frieda Rong, Shivam Duggal, Shenlong Wang, Xinchen Yan, Sivabalan Manivasagam, Shangjie Xue, Ersin Yumer, Raquel Urtasun
AdvSim: Generating Safety-Critical Scenarios for Self-Driving Vehicles
AuthorsJingkang Wang, Ava Pun, James Tu, Sivabalan Manivasagam, Abbas Sadat, Sergio Casas, Mengye Ren, Raquel Urtasun
End-to-end Interpretable Neural Motion Planner
AuthorsWenyuan Zeng, Wenjie Luo, Simon Suo, Abbas Sadat, Bin Yang, Sergio Casas, Raquel Urtasun
Deep Multi-Task Learning for Joint Localization, Perception, and Prediction
AuthorsJohn Phillips, Julieta Martinez, Ioan Andrei Bârsan, Sergio Casas, Abbas Sadat, Raquel Urtasun
Deep Parametric Continuous Convolutional Neural Networks
AuthorsShenlong Wang, Simon Suo, Wei-Chiu Ma, Andrei Pokrovsky, Raquel Urtasun
Joint Learning of 3D Shape Retrieval and Deformation
AuthorsMikaela Angelina Uy, Vladimir G. Kim, Minhyuk Sung, Noam Aigerman, Siddhartha Chaudhuri, Leonidas Guibas
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation
AuthorsBrendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor
GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning
AuthorsJuhyoung Lee, Sangyeob Kim, Sangjin Kim, Wooyoung Jo, Hoi-Jun Yoo
Generic Event Boundary Detection: A Benchmark for Event Segmentation
AuthorsMike Zheng Shou, Stan W. Lei, Weiyao Wang, Deepti Ghadiyaram, Matt Feiszli
Ikshana: A Theory of Human Scene Understanding Mechanism
AuthorsVenkata Satya Sai Ajay Daliparthi
Open World Compositional Zero-Shot Learning
AuthorsMassimiliano Mancini, Muhammad Ferjad Naeem, Yongqin Xian, Zeynep Akata
Learning Graph Embeddings for Compositional Zero-shot Learning
AuthorsMuhammad Ferjad Naeem, Yongqin Xian, Federico Tombari, Zeynep Akata
Semi-Supervised Action Recognition with Temporal Contrastive Learning
AuthorsAnkit Singh, Omprakash Chakraborty, Ashutosh Varshney, Rameswar Panda, Rogerio Feris, Kate Saenko, Abir Das
Multi-Stage Progressive Image Restoration
AuthorsSyed Waqas Zamir, Aditya Arora, Salman Khan, Munawar Hayat, Fahad Shahbaz Khan, Ming-Hsuan Yang, Ling Shao
ZeroScatter: Domain Transfer for Long Distance Imaging and Vision through Scattering Media
AuthorsZheng Shi, Ethan Tseng, Mario Bijelic, Werner Ritter, Felix Heide
Instance Localization for Self-supervised Detection Pretraining
AuthorsCeyuan Yang, Zhirong Wu, Bolei Zhou, Stephen Lin
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
AuthorsSoravit Changpinyo, Piyush Sharma, Nan Ding, Radu Soricut
StablePose: Learning 6D Object Poses from Geometrically Stable Patches
AuthorsYifei Shi, Junwen Huang, Xin Xu, Yifan Zhang, Kai Xu
GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation
AuthorsGu Wang, Fabian Manhardt, Federico Tombari, Xiangyang Ji
4D Panoptic LiDAR Segmentation
AuthorsMehmet Aygün, Aljoša Ošep, Mark Weber, Maxim Maximov, Cyrill Stachniss, Jens Behley, Laura Leal-Taixé
IBRNet: Learning Multi-View Image-Based Rendering
AuthorsQianqian Wang, Zhicheng Wang, Kyle Genova, Pratul Srinivasan, Howard Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, Thomas Funkhouser
Training Generative Adversarial Networks in One Stage
AuthorsChengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song
Counterfactual Zero-Shot and Open-Set Visual Recognition
AuthorsZhongqi Yue, Tan Wang, Hanwang Zhang, Qianru Sun, Xian-Sheng Hua
Self-Supervised Simultaneous Multi-Step Prediction of Road Dynamics and Cost Map
AuthorsElmira Amirloo, Mohsen Rohani, Ershad Banijamali, Jun Luo, Pascal Poupart
Categorical Depth Distribution Network for Monocular 3D Object Detection
AuthorsCody Reading, Ali Harakeh, Julia Chae, Steven L. Waslander
Domain Generalization via Inference-time Label-Preserving Target Projections
AuthorsPrashant Pandey, Mrigank Raman, Sumanth Varambally, Prathosh AP
A Deep Emulator for Secondary Motion of 3D Characters
AuthorsMianlun Zheng, Yi Zhou, Duygu Ceylan, Jernej Barbič
Coarse-Fine Networks for Temporal Activity Detection in Videos
AuthorsKumara Kahatapitiya, Michael S. Ryoo
Exploring Complementary Strengths of Invariant and Equivariant Representations for Few-Shot Learning
AuthorsMamshad Nayeem Rizve, Salman Khan, Fahad Shahbaz Khan, Mubarak Shah
There is More than Meets the Eye: Self-Supervised Multi-Object Detection and Tracking with Sound by Distilling Multimodal Knowledge
AuthorsFrancisco Rivera Valverde, Juana Valeria Hurtado, Abhinav Valada
Image-to-image Translation via Hierarchical Style Disentanglement
AuthorsXinyang Li, Shengchuan Zhang, Jie Hu, Liujuan Cao, Xiaopeng Hong, Xudong Mao, Feiyue Huang, Yongjian Wu, Rongrong Ji
Diffusion Probabilistic Models for 3D Point Cloud Generation
AuthorsShitong Luo, Wei Hu
Depth from Camera Motion and Object Detection
AuthorsBrent A. Griffin, Jason J. Corso
Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition
AuthorsStephen Hausler, Sourav Garg, Ming Xu, Michael Milford, Tobias Fischer
When Age-Invariant Face Recognition Meets Face Age Synthesis: A Multi-Task Learning Framework
AuthorsZhizhong Huang, Junping Zhang, Hongming Shan
MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing
AuthorsZhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan
Square Root Bundle Adjustment for Large-Scale Reconstruction
AuthorsNikolaus Demmel, Christiane Sommer, Daniel Cremers, Vladyslav Usenko
Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain
AuthorsHonggu Liu, Xiaodan Li, Wenbo Zhou, Yuefeng Chen, Yuan He, Hui Xue, Weiming Zhang, Nenghai Yu
Semantic Relation Reasoning for Shot-Stable Few-Shot Object Detection
AuthorsChenchen Zhu, Fangyi Chen, Uzair Ahmed, Zhiqiang Shen, Marios Savvides
Multi-institutional Collaborations for Improving Deep Learning-based Magnetic Resonance Image Reconstruction Using Federated Learning
AuthorsPengfei Guo, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel
Adaptive Consistency Regularization for Semi-Supervised Transfer Learning
AuthorsAbulikemu Abuduweili, Xingjian Li, Humphrey Shi, Cheng-Zhong Xu, Dejing Dou
General Instance Distillation for Object Detection
AuthorsXing Dai, Zeren Jiang, Zhao Wu, Yiping Bao, Zhicheng Wang, Si Liu, Erjin Zhou
$S^3$: Learnable Sparse Signal Superdensity for Guided Depth Estimation
AuthorsYu-Kai Huang, Yueh-Cheng Liu, Tsung-Han Wu, Hung-Ting Su, Yu-Cheng Chang, Tsung-Lin Tsou, Yu-An Wang, Winston H. Hsu
Style-based Point Generator with Adversarial Rendering for Point Cloud Completion
AuthorsChulin Xie, Chuxin Wang, Bo Zhang, Hao Yang, Dong Chen, Fang Wen
Cross-View Regularization for Domain Adaptive Panoptic Segmentation
AuthorsJiaxing Huang, Dayan Guan, Aoran Xiao, Shijian Lu
Towards Open World Object Detection
AuthorsK J Joseph, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian
Learning Asynchronous and Sparse Human-Object Interaction in Videos
AuthorsRomero Morais, Vuong Le, Svetha Venkatesh, Truyen Tran
DeepTag: An Unsupervised Deep Learning Method for Motion Tracking on Cardiac Tagging Magnetic Resonance Images
AuthorsMeng Ye, Mikael Kanski, Dong Yang, Qi Chang, Zhennan Yan, Qiaoying Huang, Leon Axel, Dimitris Metaxas
A Cross Channel Context Model for Latents in Deep Image Compression
AuthorsChangyue Ma, Zhao Wang, Ruling Liao, Yan Ye
PointGuard: Provably Robust 3D Point Cloud Classification
AuthorsHongbin Liu, Jinyuan Jia, Neil Zhenqiang Gong
TPCN: Temporal Point Cloud Networks for Motion Forecasting
AuthorsMaosheng Ye, Tongyi Cao, Qifeng Chen
Self-supervised Geometric Perception
AuthorsHeng Yang, Wei Dong, Luca Carlone, Vladlen Koltun
Anycost GANs for Interactive Image Synthesis and Editing
AuthorsJi Lin, Richard Zhang, Frieder Ganz, Song Han, Jun-Yan Zhu
Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food
AuthorsQuin Thames, Arjun Karpur, Wade Norris, Fangting Xia, Liviu Panait, Tobias Weyand, Jack Sim
Teachers Do More Than Teach: Compressing Image-to-Image Models
AuthorsQing Jin, Jian Ren, Oliver J. Woodford, Jiazhuo Wang, Geng Yuan, Yanzhi Wang, Sergey Tulyakov
Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach
AuthorsGiang Truong, Huu Le, David Suter, Erchuan Zhang, Syed Zulqarnain Gilani
LOHO: Latent Optimization of Hairstyles via Orthogonalization
AuthorsRohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi
Selective Replay Enhances Learning in Online Continual Analogical Reasoning
AuthorsTyler L. Hayes, Christopher Kanan
Simultaneously Localize, Segment and Rank the Camouflaged Objects
AuthorsYunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan
Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning
AuthorsAli Cheraghian, Shafin Rahman, Pengfei Fang, Soumava Kumar Roy, Lars Petersson, Mehrtash Harandi
Learning Statistical Texture for Semantic Segmentation
AuthorsLanyun Zhu, Deyi Ji, Shiping Zhu, Weihao Gan, Wei Wu, Junjie Yan
Consensus Maximisation Using Influences of Monotone Boolean Functions
AuthorsRuwan Tennakoon, David Suter, Erchuan Zhang, Tat-Jun Chin, Alireza Bab-Hadiashar
MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection
AuthorsVibashan VS, Vikram Gupta, Poojan Oza, Vishwanath A. Sindagi, Vishal M. Patel
Robust Point Cloud Registration Framework Based on Deep Graph Matching
AuthorsKexue Fu, Shaolei Liu, Xiaoyuan Luo, Manning Wang
ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring
AuthorsDongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, Hongdong Li
Repurposing GANs for One-shot Semantic Part Segmentation
AuthorsNontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn
What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels
AuthorsJeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa
OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection
AuthorsTingting Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing
AuthorsTianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc Van Gool
Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification
AuthorsFengxiang Yang, Zhun Zhong, Zhiming Luo, Yuanzheng Cai, Yaojin Lin, Shaozi Li, Nicu Sebe
Behavior-Driven Synthesis of Human Dynamics
AuthorsAndreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer
How Privacy-Preserving are Line Clouds? Recovering Scene Details from 3D Lines
AuthorsKunal Chelani, Fredrik Kahl, Torsten Sattler
Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles
AuthorsJevgenij Gamper, Nasir Rajpoot
Knowledge Evolution in Neural Networks
AuthorsAhmed Taha, Abhinav Shrivastava, Larry Davis
MetaCorrection: Domain-aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation
AuthorsXiaoqing Guo, Chen Yang, Baopu Li, Yixuan Yuan
BASAR: Black-box Attack on Skeletal Action Recognition
AuthorsYunfeng Diao, Tianjia Shao, Yong-Liang Yang, Kun Zhou, He Wang
Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
AuthorsGengcong Yang, Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yujiu Yang
Open-book Video Captioning with Retrieve-Copy-Generate Network
AuthorsZiqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Ying Deng, Weiming Hu
Understanding the Robustness of Skeleton-based Action Recognition under Adversarial Attack
AuthorsHe Wang, Feixiang He, Zhexi Peng, Tianjia Shao, Yong-Liang Yang, Kun Zhou, David Hogg
PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency
AuthorsXuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai
Contrastive Neural Architecture Search with Neural Architecture Comparators
AuthorsYaofo Chen, Yong Guo, Qi Chen, Minli Li, Wei Zeng, Yaowei Wang, Mingkui Tan
NeX: Real-time View Synthesis with Neural Basis Expansion
AuthorsSuttisak Wizadwongsa, Pakkapon Phongthawee, Jiraphon Yenphraphai, Supasorn Suwajanakorn
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
AuthorsYinan He, Bei Gan, Siyu Chen, Yichun Zhou, Guojun Yin, Luchuan Song, Lu Sheng, Jing Shao, Ziwei Liu
Manifold Regularized Dynamic Network Pruning
AuthorsYehui Tang, Yunhe Wang, Yixing Xu, Yiping Deng, Chao Xu, Dacheng Tao, Chang Xu
AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation
AuthorsDenis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa
Limitations of Post-Hoc Feature Alignment for Robustness
AuthorsCollin Burns, Jacob Steinhardt
VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples
AuthorsTian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, Wei Liu
FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding
AuthorsBo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang
SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance
AuthorsFu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang
Reformulating HOI Detection as Adaptive Set Prediction
AuthorsMingfei Chen, Yue Liao, Si Liu, Zhiyuan Chen, Fei Wang, Chen Qian
FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space
AuthorsQuande Liu, Cheng Chen, Jing Qin, Qi Dou, Pheng-Ann Heng
Spatially Consistent Representation Learning
AuthorsByungseok Roh, Wuhyun Shin, Ildoo Kim, Sungwoong Kim
Involution: Inverting the Inherence of Convolution for Visual Recognition
AuthorsDuo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen
Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations
AuthorsUmberto Michieli, Pietro Zanuttigh
Holistic 3D Scene Understanding from a Single Image with Implicit Representation
AuthorsCheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
AuthorsShancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality
AuthorsTrisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha
MagFace: A Universal Representation for Face Recognition and Quality Assessment
AuthorsQiang Meng, Shichao Zhao, Zhida Huang, Feng Zhou
Temporal Action Segmentation from Timestamp Supervision
AuthorsZhe Li, Yazan Abu Farha, Juergen Gall
SMPLicit: Topology-aware Generative Model for Clothed People
AuthorsEnric Corona, Albert Pumarola, Guillem Alenyà, Gerard Pons-Moll, Francesc Moreno-Noguer
Fast and Accurate Model Scaling
AuthorsPiotr Dollár, Mannat Singh, Ross Girshick
Diverse Semantic Image Synthesis via Probability Distribution Modeling
AuthorsZhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Bin Liu, Gang Hua, Nenghai Yu
CoMoGAN: continuous model-guided image-to-image translation
AuthorsFabio Pizzati, Pietro Cerri, Raoul de Charette
The Semi-Supervised iNaturalist-Aves Challenge at FGVC7 Workshop
AuthorsJong-Chyi Su, Subhransu Maji
CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement
AuthorsNoranart Vesdapunt, Baoyuan Wang
Deep Gaussian Scale Mixture Prior for Spectral Compressive Imaging
AuthorsTao Huang, Weisheng Dong, Xin Yuan, Jinjian Wu, Guangming Shi
Learnable Companding Quantization for Accurate Low-bit Neural Networks
AuthorsKohei Yamamoto
Deep Dual Consecutive Network for Human Pose Estimation
AuthorsZhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, Xun Wang
Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator
AuthorsSian-Yao Huang, Wei-Ta Chu
ACTION-Net: Multipath Excitation for Action Recognition
AuthorsZhengwei Wang, Qi She, Aljosa Smolic
Cross-Domain Similarity Learning for Face Recognition in Unseen Domains
AuthorsMasoud Faraki, Xiang Yu, Yi-Hsuan Tsai, Yumin Suh, Manmohan Chandraker
Uncertainty-guided Model Generalization to Unseen Domains
AuthorsFengchun Qiao, Xi Peng
Student-Teacher Learning from Clean Inputs to Noisy Inputs
AuthorsGuanzhe Hong, Zhiyuan Mao, Xiaojun Lin, Stanley H. Chan
Reconsidering Representation Alignment for Multi-view Clustering
AuthorsDaniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael Kampffmeyer
Cycle4Completion: Unpaired Point Cloud Completion using Cycle Transformation with Missing Region Coding
AuthorsXin Wen, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu
Learning a Proposal Classifier for Multiple Object Tracking
AuthorsPeng Dai, Renliang Weng, Wongun Choi, Changshui Zhang, Zhangping He, Wei Ding
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
AuthorsHaolin Liu, Anran Lin, Xiaoguang Han, Lei Yang, Yizhou Yu, Shuguang Cui
Semi-Supervised Video Deraining with Dynamical Rain Generator
AuthorsZongsheng Yue, Jianwen Xie, Qian Zhao, Deyu Meng
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion
AuthorsHo Kei Cheng, Yu-Wing Tai, Chi-Keung Tang
Monte Carlo Scene Search for 3D Scene Understanding
AuthorsShreyas Hampali, Sinisa Stekovic, Sayan Deb Sarkar, Chetan Srinivasa Kumar, Friedrich Fraundorfer, Vincent Lepetit
3DCaricShop: A Dataset and A Baseline Method for Single-view 3D Caricature Face Reconstruction
AuthorsYuda Qiu, Xiaojie Xu, Lingteng Qiu, Yan Pan, Yushuang Wu, Weikai Chen, Xiaoguang Han
Refine Myself by Teaching Myself: Feature Refinement via Self-Knowledge Distillation
AuthorsMingi Ji, Seungjae Shin, Seunghyun Hwang, Gibeom Park, Il-Chul Moon
Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging
AuthorsÁlvaro Parra, Shin-Fang Chng, Tat-Jun Chin, Anders Eriksson, Ian Reid
Beyond Image to Depth: Improving Depth Prediction using Echoes
AuthorsKranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma
Track to Detect and Segment: An Online Multi-Object Tracker
AuthorsJialian Wu, Jiale Cao, Liangchen Song, Yu Wang, Ming Yang, Junsong Yuan
Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation
AuthorsJungbeom Lee, Eunji Kim, Sungroh Yoon
BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation
AuthorsJungbeom Lee, Jihun Yi, Chaehun Shin, Sungroh Yoon
Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection
AuthorsJiaming Li, Hongtao Xie, Jiahong Li, Zhongyuan Wang, Yongdong Zhang
Back to the Feature: Learning Robust Camera Localization from Pixels to Pose
AuthorsPaul-Edouard Sarlin, Ajaykumar Unagar, Måns Larsson, Hugo Germain, Carl Toft, Viktor Larsson, Marc Pollefeys, Vincent Lepetit, Lars Hammarstrand, Fredrik Kahl, Torsten Sattler
Learning Discriminative Prototypes with Dynamic Time Warping
AuthorsXiaobin Chang, Frederick Tung, Greg Mori
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
AuthorsJialun Peng, Dong Liu, Songcen Xu, Houqiang Li
On Semantic Similarity in Video Retrieval
AuthorsMichael Wray, Hazel Doughty, Dima Damen
SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation
AuthorsDongfang Liu, Yiming Cui, Wenbo Tan, Yingjie Chen
Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
AuthorsZhaoyuan Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao
Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks
AuthorsDespoina Paschalidou, Angelos Katharopoulos, Andreas Geiger, Sanja Fidler
CDFI: Compression-Driven Network Design for Frame Interpolation
AuthorsTianyu Ding, Luming Liang, Zhihui Zhu, Ilya Zharkov
Generic Perceptual Loss for Modeling Structured Output Dependencies
AuthorsYifan Liu, Hao Chen, Yu Chen, Wei Yin, Chunhua Shen
Dynamic Transfer for Multi-Source Domain Adaptation
AuthorsYunsheng Li, Lu Yuan, Yinpeng Chen, Pei Wang, Nuno Vasconcelos
Skeleton Merger: an Unsupervised Aligned Keypoint Detector
AuthorsRuoxi Shi, Zhengrong Xue, Yang You, Cewu Lu
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
AuthorsJoakim Bruslund Haurum, Thomas B. Moeslund
Probabilistic 3D Human Shape and Pose Estimation from Multiple Unconstrained Images in the Wild
AuthorsAkash Sengupta, Ignas Budvytis, Roberto Cipolla
Video Class Agnostic Segmentation Benchmark for Autonomous Driving
AuthorsMennatullah Siam, Alex Kendall, Martin Jagersand
Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation
AuthorsM. Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc Van Gool, Rainer Stiefelhagen
MoViNets: Mobile Video Networks for Efficient Video Recognition
AuthorsDan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong
Context-aware Biaffine Localizing Network for Temporal Sentence Grounding
AuthorsDaizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Yu Cheng, Wei Wei, Zichuan Xu, Yulai Xie
Anchor-Free Person Search
AuthorsYichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
AuthorsNing Wang, Wengang Zhou, Jie Wang, Houqiang Li
Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales
AuthorsYifan Sun, Yuke Zhu, Yuhan Zhang, Pengkun Zheng, Xi Qiu, Chi Zhang, Yichen Wei
Context-Aware Layout to Image Generation with Enhanced Object Appearance
AuthorsSen He, Wentong Liao, Michael Ying Yang, Yongxin Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
AuthorsLong Chen, Zhihong Jiang, Jun Xiao, Wei Liu
Deep Implicit Moving Least-Squares Functions for 3D Reconstruction
AuthorsShi-Lin Liu, Hao-Xiang Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu
Group-aware Label Transfer for Domain Adaptive Person Re-identification
AuthorsKecheng Zheng, Wu Liu, Lingxiao He, Tao Mei, Jiebo Luo, Zheng-Jun Zha
Transferable Semantic Augmentation for Domain Adaptation
AuthorsShuang Li, Mixue Xie, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Wei Li
MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition
AuthorsShuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng
MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation
AuthorsHansheng Chen, Yuyao Huang, Wei Tian, Zhong Gao, Lu Xiong
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
AuthorsAshish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake Hechtman, Jonathon Shlens
Weakly Supervised Instance Segmentation for Videos with Temporal Mask Consistency
AuthorsQing Liu, Vignesh Ramanathan, Dhruv Mahajan, Alan Yuille, Zhenheng Yang
Efficient Regional Memory Network for Video Object Segmentation
AuthorsHaozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun
Scene-Intuitive Agent for Remote Embodied Visual Grounding
AuthorsXiangru Lin, Guanbin Li, Yizhou Yu
Convex Online Video Frame Subset Selection using Multiple Criteria for Data Efficient Autonomous Driving
AuthorsSoumi Das, Harikrishna Patibandla, Suparna Bhattacharya, Kshounis Bera, Niloy Ganguly, Sourangshu Bhattacharya
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
AuthorsAmaia Salvador, Erhan Gundogdu, Loris Bazzani, Michael Donoser
Repetitive Activity Counting by Sight and Sound
AuthorsYunhua Zhang, Ling Shao, Cees G. M. Snoek
Temporal Context Aggregation Network for Temporal Action Proposal Refinement
AuthorsZhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang
M3DSSD: Monocular 3D Single Stage Object Detector
AuthorsShujie Luo, Hang Dai, Ling Shao, Yong Ding
Structure-Aware Face Clustering on a Large-Scale Graph with $10^{7}$ Nodes
AuthorsShuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie Zhou
Dynamic Slimmable Network
AuthorsChanglin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang
Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition
AuthorsEnrique Sanchez, Mani Kumar Tellamekala, Michel Valstar, Georgios Tzimiropoulos
Diverse Branch Block: Building a Convolution as an Inception-like Unit
AuthorsXiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding
DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation
AuthorsSeunghun Lee, Sunghyun Cho, Sunghoon Im
Efficient Feature Transformations for Discriminative and Generative Continual Learning
AuthorsVinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin
Closing the Loop: Joint Rain Generation and Removal via Disentangled Image Translation
AuthorsYuntong Ye, Yi Chang, Hanyu Zhou, Luxin Yan
SSLayout360: Semi-Supervised Indoor Layout Estimation from 360-Degree Panorama
AuthorsPhi Vu Tran
Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting
AuthorsAyan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors
AuthorsChaoqi Chen, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu
Supervised Contrastive Replay: Revisiting the Nearest Class Mean Classifier in Online Class-Incremental Continual Learning
AuthorsZheda Mai, Ruiwen Li, Hyunwoo Kim, Scott Sanner
Robust and Accurate Object Detection via Adversarial Learning
AuthorsXiangning Chen, Cihang Xie, Mingxing Tan, Li Zhang, Cho-Jui Hsieh, Boqing Gong
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval
AuthorsAyan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
AuthorsChi Zhang, Baoxiong Jia, Song-Chun Zhu, Yixin Zhu
ACRE: Abstract Causal REasoning Beyond Covariation
AuthorsChi Zhang, Baoxiong Jia, Mark Edmonds, Song-Chun Zhu, Yixin Zhu
Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification
AuthorsPeng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang
Bidirectional Projection Network for Cross Dimension Scene Understanding
AuthorsWenbo Hu, Hengshuang Zhao, Li Jiang, Jiaya Jia, Tien-Tsin Wong
Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation
AuthorsDohun Lim, Hyeonseok Lee, Sungchan Kim
Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling
AuthorsZhichao Huang, Xintong Han, Jia Xu, Tong Zhang
Distilling Object Detectors via Decoupled Features
AuthorsJianyuan Guo, Kai Han, Yunhe Wang, Han Wu, Xinghao Chen, Chunjing Xu, Chang Xu
Tuning IR-cut Filter for Illumination-aware Spectral Reconstruction from RGB
AuthorsBo Sun, Junchi Yan, Xiao Zhou, Yinqiang Zheng
LiBRe: A Practical Bayesian Approach to Adversarial Detection
AuthorsZhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu
Video Rescaling Networks with Joint Optimization Strategies for Downscaling and Upscaling
AuthorsYan-Cheng Huang, Yi-Hsin Chen, Cheng-You Lu, Hui-Po Wang, Wen-Hsiao Peng, Ching-Chun Huang
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences
AuthorsShun-Cheng Wu, Johanna Wald, Keisuke Tateno, Nassir Navab, Federico Tombari
Embedding Transfer with Label Relaxation for Improved Metric Learning
AuthorsSungyeon Kim, Dongwon Kim, Minsu Cho, Suha Kwak
Panoptic-PolarNet: Proposal-free LiDAR Point Cloud Panoptic Segmentation
AuthorsZixiang Zhou, Yang Zhang, Hassan Foroosh
Picasso: A CUDA-based Library for Deep Learning over 3D Meshes
AuthorsHuan Lei, Naveed Akhtar, Ajmal Mian
Learning Placeholders for Open-Set Recognition
AuthorsDa-Wei Zhou, Han-Jia Ye, De-Chuan Zhan
Bridging the Visual Gap: Wide-Range Image Blending
AuthorsChia-Ni Lu, Ya-Chu Chang, Wei-Chen Chiu
ReAgent: Point Cloud Registration using Imitation and Reinforcement Learning
AuthorsDominik Bauer, Timothy Patten, Markus Vincze
Zero-shot Adversarial Quantization
AuthorsYuang Liu, Wei Zhang, Jun Wang
Generalizing to the Open World: Deep Visual Odometry with Online Adaptation
AuthorsShunkai Li, Xin Wu, Yingdian Cao, Hongbin Zha
LiDAR R-CNN: An Efficient and Universal 3D Object Detector
AuthorsZhichao Li, Feng Wang, Naiyan Wang
Checkerboard Context Model for Efficient Learned Image Compression
AuthorsDailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin
POSEFusion: Pose-guided Selective Fusion for Single-view Human Volumetric Capture
AuthorsZhe Li, Tao Yu, Zerong Zheng, Kaiwen Guo, Yebin Liu
Attention-guided Image Compression by Deep Reconstruction of Compressive Sensed Saliency Skeleton
AuthorsXi Zhang, Xiaolin Wu
No frame left behind: Full Video Action Recognition
AuthorsXin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert
Capsule Network is Not More Robust than Convolutional Network
AuthorsJindong Gu, Volker Tresp, Han Hu
Cloud2Curve: Generation and Vectorization of Parametric Sketches
AuthorsAyan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song
TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
AuthorsLi Xu, He Huang, Jun Liu
Enhancing the Transferability of Adversarial Attacks through Variance Tuning
AuthorsXiaosen Wang, Kun He
RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening
AuthorsSungha Choi, Sanghun Jung, Huiwon Yun, Joanne Kim, Seungryong Kim, Jaegul Choo
StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval
AuthorsAneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song
Slimmable Compressive Autoencoders for Practical Neural Image Compression
AuthorsFei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov
Adaptive Methods for Real-World Domain Generalization
AuthorsAbhimanyu Dubey, Vignesh Ramanathan, Alex Pentland, Dhruv Mahajan
High-Fidelity and Arbitrary Face Editing
AuthorsYue Gao, Fangyun Wei, Jianmin Bao, Shuyang Gu, Dong Chen, Fang Wen, Zhouhui Lian
High-fidelity Face Tracking for AR/VR via Deep Lighting Adaptation
AuthorsLele Chen, Chen Cao, Fernando De la Torre, Jason Saragih, Chenliang Xu, Yaser Sheikh
Domain-robust VQA with diverse datasets and methods but no target labels
AuthorsMingda Zhang, Tristan Maidment, Ahmad Diab, Adriana Kovashka, Rebecca Hwa
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning
AuthorsMadeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala
Noise-resistant Deep Metric Learning with Ranking-based Instance Selection
AuthorsChang Liu, Han Yu, Boyang Li, Zhiqi Shen, Zhanning Gao, Peiran Ren, Xuansong Xie, Lizhen Cui, Chunyan Miao
Face Forensics in the Wild
AuthorsTianfei Zhou, Wenguan Wang, Zhiyuan Liang, Jianbing Shen
Fully Convolutional Scene Graph Generation
AuthorsHengyue Liu, Ning Yan, Masood S. Mortazavi, Bir Bhanu
Self-Guided and Cross-Guided Learning for Few-Shot Segmentation
AuthorsBingfeng Zhang, Jimin Xiao, Terry Qin
Learnable Graph Matching: Incorporating Graph Partitioning with Deep Feature Learning for Multiple Object Tracking
AuthorsJiawei He, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang
Repopulating Street Scenes
AuthorsYifan Wang, Andrew Liu, Richard Tucker, Jiajun Wu, Brian L. Curless, Steven M. Seitz, Noah Snavely
Delving into Localization Errors for Monocular 3D Object Detection
AuthorsXinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang
Model-Contrastive Federated Learning
AuthorsQinbin Li, Bingsheng He, Dawn Song
Locate then Segment: A Strong Pipeline for Referring Image Segmentation
AuthorsYa Jing, Tao Kong, Wei Wang, Liang Wang, Lei Li, Tieniu Tan
Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection
AuthorsZhenyu Wang, Yali Li, Ye Guo, Lu Fang, Shengjin Wang
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition
AuthorsSongyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun
Source-Free Domain Adaptation for Semantic Segmentation
AuthorsYuang Liu, Wei Zhang, Jun Wang
Graph Stacked Hourglass Networks for 3D Human Pose Estimation
AuthorsTianhan Xu, Wataru Takano
CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
AuthorsCan Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou
Dynamic Domain Adaptation for Efficient Inference
AuthorsShuang Li, Jinming Zhang, Wenxuan Ma, Chi Harold Liu, Wei Li
Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
AuthorsShanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang
Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection
AuthorsLi Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, Li Zhang
Read and Attend: Temporal Localisation in Sign Language Videos
AuthorsGül Varol, Liliane Momeni, Samuel Albanie, Triantafyllos Afouras, Andrew Zisserman
Benchmarking Representation Learning for Natural World Image Collections
AuthorsGrant Van Horn, Elijah Cole, Sara Beery, Kimberly Wilber, Serge Belongie, Oisin Mac Aodha
Recognizing Actions in Videos from Unseen Viewpoints
AuthorsAJ Piergiovanni, Michael S. Ryoo
Visual Room Rearrangement
AuthorsLuca Weihs, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
AuthorsAntoine Miech, Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Andrew Zisserman
Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
AuthorsBowen Cheng, Ross Girshick, Piotr Dollár, Alexander C. Berg, Alexander Kirillov
Rectification-based Knowledge Retention for Continual Learning
AuthorsPravendra Singh, Pratik Mazumder, Piyush Rai, Vinay P. Namboodiri
DAP: Detection-Aware Pre-training with Weak Supervision
AuthorsYuanyi Zhong, Jianfeng Wang, Lijuan Wang, Jian Peng, Yu-Xiong Wang, Lei Zhang
Denoise and Contrast for Category Agnostic Shape Completion
AuthorsAntonio Alliegro, Diego Valsesia, Giulia Fracastoro, Enrico Magli, Tatiana Tommasi
Mask-ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging
AuthorsIlya Chugunov, Seung-Hwan Baek, Qiang Fu, Wolfgang Heidrich, Felix Heide
SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification
AuthorsZijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia
Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark
AuthorsXiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu
Prototypical Cross-domain Self-supervised Learning for Few-shot Unsupervised Domain Adaptation
AuthorsXiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto Sangiovanni Vincentelli
Convolutional Hough Matching Networks
AuthorsJuhong Min, Minsu Cho
Online Learning of a Probabilistic and Adaptive Scene Representation
AuthorsZike Yan, Xin Wang, Hongbin Zha
Towards Open World Object Detection
AuthorsK J Joseph, Salman Khan, Fahad Shahbaz Khan, Vineeth N Balasubramanian
Learning Asynchronous and Sparse Human-Object Interaction in Videos
AuthorsRomero Morais, Vuong Le, Svetha Venkatesh, Truyen Tran
DeepTag: An Unsupervised Deep Learning Method for Motion Tracking on Cardiac Tagging Magnetic Resonance Images
AuthorsMeng Ye, Mikael Kanski, Dong Yang, Qi Chang, Zhennan Yan, Qiaoying Huang, Leon Axel, Dimitris Metaxas
A Cross Channel Context Model for Latents in Deep Image Compression
AuthorsChangyue Ma, Zhao Wang, Ruling Liao, Yan Ye
PointGuard: Provably Robust 3D Point Cloud Classification
AuthorsHongbin Liu, Jinyuan Jia, Neil Zhenqiang Gong
TPCN: Temporal Point Cloud Networks for Motion Forecasting
AuthorsMaosheng Ye, Tongyi Cao, Qifeng Chen
Self-supervised Geometric Perception
AuthorsHeng Yang, Wei Dong, Luca Carlone, Vladlen Koltun
Anycost GANs for Interactive Image Synthesis and Editing
AuthorsJi Lin, Richard Zhang, Frieder Ganz, Song Han, Jun-Yan Zhu
Nutrition5k: Towards Automatic Nutritional Understanding of Generic Food
AuthorsQuin Thames, Arjun Karpur, Wade Norris, Fangting Xia, Liviu Panait, Tobias Weyand, Jack Sim
Teachers Do More Than Teach: Compressing Image-to-Image Models
AuthorsQing Jin, Jian Ren, Oliver J. Woodford, Jiazhuo Wang, Geng Yuan, Yanzhi Wang, Sergey Tulyakov
Unsupervised Learning for Robust Fitting: A Reinforcement Learning Approach
AuthorsGiang Truong, Huu Le, David Suter, Erchuan Zhang, Syed Zulqarnain Gilani
LOHO: Latent Optimization of Hairstyles via Orthogonalization
AuthorsRohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi
Selective Replay Enhances Learning in Online Continual Analogical Reasoning
AuthorsTyler L. Hayes, Christopher Kanan
Simultaneously Localize, Segment and Rank the Camouflaged Objects
AuthorsYunqiu Lv, Jing Zhang, Yuchao Dai, Aixuan Li, Bowen Liu, Nick Barnes, Deng-Ping Fan
Semantic-aware Knowledge Distillation for Few-Shot Class-Incremental Learning
AuthorsAli Cheraghian, Shafin Rahman, Pengfei Fang, Soumava Kumar Roy, Lars Petersson, Mehrtash Harandi
Learning Statistical Texture for Semantic Segmentation
AuthorsLanyun Zhu, Deyi Ji, Shiping Zhu, Weihao Gan, Wei Wu, Junjie Yan
Consensus Maximisation Using Influences of Monotone Boolean Functions
AuthorsRuwan Tennakoon, David Suter, Erchuan Zhang, Tat-Jun Chin, Alireza Bab-Hadiashar
MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection
AuthorsVibashan VS, Vikram Gupta, Poojan Oza, Vishwanath A. Sindagi, Vishal M. Patel
Robust Point Cloud Registration Framework Based on Deep Graph Matching
AuthorsKexue Fu, Shaolei Liu, Xiaoyuan Luo, Manning Wang
ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring
AuthorsDongxu Li, Chenchen Xu, Kaihao Zhang, Xin Yu, Yiran Zhong, Wenqi Ren, Hanna Suominen, Hongdong Li
Repurposing GANs for One-shot Semantic Part Segmentation
AuthorsNontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn
What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels
AuthorsJeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa
OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection
AuthorsTingting Liang, Yongtao Wang, Zhi Tang, Guosheng Hu, Haibin Ling
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing
AuthorsTianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc Van Gool
Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification
AuthorsFengxiang Yang, Zhun Zhong, Zhiming Luo, Yuanzheng Cai, Yaojin Lin, Shaozi Li, Nicu Sebe
Behavior-Driven Synthesis of Human Dynamics
AuthorsAndreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer
How Privacy-Preserving are Line Clouds? Recovering Scene Details from 3D Lines
AuthorsKunal Chelani, Fredrik Kahl, Torsten Sattler
Multiple Instance Captioning: Learning Representations from Histopathology Textbooks and Articles
AuthorsJevgenij Gamper, Nasir Rajpoot
Knowledge Evolution in Neural Networks
AuthorsAhmed Taha, Abhinav Shrivastava, Larry Davis
MetaCorrection: Domain-aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation
AuthorsXiaoqing Guo, Chen Yang, Baopu Li, Yixuan Yuan
BASAR: Black-box Attack on Skeletal Action Recognition
AuthorsYunfeng Diao, Tianjia Shao, Yong-Liang Yang, Kun Zhou, He Wang
Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
AuthorsGengcong Yang, Jingyi Zhang, Yong Zhang, Baoyuan Wu, Yujiu Yang
Open-book Video Captioning with Retrieve-Copy-Generate Network
AuthorsZiqi Zhang, Zhongang Qi, Chunfeng Yuan, Ying Shan, Bing Li, Ying Deng, Weiming Hu
Understanding the Robustness of Skeleton-based Action Recognition under Adversarial Attack
AuthorsHe Wang, Feixiang He, Zhexi Peng, Tianjia Shao, Yong-Liang Yang, Kun Zhou, David Hogg
PointDSC: Robust Point Cloud Registration using Deep Spatial Consistency
AuthorsXuyang Bai, Zixin Luo, Lei Zhou, Hongkai Chen, Lei Li, Zeyu Hu, Hongbo Fu, Chiew-Lan Tai
Contrastive Neural Architecture Search with Neural Architecture Comparators
AuthorsYaofo Chen, Yong Guo, Qi Chen, Minli Li, Wei Zeng, Yaowei Wang, Mingkui Tan
NeX: Real-time View Synthesis with Neural Basis Expansion
AuthorsSuttisak Wizadwongsa, Pakkapon Phongthawee, Jiraphon Yenphraphai, Supasorn Suwajanakorn
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis
AuthorsYinan He, Bei Gan, Siyu Chen, Yichun Zhou, Guojun Yin, Luchuan Song, Lu Sheng, Jing Shao, Ziwei Liu
Manifold Regularized Dynamic Network Pruning
AuthorsYehui Tang, Yunhe Wang, Yixing Xu, Yiping Deng, Chao Xu, Dacheng Tao, Chang Xu
AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation
AuthorsDenis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa
Limitations of Post-Hoc Feature Alignment for Robustness
AuthorsCollin Burns, Jacob Steinhardt
VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples
AuthorsTian Pan, Yibing Song, Tianyu Yang, Wenhao Jiang, Wei Liu
FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding
AuthorsBo Sun, Banghuai Li, Shengcai Cai, Ye Yuan, Chi Zhang
SDD-FIQA: Unsupervised Face Image Quality Assessment with Similarity Distribution Distance
AuthorsFu-Zhao Ou, Xingyu Chen, Ruixin Zhang, Yuge Huang, Shaoxin Li, Jilin Li, Yong Li, Liujuan Cao, Yuan-Gen Wang
Reformulating HOI Detection as Adaptive Set Prediction
AuthorsMingfei Chen, Yue Liao, Si Liu, Zhiyuan Chen, Fei Wang, Chen Qian
FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space
AuthorsQuande Liu, Cheng Chen, Jing Qin, Qi Dou, Pheng-Ann Heng
Spatially Consistent Representation Learning
AuthorsByungseok Roh, Wuhyun Shin, Ildoo Kim, Sungwoong Kim
Involution: Inverting the Inherence of Convolution for Visual Recognition
AuthorsDuo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen
Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations
AuthorsUmberto Michieli, Pietro Zanuttigh
Holistic 3D Scene Understanding from a Single Image with Implicit Representation
AuthorsCheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu
Read Like Humans: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Recognition
AuthorsShancheng Fang, Hongtao Xie, Yuxin Wang, Zhendong Mao, Yongdong Zhang
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality
AuthorsTrisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha
MagFace: A Universal Representation for Face Recognition and Quality Assessment
AuthorsQiang Meng, Shichao Zhao, Zhida Huang, Feng Zhou
Temporal Action Segmentation from Timestamp Supervision
AuthorsZhe Li, Yazan Abu Farha, Juergen Gall
SMPLicit: Topology-aware Generative Model for Clothed People
AuthorsEnric Corona, Albert Pumarola, Guillem Alenyà, Gerard Pons-Moll, Francesc Moreno-Noguer
Fast and Accurate Model Scaling
AuthorsPiotr Dollár, Mannat Singh, Ross Girshick
Diverse Semantic Image Synthesis via Probability Distribution Modeling
AuthorsZhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Bin Liu, Gang Hua, Nenghai Yu
CoMoGAN: continuous model-guided image-to-image translation
AuthorsFabio Pizzati, Pietro Cerri, Raoul de Charette
The Semi-Supervised iNaturalist-Aves Challenge at FGVC7 Workshop
AuthorsJong-Chyi Su, Subhransu Maji
CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement
AuthorsNoranart Vesdapunt, Baoyuan Wang
Deep Gaussian Scale Mixture Prior for Spectral Compressive Imaging
AuthorsTao Huang, Weisheng Dong, Xin Yuan, Jinjian Wu, Guangming Shi
Learnable Companding Quantization for Accurate Low-bit Neural Networks
AuthorsKohei Yamamoto
Deep Dual Consecutive Network for Human Pose Estimation
AuthorsZhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, Xun Wang
Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator
AuthorsSian-Yao Huang, Wei-Ta Chu
ACTION-Net: Multipath Excitation for Action Recognition
AuthorsZhengwei Wang, Qi She, Aljosa Smolic
Cross-Domain Similarity Learning for Face Recognition in Unseen Domains
AuthorsMasoud Faraki, Xiang Yu, Yi-Hsuan Tsai, Yumin Suh, Manmohan Chandraker
Uncertainty-guided Model Generalization to Unseen Domains
AuthorsFengchun Qiao, Xi Peng
Student-Teacher Learning from Clean Inputs to Noisy Inputs
AuthorsGuanzhe Hong, Zhiyuan Mao, Xiaojun Lin, Stanley H. Chan
Reconsidering Representation Alignment for Multi-view Clustering
AuthorsDaniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael Kampffmeyer
Cycle4Completion: Unpaired Point Cloud Completion using Cycle Transformation with Missing Region Coding
AuthorsXin Wen, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu
Learning a Proposal Classifier for Multiple Object Tracking
AuthorsPeng Dai, Renliang Weng, Wongun Choi, Changshui Zhang, Zhangping He, Wei Ding
Refer-it-in-RGBD: A Bottom-up Approach for 3D Visual Grounding in RGBD Images
AuthorsHaolin Liu, Anran Lin, Xiaoguang Han, Lei Yang, Yizhou Yu, Shuguang Cui
Semi-Supervised Video Deraining with Dynamical Rain Generator
AuthorsZongsheng Yue, Jianwen Xie, Qian Zhao, Deyu Meng
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion
AuthorsHo Kei Cheng, Yu-Wing Tai, Chi-Keung Tang
Monte Carlo Scene Search for 3D Scene Understanding
AuthorsShreyas Hampali, Sinisa Stekovic, Sayan Deb Sarkar, Chetan Srinivasa Kumar, Friedrich Fraundorfer, Vincent Lepetit
3DCaricShop: A Dataset and A Baseline Method for Single-view 3D Caricature Face Reconstruction
AuthorsYuda Qiu, Xiaojie Xu, Lingteng Qiu, Yan Pan, Yushuang Wu, Weikai Chen, Xiaoguang Han
Refine Myself by Teaching Myself: Feature Refinement via Self-Knowledge Distillation
AuthorsMingi Ji, Seungjae Shin, Seunghyun Hwang, Gibeom Park, Il-Chul Moon
Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging
AuthorsÁlvaro Parra, Shin-Fang Chng, Tat-Jun Chin, Anders Eriksson, Ian Reid
Beyond Image to Depth: Improving Depth Prediction using Echoes
AuthorsKranti Kumar Parida, Siddharth Srivastava, Gaurav Sharma
Track to Detect and Segment: An Online Multi-Object Tracker
AuthorsJialian Wu, Jiale Cao, Liangchen Song, Yu Wang, Ming Yang, Junsong Yuan
Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation
AuthorsJungbeom Lee, Eunji Kim, Sungroh Yoon
BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation
AuthorsJungbeom Lee, Jihun Yi, Chaehun Shin, Sungroh Yoon
Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection
AuthorsJiaming Li, Hongtao Xie, Jiahong Li, Zhongyuan Wang, Yongdong Zhang
Back to the Feature: Learning Robust Camera Localization from Pixels to Pose
AuthorsPaul-Edouard Sarlin, Ajaykumar Unagar, Måns Larsson, Hugo Germain, Carl Toft, Viktor Larsson, Marc Pollefeys, Vincent Lepetit, Lars Hammarstrand, Fredrik Kahl, Torsten Sattler
Learning Discriminative Prototypes with Dynamic Time Warping
AuthorsXiaobin Chang, Frederick Tung, Greg Mori
Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
AuthorsJialun Peng, Dong Liu, Songcen Xu, Houqiang Li
On Semantic Similarity in Video Retrieval
AuthorsMichael Wray, Hazel Doughty, Dima Damen
SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation
AuthorsDongfang Liu, Yiming Cui, Wenbo Tan, Yingjie Chen
Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
AuthorsZhaoyuan Yin, Jia Zheng, Weixin Luo, Shenhan Qian, Hanling Zhang, Shenghua Gao
Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks
AuthorsDespoina Paschalidou, Angelos Katharopoulos, Andreas Geiger, Sanja Fidler
CDFI: Compression-Driven Network Design for Frame Interpolation
AuthorsTianyu Ding, Luming Liang, Zhihui Zhu, Ilya Zharkov
Generic Perceptual Loss for Modeling Structured Output Dependencies
AuthorsYifan Liu, Hao Chen, Yu Chen, Wei Yin, Chunhua Shen
Dynamic Transfer for Multi-Source Domain Adaptation
AuthorsYunsheng Li, Lu Yuan, Yinpeng Chen, Pei Wang, Nuno Vasconcelos
Skeleton Merger: an Unsupervised Aligned Keypoint Detector
AuthorsRuoxi Shi, Zhengrong Xue, Yang You, Cewu Lu
Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
AuthorsJoakim Bruslund Haurum, Thomas B. Moeslund
Probabilistic 3D Human Shape and Pose Estimation from Multiple Unconstrained Images in the Wild
AuthorsAkash Sengupta, Ignas Budvytis, Roberto Cipolla
Video Class Agnostic Segmentation Benchmark for Autonomous Driving
AuthorsMennatullah Siam, Alex Kendall, Martin Jagersand
Temporally-Weighted Hierarchical Clustering for Unsupervised Action Segmentation
AuthorsM. Saquib Sarfraz, Naila Murray, Vivek Sharma, Ali Diba, Luc Van Gool, Rainer Stiefelhagen
MoViNets: Mobile Video Networks for Efficient Video Recognition
AuthorsDan Kondratyuk, Liangzhe Yuan, Yandong Li, Li Zhang, Mingxing Tan, Matthew Brown, Boqing Gong
Context-aware Biaffine Localizing Network for Temporal Sentence Grounding
AuthorsDaizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Yu Cheng, Wei Wei, Zichuan Xu, Yulai Xie
Anchor-Free Person Search
AuthorsYichao Yan, Jinpeng Li, Jie Qin, Song Bai, Shengcai Liao, Li Liu, Fan Zhu, Ling Shao
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking
AuthorsNing Wang, Wengang Zhou, Jie Wang, Houqiang Li
Dynamic Metric Learning: Towards a Scalable Metric Space to Accommodate Multiple Semantic Scales
AuthorsYifan Sun, Yuke Zhu, Yuhan Zhang, Pengkun Zheng, Xi Qiu, Chi Zhang, Yichen Wei
Context-Aware Layout to Image Generation with Enhanced Object Appearance
AuthorsSen He, Wentong Liao, Michael Ying Yang, Yongxin Yang, Yi-Zhe Song, Bodo Rosenhahn, Tao Xiang
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
AuthorsLong Chen, Zhihong Jiang, Jun Xiao, Wei Liu
Deep Implicit Moving Least-Squares Functions for 3D Reconstruction
AuthorsShi-Lin Liu, Hao-Xiang Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu
Group-aware Label Transfer for Domain Adaptive Person Re-identification
AuthorsKecheng Zheng, Wu Liu, Lingxiao He, Tao Mei, Jiebo Luo, Zheng-Jun Zha
Transferable Semantic Augmentation for Domain Adaptation
AuthorsShuang Li, Mixue Xie, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Wei Li
MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition
AuthorsShuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng
MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation
AuthorsHansheng Chen, Yuyao Huang, Wei Tian, Zhong Gao, Lu Xiong
Scaling Local Self-Attention for Parameter Efficient Visual Backbones
AuthorsAshish Vaswani, Prajit Ramachandran, Aravind Srinivas, Niki Parmar, Blake Hechtman, Jonathon Shlens
Weakly Supervised Instance Segmentation for Videos with Temporal Mask Consistency
AuthorsQing Liu, Vignesh Ramanathan, Dhruv Mahajan, Alan Yuille, Zhenheng Yang
Efficient Regional Memory Network for Video Object Segmentation
AuthorsHaozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun
Scene-Intuitive Agent for Remote Embodied Visual Grounding
AuthorsXiangru Lin, Guanbin Li, Yizhou Yu
Convex Online Video Frame Subset Selection using Multiple Criteria for Data Efficient Autonomous Driving
AuthorsSoumi Das, Harikrishna Patibandla, Suparna Bhattacharya, Kshounis Bera, Niloy Ganguly, Sourangshu Bhattacharya
Revamping Cross-Modal Recipe Retrieval with Hierarchical Transformers and Self-supervised Learning
AuthorsAmaia Salvador, Erhan Gundogdu, Loris Bazzani, Michael Donoser
Repetitive Activity Counting by Sight and Sound
AuthorsYunhua Zhang, Ling Shao, Cees G. M. Snoek
Temporal Context Aggregation Network for Temporal Action Proposal Refinement
AuthorsZhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang
M3DSSD: Monocular 3D Single Stage Object Detector
AuthorsShujie Luo, Hang Dai, Ling Shao, Yong Ding
Structure-Aware Face Clustering on a Large-Scale Graph with $10^{7}$ Nodes
AuthorsShuai Shen, Wanhua Li, Zheng Zhu, Guan Huang, Dalong Du, Jiwen Lu, Jie Zhou
Dynamic Slimmable Network
AuthorsChanglin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang
Affective Processes: stochastic modelling of temporal context for emotion and facial expression recognition
AuthorsEnrique Sanchez, Mani Kumar Tellamekala, Michel Valstar, Georgios Tzimiropoulos
Diverse Branch Block: Building a Convolution as an Inception-like Unit
AuthorsXiaohan Ding, Xiangyu Zhang, Jungong Han, Guiguang Ding
DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation
AuthorsSeunghun Lee, Sunghyun Cho, Sunghoon Im
Efficient Feature Transformations for Discriminative and Generative Continual Learning
AuthorsVinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin
Closing the Loop: Joint Rain Generation and Removal via Disentangled Image Translation
AuthorsYuntong Ye, Yi Chang, Hanyu Zhou, Luxin Yan
SSLayout360: Semi-Supervised Indoor Layout Estimation from 360-Degree Panorama
AuthorsPhi Vu Tran
Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting
AuthorsAyan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song
I3Net: Implicit Instance-Invariant Network for Adapting One-Stage Object Detectors
AuthorsChaoqi Chen, Zebiao Zheng, Yue Huang, Xinghao Ding, Yizhou Yu
Supervised Contrastive Replay: Revisiting the Nearest Class Mean Classifier in Online Class-Incremental Continual Learning
AuthorsZheda Mai, Ruiwen Li, Hyunwoo Kim, Scott Sanner
Robust and Accurate Object Detection via Adversarial Learning
AuthorsXiangning Chen, Cihang Xie, Mingxing Tan, Li Zhang, Cho-Jui Hsieh, Boqing Gong
More Photos are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval
AuthorsAyan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song
Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and Execution
AuthorsChi Zhang, Baoxiong Jia, Song-Chun Zhu, Yixin Zhu
ACRE: Abstract Causal REasoning Beyond Covariation
AuthorsChi Zhang, Baoxiong Jia, Mark Edmonds, Song-Chun Zhu, Yixin Zhu
Contrastive Learning based Hybrid Networks for Long-Tailed Image Classification
AuthorsPeng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang
Bidirectional Projection Network for Cross Dimension Scene Understanding
AuthorsWenbo Hu, Hengshuang Zhao, Li Jiang, Jiaya Jia, Tien-Tsin Wong
Building Reliable Explanations of Unreliable Neural Networks: Locally Smoothing Perspective of Model Interpretation
AuthorsDohun Lim, Hyeonseok Lee, Sungchan Kim
Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling
AuthorsZhichao Huang, Xintong Han, Jia Xu, Tong Zhang
Distilling Object Detectors via Decoupled Features
AuthorsJianyuan Guo, Kai Han, Yunhe Wang, Han Wu, Xinghao Chen, Chunjing Xu, Chang Xu
Tuning IR-cut Filter for Illumination-aware Spectral Reconstruction from RGB
AuthorsBo Sun, Junchi Yan, Xiao Zhou, Yinqiang Zheng
LiBRe: A Practical Bayesian Approach to Adversarial Detection
AuthorsZhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu
Video Rescaling Networks with Joint Optimization Strategies for Downscaling and Upscaling
AuthorsYan-Cheng Huang, Yi-Hsin Chen, Cheng-You Lu, Hui-Po Wang, Wen-Hsiao Peng, Ching-Chun Huang
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences
AuthorsShun-Cheng Wu, Johanna Wald, Keisuke Tateno, Nassir Navab, Federico Tombari
Embedding Transfer with Label Relaxation for Improved Metric Learning
AuthorsSungyeon Kim, Dongwon Kim, Minsu Cho, Suha Kwak
Panoptic-PolarNet: Proposal-free LiDAR Point Cloud Panoptic Segmentation
AuthorsZixiang Zhou, Yang Zhang, Hassan Foroosh
Picasso: A CUDA-based Library for Deep Learning over 3D Meshes
AuthorsHuan Lei, Naveed Akhtar, Ajmal Mian
Learning Placeholders for Open-Set Recognition
AuthorsDa-Wei Zhou, Han-Jia Ye, De-Chuan Zhan
Bridging the Visual Gap: Wide-Range Image Blending
AuthorsChia-Ni Lu, Ya-Chu Chang, Wei-Chen Chiu
ReAgent: Point Cloud Registration using Imitation and Reinforcement Learning
AuthorsDominik Bauer, Timothy Patten, Markus Vincze
Zero-shot Adversarial Quantization
AuthorsYuang Liu, Wei Zhang, Jun Wang
Generalizing to the Open World: Deep Visual Odometry with Online Adaptation
AuthorsShunkai Li, Xin Wu, Yingdian Cao, Hongbin Zha
LiDAR R-CNN: An Efficient and Universal 3D Object Detector
AuthorsZhichao Li, Feng Wang, Naiyan Wang
Checkerboard Context Model for Efficient Learned Image Compression
AuthorsDailan He, Yaoyan Zheng, Baocheng Sun, Yan Wang, Hongwei Qin
POSEFusion: Pose-guided Selective Fusion for Single-view Human Volumetric Capture
AuthorsZhe Li, Tao Yu, Zerong Zheng, Kaiwen Guo, Yebin Liu
Attention-guided Image Compression by Deep Reconstruction of Compressive Sensed Saliency Skeleton
AuthorsXi Zhang, Xiaolin Wu
No frame left behind: Full Video Action Recognition
AuthorsXin Liu, Silvia L. Pintea, Fatemeh Karimi Nejadasl, Olaf Booij, Jan C. van Gemert
Capsule Network is Not More Robust than Convolutional Network
AuthorsJindong Gu, Volker Tresp, Han Hu
Cloud2Curve: Generation and Vectorization of Parametric Sketches
AuthorsAyan Das, Yongxin Yang, Timothy Hospedales, Tao Xiang, Yi-Zhe Song
TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
AuthorsLi Xu, He Huang, Jun Liu
Enhancing the Transferability of Adversarial Attacks through Variance Tuning
AuthorsXiaosen Wang, Kun He
RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening
AuthorsSungha Choi, Sanghun Jung, Huiwon Yun, Joanne Kim, Seungryong Kim, Jaegul Choo
StyleMeUp: Towards Style-Agnostic Sketch-Based Image Retrieval
AuthorsAneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song
Slimmable Compressive Autoencoders for Practical Neural Image Compression
AuthorsFei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov
Adaptive Methods for Real-World Domain Generalization
AuthorsAbhimanyu Dubey, Vignesh Ramanathan, Alex Pentland, Dhruv Mahajan
High-Fidelity and Arbitrary Face Editing
AuthorsYue Gao, Fangyun Wei, Jianmin Bao, Shuyang Gu, Dong Chen, Fang Wen, Zhouhui Lian
High-fidelity Face Tracking for AR/VR via Deep Lighting Adaptation
AuthorsLele Chen, Chen Cao, Fernando De la Torre, Jason Saragih, Chenliang Xu, Yaser Sheikh
Domain-robust VQA with diverse datasets and methods but no target labels
AuthorsMingda Zhang, Tristan Maidment, Ahmad Diab, Adriana Kovashka, Rebecca Hwa
AGQA: A Benchmark for Compositional Spatio-Temporal Reasoning
AuthorsMadeleine Grunde-McLaughlin, Ranjay Krishna, Maneesh Agrawala
Noise-resistant Deep Metric Learning with Ranking-based Instance Selection
AuthorsChang Liu, Han Yu, Boyang Li, Zhiqi Shen, Zhanning Gao, Peiran Ren, Xuansong Xie, Lizhen Cui, Chunyan Miao
Face Forensics in the Wild
AuthorsTianfei Zhou, Wenguan Wang, Zhiyuan Liang, Jianbing Shen
Fully Convolutional Scene Graph Generation
AuthorsHengyue Liu, Ning Yan, Masood S. Mortazavi, Bir Bhanu
Self-Guided and Cross-Guided Learning for Few-Shot Segmentation
AuthorsBingfeng Zhang, Jimin Xiao, Terry Qin
Learnable Graph Matching: Incorporating Graph Partitioning with Deep Feature Learning for Multiple Object Tracking
AuthorsJiawei He, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang
Repopulating Street Scenes
AuthorsYifan Wang, Andrew Liu, Richard Tucker, Jiajun Wu, Brian L. Curless, Steven M. Seitz, Noah Snavely
Delving into Localization Errors for Monocular 3D Object Detection
AuthorsXinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang
Model-Contrastive Federated Learning
AuthorsQinbin Li, Bingsheng He, Dawn Song
Locate then Segment: A Strong Pipeline for Referring Image Segmentation
AuthorsYa Jing, Tao Kong, Wei Wang, Liang Wang, Lei Li, Tieniu Tan
Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection
AuthorsZhenyu Wang, Yali Li, Ye Guo, Lu Fang, Shengjin Wang
Distribution Alignment: A Unified Framework for Long-tail Visual Recognition
AuthorsSongyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun
Source-Free Domain Adaptation for Semantic Segmentation
AuthorsYuang Liu, Wei Zhang, Jun Wang
Graph Stacked Hourglass Networks for 3D Human Pose Estimation
AuthorsTianhan Xu, Wataru Takano
CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
AuthorsCan Zhang, Meng Cao, Dongming Yang, Jie Chen, Yuexian Zou
Dynamic Domain Adaptation for Efficient Inference
AuthorsShuang Li, Jinming Zhang, Wenxuan Ma, Chi Harold Liu, Wei Li
Bilevel Online Adaptation for Out-of-Domain Human Mesh Reconstruction
AuthorsShanyan Guan, Jingwei Xu, Yunbo Wang, Bingbing Ni, Xiaokang Yang
Depth-conditioned Dynamic Message Propagation for Monocular 3D Object Detection
AuthorsLi Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, Li Zhang
Read and Attend: Temporal Localisation in Sign Language Videos
AuthorsGül Varol, Liliane Momeni, Samuel Albanie, Triantafyllos Afouras, Andrew Zisserman
Benchmarking Representation Learning for Natural World Image Collections
AuthorsGrant Van Horn, Elijah Cole, Sara Beery, Kimberly Wilber, Serge Belongie, Oisin Mac Aodha
Recognizing Actions in Videos from Unseen Viewpoints
AuthorsAJ Piergiovanni, Michael S. Ryoo
Visual Room Rearrangement
AuthorsLuca Weihs, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi
Thinking Fast and Slow: Efficient Text-to-Visual Retrieval with Transformers
AuthorsAntoine Miech, Jean-Baptiste Alayrac, Ivan Laptev, Josef Sivic, Andrew Zisserman
Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
AuthorsBowen Cheng, Ross Girshick, Piotr Dollár, Alexander C. Berg, Alexander Kirillov
Rectification-based Knowledge Retention for Continual Learning
AuthorsPravendra Singh, Pratik Mazumder, Piyush Rai, Vinay P. Namboodiri
DAP: Detection-Aware Pre-training with Weak Supervision
AuthorsYuanyi Zhong, Jianfeng Wang, Lijuan Wang, Jian Peng, Yu-Xiong Wang, Lei Zhang
Denoise and Contrast for Category Agnostic Shape Completion
AuthorsAntonio Alliegro, Diego Valsesia, Giulia Fracastoro, Enrico Magli, Tatiana Tommasi
Mask-ToF: Learning Microlens Masks for Flying Pixel Correction in Time-of-Flight Imaging
AuthorsIlya Chugunov, Seung-Hwan Baek, Qiang Fu, Wolfgang Heidrich, Felix Heide
SimPLE: Similar Pseudo Label Exploitation for Semi-Supervised Classification
AuthorsZijian Hu, Zhengyu Yang, Xuefeng Hu, Ram Nevatia
Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark
AuthorsXiao Wang, Xiujun Shu, Zhipeng Zhang, Bo Jiang, Yaowei Wang, Yonghong Tian, Feng Wu
Prototypical Cross-domain Self-supervised Learning for Few-shot Unsupervised Domain Adaptation
AuthorsXiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto Sangiovanni Vincentelli
Convolutional Hough Matching Networks
AuthorsJuhong Min, Minsu Cho