【CVPR2022】论文列表与下载——PartOne

CVPR2022将于6月22日召开🎉🎉🎉,本次会议共收录了2067篇论文。由于数量较多,本文将分四个子文章呈现,可直接点击论文标题获取文档。
📃第二部分, 📃第三部分, 📃 第四部分
在这里插入图片描述

在这里插入图片描述

1. Part One

Dual Cross-Attention Learning for Fine-Grained Visual Categorization and Object Re-Identification [supp]
SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Normalization [supp]
GASP, a Generalized Framework for Agglomerative Clustering of Signed Graphs and Its Application to Instance Segmentation [supp]
Estimating Example Difficulty Using Variance of Gradients [supp]
One Loss for Quantization: Deep Hashing With Discrete Wasserstein Distributional Matching [supp]
Pixel Screening Based Intermediate Correction for Blind Deblurring [supp]
Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast
Controllable Animation of Fluid Elements in Still Images
Holocurtains: Programming Light Curtains via Binary Holography [supp]
Recurrent Dynamic Embedding for Video Object Segmentation [supp]
Deep Hierarchical Semantic Segmentation [supp]
f-SfT: Shape-From-Template With a Physics-Based Deformation Model [supp]
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism [supp]
DATA: Domain-Aware and Task-Aware Self-Supervised Learning [supp]
TWIST: Two-Way Inter-Label Self-Training for Semi-Supervised 3D Instance Segmentation [supp]
Voxel Set Transformer: A Set-to-Set Approach to 3D Object Detection From Point Clouds
Learning Adaptive Warping for Real-World Rolling Shutter Correction [supp]
Siamese Contrastive Embedding Network for Compositional Zero-Shot Learning
Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions [supp]
RIM-Net: Recursive Implicit Fields for Unsupervised Learning of Hierarchical Shape Structures
Do Learned Representations Respect Causal Relationships? [supp]
ZebraPose: Coarse To Fine Surface Encoding for 6DoF Object Pose Estimation [supp]
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic [supp]
Learning To Affiliate: Mutual Centralized Learning for Few-Shot Classification [supp]
CAPRI-Net: Learning Compact CAD Shapes With Adaptive Primitive Assembly [supp]
ATPFL: Automatic Trajectory Prediction Model Design Under Federated Learning Framework
Revisiting Learnable Affines for Batch Norm in Few-Shot Transfer Learning
Bridging the Gap Between Classification and Localization for Weakly Supervised Object Localization [supp]
Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation [supp]
3D Moments From Near-Duplicate Photos
Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization [supp]
Blind2Unblind: Self-Supervised Image Denoising With Visible Blind Spots [supp]
Balanced and Hierarchical Relation Learning for One-Shot Object Detection
End-to-End Generative Pretraining for Multimodal Video Captioning [supp]
Delving Deep Into the Generalization of Vision Transformers Under Distribution Shifts
NICE-SLAM: Neural Implicit Scalable Encoding for SLAM [supp]
HyperDet3D: Learning a Scene-Conditioned 3D Object Detector [supp]
Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion [supp]
CLRNet: Cross Layer Refinement Network for Lane Detection [supp]
Cross-Modal Map Learning for Vision and Language Navigation [supp]
Motion-Aware Contrastive Video Representation Learning via Foreground-Background Merging [supp]
Incremental Transformer Structure Enhanced Image Inpainting With Masking Positional Encoding [supp]
Pointly-Supervised Instance Segmentation [supp]
Cross-Modal Clinical Graph Transformer for Ophthalmic Report Generation
Human-Object Interaction Detection via Disentangled Transformer [supp]
DINE: Domain Adaptation From Single and Multiple Black-Box Predictors
LGT-Net: Indoor Panoramic Room Layout Estimation With Geometry-Aware Transformer Network [supp]
CRIS: CLIP-Driven Referring Image Segmentation
Multi-View Mesh Reconstruction With Neural Deferred Shading [supp]
CVF-SID: Cyclic Multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise From Image [supp]
Infrared Invisible Clothing: Hiding From Infrared Detectors at Multiple Angles in Real World [supp]
Distribution-Aware Single-Stage Models for Multi-Person 3D Pose Estimation
FaceFormer: Speech-Driven 3D Facial Animation With Transformers [supp]
Exploring Patch-Wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks [supp]
High-Resolution Face Swapping via Latent Semantics Disentanglement [supp]
Searching the Deployable Convolution Neural Networks for GPUs [supp]
Sparse Local Patch Transformer for Robust Face Alignment and Landmarks Inherent Relation Learning [supp]
DeepFake Disrupter: The Detector of DeepFake Is My Friend [supp]
Rotationally Equivariant 3D Object Detection [supp]
Accelerating DETR Convergence via Semantic-Aligned Matching [supp]
Long-Short Temporal Contrastive Learning of Video Transformers
Vision Transformer With Deformable Attention [supp]
Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture [supp]
Deep Vanishing Point Detection: Geometric Priors Make Dataset Variations Vanish [supp]
RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes
LiT: Zero-Shot Transfer With Locked-Image Text Tuning [supp]
Cloning Outfits From Real-World Images to 3D Characters for Generalizable Person Re-Identification [supp]
GeoNeRF: Generalizing NeRF With Geometry Priors [supp]
ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo [supp]
PhoCaL: A Multi-Modal Dataset for Category-Level Object Pose Estimation With Photometrically Challenging Objects [supp]
Neural Compression-Based Feature Learning for Video Restoration [supp]
Expanding Low-Density Latent Regions for Open-Set Object Detection [supp]
Drop the GAN: In Defense of Patches Nearest Neighbors As Single Image Generative Models
Uformer: A General U-Shaped Transformer for Image Restoration [supp]
Exploring Dual-Task Correlation for Pose Guided Person Image Generation
Portrait Eyeglasses and Shadow Removal by Leveraging 3D Synthetic Data [supp]
Neural Rays for Occlusion-Aware Image-Based Rendering [supp]
Modeling 3D Layout for Group Re-Identification
Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity [supp]
SIOD: Single Instance Annotated per Category per Image for Object Detection [supp]
Toward Fast, Flexible, and Robust Low-Light Image Enhancement [supp]
Online Learning of Reusable Abstract Models for Object Goal Navigation
Bridge-Prompt: Towards Ordinal Action Understanding in Instructional Videos
SimMatch: Semi-Supervised Learning With Similarity Matching
OrphicX: A Causality-Inspired Latent Variable Model for Interpreting Graph Neural Networks [supp]
HandOccNet: Occlusion-Robust 3D Hand Mesh Estimation Network [supp]
EfficientNeRF Efficient Neural Radiance Fields [supp]
Quantifying Societal Bias Amplification in Image Captioning [supp]
Modular Action Concept Grounding in Semantic Video Prediction [supp]
StyleSwin: Transformer-Based GAN for High-Resolution Image Generation [supp]
Reinforced Structured State-Evolution for Vision-Language Navigation
Sub-Word Level Lip Reading With Visual Attention
Weakly Supervised High-Fidelity Clothing Model Generation [supp]
Highly-Efficient Incomplete Large-Scale Multi-View Clustering With Consensus Bipartite Graph [supp]
Towards Principled Disentanglement for Domain Generalization [supp]
Discrete Cosine Transform Network for Guided Depth Map Super-Resolution [supp]
Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing [supp]
E2V-SDE: From Asynchronous Events to Fast and Continuous Video Reconstruction via Neural Stochastic Differential Equations [supp]
评论 1
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值