ICCV 2025 Accepted Papers (一)

ID # 0 # paper title # Privacy-centric Deep Motion Retargeting for Anonymization of Skeleton-Based Motion Visualization

Authors # Thomas Carr · Depeng Xu · Shuhan Yuan · Aidong Lu

ID # 1 # paper title # Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics

Authors # Taowen Wang · Cheng Han · James Liang · Wenhao Yang · Dongfang Liu · Luna Zhang · Qifan Wang · Jiebo Luo · Ruixiang Tang

ID # 2 # paper title # Voyaging into Unbounded Dynamic Scenes from a Single View

Authors # Fengrui Tian · Tianjiao Ding · Jinqi Luo · Hancheng Min · Rene Vidal

ID # 3 # paper title # AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations?

Authors # Shouwei Ruan · Hanqing Liu · Yao Huang · XIaoqi Wang · Caixin KANG · Hang Su · Yinpeng Dong · Xingxing Wei

ID # 4 # paper title # NuPlanQA: A Large-Scale Dataset and Benchmark for Multi-View Driving Scene Understanding in Multi-Modal Large Language Models

Authors # Sung-Yeon Park · Can Cui · Yunsheng Ma · Ahmadreza Moradipari · Rohit Gupta · Kyungtae Han · Ziran Wang

ID # 5 # paper title # TRCE: Towards Reliable Malicious Concept Erasure in Text-to-Image Diffusion Models

Authors # Ruidong Chen · honglin guo · Lanjun Wang · Chenyu Zhang · Weizhi Nie · Anan Liu

ID # 6 # paper title # Training-Free Text-Guided Image Editing with Visual Autoregressive Model

Authors # Yufei Wang · Lanqing Guo · Zhihao Li · Jiaxing Huang · Pichao WANG · Bihan Wen · Jian Wang

ID # 7 # paper title # When Schrödinger Bridge Meets Real-World Image Dehazing with Unpaired Training

Authors # Yunwei Lan · Zhigao Cui · Xin Luo · Chang Liu · Nian Wang · Menglin Zhang · Yanzhao Su · Dong Liu

ID # 8 # paper title # MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

Authors # Shengbang Tong · David Fan · Jiachen Zhu · Yunyang Xiong · Xinlei Chen · Koustuv Sinha · Michael Rabbat · Yann LeCun · Saining Xie · Zhuang Liu

ID # 9 # paper title # Text Embedding Knows How to Quantize Text-Guided Diffusion Models

Authors # Hongjae Lee · Myungjun Son · Dongjea Kang · Seung-Won Jung

ID # 10 # paper title # Feature Decomposition-Recomposition in Large Vision-Language Model for Few-Shot Class-Incremental Learning

Authors # Zongyao Xue · Meina Kan · Shiguang Shan · Xilin Chen

ID # 11 # paper title # Training-Free Industrial Defect Generation with Diffusion Models

Authors # Ruyi Xu · Yen-Tzu Chiu · Tai-I Chen · Oscar Chew · Yung-Yu Chuang · Wen-Huang Cheng

ID # 12 # paper title # Auto-Regressive Transformation for Image Alignment

Authors # Kanggeon Lee · Soochahn Lee · Kyoung Mu Lee

ID # 13 # paper title # UniOcc: A Unified Benchmark for Occupancy Forecasting and Prediction in Autonomous Driving

Authors # Yuping Wang · Xiangyu Huang · Xiaokang Sun · Mingxuan Yan · Shuo Xing · Zhengzhong Tu · Jiachen Li

ID # 14 # paper title # HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration

Authors # Xiyu Zhang · Jiayi Ma · Jianwei Guo · Wei Hu · Zhaoshuai Qi · Fei HUI · Jiaqi Yang · Yanning Zhang

ID # 15 # paper title # Reverse Convolution and Its Applications to Image Restoration

Authors # Xuhong Huang · Shiqi Liu · Kai Zhang · Ying Tai · Jian Yang · Hui Zeng · Lei Zhang

ID # 16 # paper title # SAMO: A Lightweight Sharpness-Aware Approach for Multi-Task Optimization with Joint Global-Local Perturbation

Authors # Hao Ban · Gokul Ram Subramani · Kaiyi Ji

ID # 17 # paper title # Retinex-MEF: Retinex-based Glare Effects Aware Unsupervised Multi-Exposure Image Fusion

Authors # Haowen Bai · Jiangshe Zhang · Zixiang Zhao · Lilun Deng · Yukun Cui · Shuang Xu

ID # 18 # paper title # GeoFormer: Geometry Point Encoder for 3D Object Detection with Graph-based Transformer

Authors # Xin Jin · Haisheng Su · Cong Ma · Kai Liu · Wei Wu · Fei HUI · Junchi Yan

ID # 19 # paper title # PriOr-Flow: Enhancing Primitive Panoramic Optical Flow with Orthogonal View

Authors # Longliang Liu · Miaojie Feng · Junda Cheng · Jijun Xiang · Xuan Zhu · Xin Yang

ID # 20 # paper title # Understanding Personal Concept in Open-Vocabulary Semantic Segmentation

Authors # Sunghyun Park · Jungsoo Lee · Shubhankar Borse · Munawar Hayat · Sungha Choi · Kyuwoong Hwang · Fatih Porikli

ID # 21 # paper title # Lark: Low-Rank updates after knowledge localization for Few-shot Class-Incremental Learning

Authors # Jinxin Shi · Jiabao Zhao · Yifan Yang · Xingjiao Wu · Jiawen Li · Liang He

ID # 22 # paper title # FreeFlux: Understanding and Exploiting Layer-Specific Roles in RoPE-Based MMDiT for Versatile Image Editing

Authors # Tianyi Wei · Yifan Zhou · Dongdong Chen · Xingang Pan

ID # 23 # paper title # INS-MMBench: A Comprehensive Benchmark for Evaluating LVLMs’ Performance in Insurance

Authors # Chenwei Lin · Hanjia Lyu · Xian Xu · Jiebo Luo

ID # 24 # paper title # Statistical Confidence Rescoring for Robust 3D Scene Graph Generation from Multi-View Images

Authors # Qi Xun Yeo · Yanyan Li · Gim Hee Lee

ID # 25 # paper title # Learning Large Motion Estimation from Intermediate Representations with a High-Resolution Optical Flow Dataset Featuring Long-Range Dynamic Motion

Authors # Hoonhee Cho · Yuhwan Jeong · Kuk-Jin Yoon

ID # 26 # paper title # VSP: Diagnosing the Dual Challenges of Perception and Reasoning in Spatial Planning Tasks for MLLMs

Authors # Qiucheng Wu · Handong Zhao · Michael Saxon · Trung Bui · William Yang Wang · Yang Zhang · Shiyu Chang

ID # 27 # paper title # Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning

Authors # Tianyi Zhao · Boyang Liu · Yanglei Gao · Yiming Sun · Maoxun Yuan · Xingxing Wei

ID # 28 # paper title # Scaling and Taming Adversarial Training with Synthetic Data

Authors # Juntao Wu · Xianting Huang · Yu Chen · Shuai Pang · Ke Wang

ID # 29 # paper title # Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering

Authors # Kaixuan Jiang · Yang Liu · Weixing Chen · Jingzhou Luo · Ziliang Chen · Ling Pan · Guanbin Li · Liang Lin

ID # 30 # paper title # DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving

Authors # Xuemeng Yang · Licheng Wen · Tiantian Wei · Yukai Ma · Jianbiao Mei · Xin Li · Wenjie Lei · Daocheng Fu · Pinlong Cai · Min Dou · Liang He · Yong Liu · Botian Shi · Yu Qiao

ID # 31 # paper title # MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment

Authors # Yachun Mi · Yu Li · Weicheng Meng · Chaofeng Chen · Chen Hui · Shaohui Liu

ID # 32 # paper title # Omni-scene Perception-oriented Point Cloud Geometry Enhancement for Coordinate Quantization

Authors # Wang Liu · Wei Gao

ID # 33 # paper title # MorphoGen: Efficient Unconditional Generation of Long-Range Projection Neuronal Morphology via a Global-to-Local Framework

Authors # Tianfang Zhu · Hongyang Zhou · Anan LI

ID # 34 # paper title # SAUCE: Selective Concept Unlearning in Vision-Language Models with Sparse Autoencoders

Authors # Jiahui Geng · Qing Li

ID # 35 # paper title # TikZero: Zero-Shot Text-Guided Graphics Program Synthesis

Authors # Jonas Belouadi · Eddy Ilg · Margret Keuper · Hideki Tanaka · Masao Utiyama · Raj Dabre · Steffen Eger · Simone Paolo Ponzetto

ID # 36 # paper title # DiffSim: Taming Diffusion Models for Evaluating Visual Similarity

Authors # Yiren Song · Xiaokang Liu · Mike Zheng Shou

ID # 37 # paper title # Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens

Authors # Qihang Fan · Huaibo Huang · Mingrui Chen · Ran He

ID # 38 # paper title # FaceXFormer: A Unified Transformer for Facial Analysis

Authors # Kartik Narayan · Vibashan VS · Rama Chellappa · Vishal Patel

ID # 39 # paper title # Learning to Generalize without Bias for Open-Vocabulary Action Recognition

Authors # Yating Yu · Congqi Cao · Yifan Zhang · Yanning Zhang

ID # 40 # paper title # Perceive, Understand and Restore: Real-World Image Super-Resolution with Autoregressive Multimodal Generative Models

Authors # Hongyang Wei · Shuaizheng Liu · Chun Yuan · Lei Zhang

ID # 41 # paper title # IMoRe: Implicit Program-Guided Reasoning for Human Motion Q&A

Authors # Chen Li · Chinthani Sugandhika · Ee Yeo Keat · Eric Peh · Hao Zhang · HONG YANG · Deepu Rajan · Basura Fernando

ID # 42 # paper title # TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Authors # Wenhao Wang · Yi Yang

ID # 43 # paper title # VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE

Authors # Yazhou Xing · Yang Fei · Yingqing He · Jingye Chen · Jiaxin Xie · Xiaowei Chi · Qifeng Chen

ID # 44 # paper title # Exploring View Consistency for Scene-Adaptive Low-Light Light Field Image Enhancement

Authors # Shuo Zhang · Chen Gao · Youfang Lin

ID # 45 # paper title # Unleashing Vectset Diffusion Model for Fast Shape Generation

Authors # Zeqiang Lai · Zhao Yunfei · Zibo Zhao · Haolin Liu · Fu-Yun Wang · Huiwen Shi · Xianghui Yang · Qingxiang Lin · Jingwei Huang · Lliu Yuhong · Jie Jiang · Chunchao Guo · Xiangyu Yue

ID # 46 # paper title # Task-Oriented Human Grasp Synthesis via Context- and Task-Aware Diffusers

Authors # An Lun Liu · Yu-Wei Chao · Yi-Ting Chen

ID # 47 # paper title # Visual Intention Grounding for Egocentric Assistant

Authors # Pengzhan Sun · Junbin Xiao · Tze Ho Elden Tse · Yicong Li · Arjun Akula · Angela Yao

ID # 48 # paper title # GSRecon: Efficient Generalizable Gaussian Splatting for Surface Reconstruction from Sparse Views

Authors # Hang Yang · Le Hui · Jianjun Qian · Jin Xie · Jian Yang

ID # 49 # paper title # VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding

Authors # Minchao Jiang · Shunyu Jia · Jiaming Gu · Xiaoyuan Lu · Guangming Zhu · Anqi Dong · zhang liang

ID # 50 # paper title # VTimeCoT: Thinking by Drawing for Video Temporal Grounding and Reasoning

Authors # Jinglei Zhang · Yuanfan Guo · Rolandos Alexandros Potamias · Jiankang Deng · Hang Xu · Chao Ma

ID # 51 # paper title # EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba

Authors # Quang Nguyen · Nhat Le · Baoru Huang · Minh VU · Chengcheng Tang · Van Nguyen · Ngan Le · Thieu Vo · Anh Nguyen

ID # 52 # paper title # Automated Model Evaluation for Object Detection via Prediction Consistency and Reliablity

Authors # Seungju Yoo · Hyuk Kwon · Joong-Won Hwang · Kibok Lee

ID # 53 # paper title # LazyMAR: Accelerating Masked Autoregressive Models via Feature Caching

Authors # Feihong Yan · qingyan wei · Jiayi Tang · Jiajun Li · Yulin Wang · Xuming Hu · Huiqi Li · Linfeng Zhang

ID # 54 # paper title # RIPE: Reinforcement Learning on Unlabeled Image Pairs for Robust Keypoint Extraction

Authors # Johannes Künzel · Anna Hilsmann · Peter Eisert

ID # 55 # paper title # PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination

Authors # Ming Dai · Wenxuan Cheng · Jiedong Zhuang · Jiang-Jiang Liu · Hongshen Zhao · Zhenhua Feng · Wankou Yang

ID # 56 # paper title # Unified Video Generation via Next-Set Prediction in Continuous Domain

Authors # Zhanzhou Feng · Qingpei Guo · Xinyu Xiao · Ruihan Xu · Ming Yang · Shiliang Zhang

ID # 57 # paper title # G2DG^{2}DG2D: Boosting Multimodal Learning with Gradient-Guided Distillation

Authors # Mohammed Rakib · Arunkumar Bagavathi

ID # 58 # paper title # Text-guided Visual Prompt DINO for Generic Segmentation

Authors # Yuchen Guan · Chong Sun · Canmiao Fu · Zhipeng Huang · Chun Yuan · Chen Li

ID # 59 # paper title # Low-Light Image Enhancement using Event-Based Illumination Estimation

Authors # Lei Sun · Yuhan Bao · Jiajun Zhai · Jingyun Liang · YULUN ZHANG · Kaiwei Wang · Danda Pani Paudel · Luc Gool

ID # 60 # paper title # SAS: Segment Any 3D Scene with Integrated 2D Priors

Authors # Zhuoyuan Li · Jiahao Lu · Jiacheng Deng · Hanzhi Chang · Lifan Wu · Yanzhe Liang · Tianzhu Zhang

ID # 61 # paper title # MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration

Authors # Zhehui Wu · Yong Chen · Naoto Yokoya · Wei He

ID # 62 # paper title # FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors

Authors # Yabo Zhang · xinpeng zhou · Yihan Zeng · Hang Xu · Hui Li · Wangmeng Zuo

ID # 63 # paper title # MMAD: Multi-label Micro-Action Detection in Videos

Authors # Kun Li · pengyu Liu · Dan Guo · Fei Wang · zhiliang wu · Hehe Fan · Meng Wang

ID # 64 # paper title # UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Authors # Haoxuan Wang · Jinlong Peng · Qingdong He · Hao Yang · Ying Jin · Jiafu Wu · Xiaobin Hu · Yanjie Pan · Zhenye Gan · Mingmin Chi · Bo Peng · Yabiao Wang

ID # 65 # paper title # HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection

Authors # Fengzhe Zhou · Humphrey Shi

ID # 66 # paper title # MAVias: Mitigate any Visual Bias

Authors # Ioannis Sarridis · Christos Koutlis · Symeon Papadopoulos · Christos Diou

ID # 67 # paper title # Joint Diffusion Models in Continual Learning

Authors # Paweł Skierś · Kamil Deja

ID # 68 # paper title # Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations

Authors # Xiang Xu · Lingdong Kong · Song Wang · Chuanwei Zhou · Qingshan Liu

ID # 69 # paper title # T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation

Authors # Chieh-Yun Chen · Min Shi · Gong Zhang · Humphrey Shi

ID # 70 # paper title # ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts

Authors # Xiaoqi Wang · Clint Sebastian · Wenbin He · Liu Ren

ID # 71 # paper title # CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models

Authors # Quang-Binh Nguyen · Minh Luu · Quang Nguyen · Anh Tran · Khoi Nguyen

ID # 72 # paper title # Partial Forward Blocking: A Novel Data Pruning Paradigm for Lossless Training Acceleration

Authors # Dongyue Wu · Zilin Guo · Jialong Zuo · Nong Sang · Changxin Gao

ID # 73 # paper title # Less is More: Empowering GUI Agent with Context-Aware Simplification

Authors # Gongwei Chen · Xurui Zhou · Rui Shao · Yibo Lyu · Kaiwen Zhou · Shuai Wang · WenTao Li · Yinchuan Li · Zhongang Qi · Liqiang Nie

ID # 74 # paper title # Background Invariance Testing According to Semantic Proximity

Authors # Zukang Liao · Min Chen

ID # 75 # paper title # Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition

Authors # Jeonghyeok Do · Munchurl Kim

ID # 76 # paper title # Optical Model-Driven Sharpness Mapping for Autofocus in Small Depth-of-Field and Severe Defocus Scenarios

Authors # ChenLiang Fan · Mingpei Cao · Chih Hung · Yuesheng Zhu

ID # 77 # paper title # ScanEdit: Hierarchically-Guided Functional 3D Scan Editing

Authors # Mohamed El Amine Boudjoghra · Ivan Laptev · Angela Dai

ID # 78 # paper title # MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent

Authors # Xinyao Liao · Xianfang Zeng · Liao Wang · Gang YU · Guosheng Lin · Chi Zhang

ID # 79 # paper title # MBTI: Masked Blending Transformers with Implicit Positional Encoding for Frame-rate Agnostic Motion Estimation

Authors # Jungwoo Huh · Yeseung Park · Seongjean Kim · Jungsu Kim · Sanghoon Lee

ID # 80 # paper title # DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion

Authors # Maksim Siniukov · Di Chang · Minh Tran · Hongkun Gong · Ashutosh Chaubey · Mohammad Soleymani

ID # 81 # paper title # PUMPS: Skeleton-Agnostic Point-based Universal Motion Pre-Training for Synthesis in Human Motion Tasks

Authors # Clinton A Mo · Kun Hu · Chengjiang Long · Dong Yuan · Wan-Chi Siu · Zhiyong Wang

ID # 82 # paper title # Find Any Part in 3D

Authors # Ziqi Ma · Yisong Yue · Georgia Gkioxari

ID # 83 # paper title # Inpaint4Drag: Drag-based Image Editing via Bidirectional Warping and Inpainting

Authors # Jingyi Lu · Kai Han

ID # 84 # paper title # AutoComPose: Automatic Generation of Pose Transition Descriptions for Composed Pose Retrieval Using Multimodal LLMs

Authors # Yi-Ting Shen · Sungmin Eum · Doheon Lee · Rohit Shete · Chiao-Yi Wang · Heesung Kwon · Shuvra Bhattacharyya

ID # 85 # paper title # Revelio\textit{Revelio}Revelio: Interpreting and leveraging semantic information in diffusion models

Authors # Dahye Kim · Xavier Thomas · Deepti Ghadiyaram

ID # 86 # paper title # From Enhancement to Understanding: Build a Generalized Bridge for Low-light Vision via Semantically Consistent Unsupervised Fine-tuning

Authors # Sen Wang · Shao Zeng · Tianjun Gu · zhizhong zhang · Ruixin Zhang · Shouhong Ding · Jingyun Zhang · Jun Wang · Xin TAN · Yuan Xie · Lizhuang Ma

ID # 87 # paper title # Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding

Authors # Yue Fan · Xiaojian Ma · Rongpeng Su · Jun Guo · Rujie Wu · Xi Chen · Qing Li

ID # 88 # paper title # DreamCube: RGB-D Panorama Generation via Multi-plane Synchronization

Authors # Yukun Huang · Yanning Zhou · Jianan Wang · Kaiyi Huang · Xihui Liu

ID # 89 # paper title # Latent Diffusion Models with Masked AutoEncoders

Authors # Junho Lee · Jeongwoo Shin · Hyungwook Choi · Joonseok Lee

ID # 90 # paper title # ALOcc: Adaptive Lifting-based 3D Semantic Occupancy and Cost Volume-based Flow Predictions

Authors # Dubing Chen · Jin Fang · Wencheng Han · Xinjing Cheng · Junbo Yin · Cheng-zhong Xu · Fahad Khan · Jianbing Shen

ID # 91 # paper title # GameFactory: Creating New Games with Generative Interactive Videos

Authors # Jiwen Yu · Yiran Qin · Xintao Wang · Pengfei Wan · Di ZHANG · Xihui Liu

ID # 92 # paper title # From Gaze to Movement: Predicting Visual Attention for Autonomous Driving Human-Machine Interaction based on Programmatic Imitation Learning

Authors # Yexin Huang · Yongbin Lin · Lishengsa Yue · Zhihong Yao · Jie Wang

ID # 93 # paper title # Generative Zoo

Authors # Tomasz Niewiadomski · Anastasios Yiannakidis · Hanz Cuevas Velasquez · Soubhik Sanyal · Michael Black · Silvia Zuffi · Peter Kulits

ID # 94 # paper title # Event-Driven Storytelling with Multiple Lifelike Humans in a 3D scene

Authors # Donggeun Lim · Jinseok Bae · Inwoo Hwang · Seungmin Lee · Hwanhee Lee · Young Kim Kim

ID # 95 # paper title # DreamFuse: Adaptive Image Fusion with Diffusion Transformer

Authors # Junjia Huang · Pengxiang Yan · Jiyang Liu · Jie Wu · Zhao Wang · Yitong Wang · Liang Lin · Guanbin Li

ID # 96 # paper title # Secure On-Device Video OOD Detection Without Backpropagation

Authors # Li Li · Peilin Cai · Yuxiao Zhou · Zhiyu Ni · Renjie Liang · QIN YOU · Yi Nian · Zhengzhong Tu · Xiyang Hu · Yue Zhao

ID # 97 # paper title # Towards Fine-grained Interactive Segmentation in Images and Videos

Authors # Yuan Yao · Qiushi Yang · Miaomiao Cui · Liefeng Bo

ID # 98 # paper title # PhysRig: Differentiable Physics-Based Skinning and Rigging Framework for Realistic Articulated Object Modeling

Authors # Hao Zhang · Haolan Xu · Chun Feng · Varun Jampani · Narendra Ahuja

ID # 99 # paper title # From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Authors # Le Zhuo · Liangbing Zhao · Sayak Paul · Yue Liao · Renrui Zhang · Yi Xin · Peng Gao · Mohamed Elhoseiny · Hongsheng Li

ID # 100 # paper title # PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions

Authors # Mahesh Bhosale · Abdul Wasi · Yuanhao Zhai · Yunjie Tian · Samuel Border · Nan Xi · Pinaki Sarder · Junsong Yuan · David Doermann · Xuan Gong

ID # 101 # paper title # From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition

Authors # Ling Lo · Kelvin Chan · Wen-Huang Cheng · Ming-Hsuan Yang

ID # 102 # paper title # Multi-Modal Few-Shot Temporal Action Segmentation

Authors # Zijia Lu · Ehsan Elhamifar

ID # 103 # paper title # Future-Aware Interaction Network For Motion Forecasting

Authors # Shijie Li · Chunyu Liu · Xun Xu · Si Yong Yeo · Xulei Yang

ID # 104 # paper title # VideoSetBench: Identifying and Reasoning Similarities and Differences in Similar Videos

Authors # YUE QIU · Yanjun Sun · Takuma Yagi · Shusaku Egami · Natsuki Miyata · Ken Fukuda · Kensho Hara · Ryusuke Sagawa

ID # 105 # paper title # Haze_x0008_Flow: Revisit Haze Physical Model as ODE and Realistic Non-Homogeneous Haze Generation for Real-World Dehazing

Authors # Junseong Shin · Seungwoo Chung · Yunjeong Yang · Tae Hyun Kim

ID # 106 # paper title # Online Generic Event Boundary Detection

Authors # Hyung Rok Jung · Daneul Kim · Seunggyun Lim · Jeany Son · Jonghyun Choi

ID # 107 # paper title # Animate Anyone 2: High-Fidelity Character Image Animation with Environment Affordance

Authors # Li Hu · wang yuan · Zhen Shen · Xin Gao · Dechao Meng · Li’an Zhuo · Peng Zhang · Bang Zhang · Liefeng Bo

ID # 108 # paper title # CHROME: Clothed Human Reconstruction with Occlusion-Resilience and Multiview-Consistency from a Single Image

Authors # Arindam Dutta · Meng Zheng · Zhongpai Gao · Benjamin Planche · Anwesa Choudhuri · Terrence Chen · Amit Roy-Chowdhury · Ziyan Wu

ID # 109 # paper title # SAFER: Sharpness Aware layer-selective Finetuning for Enhanced Robustness in vision transformers

Authors # Bhavna Gopal · Huanrui Yang · Mark Horton · Yiran Chen

ID # 110 # paper title # Flow Stochastic Segmentation Networks

Authors # Fabio De Sousa Ribeiro · Omar Todd · Charles Jones · Avinash Kori · Raghav Mehta · Ben Glocker

ID # 111 # paper title # Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Authors # Weiming Ren · Wentao Ma · Huan Yang · Cong Wei · Ge Zhang · Wenhu Chen

ID # 112 # paper title # VLDrive: Vision-Augmented Lightweight MLLMs for Efficient Language-grounded Autonomous Driving

Authors # Ruifei Zhang · Wei Zhang · Xiao Tan · Sibei Yang · Xiang Wan · Xiaonan Luo · Guanbin Li

ID # 113 # paper title # Instance-Level Video Depth in Groups Beyond Occlusions

Authors # Yuan Liang · Yang Zhou · Ziming Sun · Tianyi Xiang · Guiqing Li · Shengfeng He

ID # 114 # paper title # Wasserstein Style Distribution Analysis and Transform for Stylized Image Generation

Authors # Xi Yu · Xiang Gu · Zhihao Shi · Jian Sun

ID # 115 # paper title # SynAD: Enhancing Real-World End-to-End Autonomous Driving Models through Synthetic Data Integration

Authors # Jongsuk Kim · Jae Young Lee · Gyojin Han · Dong-Jae Lee · Minki Jeong · Junmo Kim

ID # 116 # paper title # Adversarial Robust Memory-Based Continual Learner

Authors # Xiaoyue Mi · Fan Tang · Zonghan Yang · Danding Wang · Juan Cao · Peng Li · Yang Liu

ID # 117 # paper title # Open-World Skill Discovery from Unsegmented Demonstration Videos

Authors # Jingwen Deng · Zihao Wang · Shaofei Cai · Anji Liu · Yitao Liang

ID # 118 # paper title # InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction

Authors # Yuhui WU · Liyi Chen · Ruibin Li · Shihao Wang · Chenxi Xie · Lei Zhang

ID # 119 # paper title # Towards Open-World Generation of Stereo Images and Unsupervised Matching

Authors # Feng Qiao · Zhexiao Xiong · Eric Xing · Nathan Jacobs

ID # 120 # paper title # Certifiably Optimal Anisotropic Rotation Averaging

Authors # Carl Olsson · Yaroslava Lochman · Johan Malmport · Christopher Zach

ID # 121 # paper title # OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images

Authors # Ziyue Huang · Yongchao Feng · Ziqi Liu · Shuai Yang · Qingjie Liu · Yunhong Wang

ID # 122 # paper title # Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles

Authors # Eric Slyman · Mehrab Tanjim · Kushal Kafle · Stefan Lee

ID # 123 # paper title # Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing

Authors # Yongxin Guo · Lin Wang · Xiaoying Tang · Tao Lin

ID # 124 # paper title # Monocular Semantic Scene Completion via Masked Recurrent Networks

Authors # Xuzhi Wang · Xinran Wu · Song Wang · Lingdong Kong · Ziping Zhao

ID # 125 # paper title # RayZer: A Self-supervised Large View Synthesis Model

Authors # Hanwen Jiang · Hao Tan · Peng Wang · Haian Jin · Yue Zhao · Sai Bi · Kai Zhang · Fujun Luan · Kalyan Sunkavalli · Qixing Huang · Georgios Pavlakos

ID # 126 # paper title # Enhanced Pansharpening via Quaternion Spatial-Spectral Interactions

Authors # Dong Li · Chunhui Luo · Yuanfei Bao · Gang Yang · Jie Xiao · Xueyang Fu · Zheng-Jun Zha

ID # 127 # paper title # InfiniDreamer: Arbitrarily Long Human Motion Generation via Segment Score Distillation

Authors # Wenjie Zhuo · Fan Ma · Hehe Fan

ID # 128 # paper title # Focal Plane Visual Feature Generation and Matching on a Pixel Processor Array

Authors # Hongyi Zhang · Laurie Bose · Jianing Chen · Piotr Dudek · Walterio Mayol-Cuevas

ID # 129 # paper title # Measuring the Impact of Rotation Equivariance on Aerial Object Detection

Authors # Xiuyu Wu · Xinhao Wang · Xiubin Zhu · Lan Yang · Jiyuan Liu · Xingchen Hu

ID # 130 # paper title # MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network

Authors # Jianfei Jiang · Qiankun Liu · Haochen Yu · Hongyuan Liu · Liyong Wang · Jiansheng Chen · Huimin Ma

ID # 131 # paper title # LACONIC: A 3D Layout Adapter for Controllable Image Creation

Authors # Léopold Maillard · Tom Durand · Adrien RAMANANA RAHARY · Maks Ovsjanikov

ID # 132 # paper title # Learning 3D Object Spatial Relationships from Pre-trained 2D Diffusion Models

Authors # Sangwon Baik · Hyeonwoo Kim · Hanbyul Joo

ID # 133 # paper title # CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling

Authors # Trong-Thang Pham · AKASH AWASTHI · Saba Khan · Esteban Marti · Tien-Phat Nguyen · Khoa Vo · Minh Tran · Ngoc Son Nguyen · Cuong Van · Yuki Ikebe · Anh Nguyen · Anh Nguyen · Zhigang Deng · Carol Wu · Hien Nguyen · Ngan Le

ID # 134 # paper title # X-Capture: An Open-Source Portable Device for Multi-Sensory Learning

Authors # Samuel Clarke · Suzannah Wistreich · Yanjie Ze · Jiajun Wu

ID # 135 # paper title # SAGI: Semantically Aligned and Uncertainty Guided AI Image Inpainting

Authors # Paschalis Giakoumoglou · Dimitrios Karageorgiou · Symeon Papadopoulos · Panagiotis Petrantonakis

ID # 136 # paper title # Enhancing Image Restoration Transformer via Adaptive Translation Equivariance

Authors # JiaKui Hu · Zhengjian Yao · Lujia Jin · Hangzhou He · Yanye Lu

ID # 137 # paper title # Region-aware Anchoring Mechanism for Efficient Referring Visual Grounding

Authors # Shuyi Ouyang · Ziwei Niu · Hongyi Wang · Yen-wei Chen · Lanfen Lin

ID # 138 # paper title # Turbo2K: Towards Ultra-Efficient and High-Quality 2K Video Synthesis

Authors # Jingjing Ren · Wenbo Li · Zhongdao Wang · Haoze Sun · Bangzhen Liu · Haoyu Chen · Jiaqi Xu · Aoxue Li · Shifeng Zhang · Bin Shao · Yong Guo · Lei Zhu

ID # 139 # paper title # A Token-level Text Image Foundation Model for Document Understanding

Authors # Tongkun Guan · Zining Wang · Pei Fu · Zhentao Guo · Wei Shen · Kai zhou · Tiezhu Yue · Chen Duan · Hao Sun · Qianyi Jiang · Junfeng Luo · Xiaokang Yang

ID # 140 # paper title # Improving Noise Efficiency in Privacy-preserving Dataset Distillation

Authors # Runkai Zheng · Vishnu Dasu · Yinong Wang · Haohan Wang · Fernando De la Torre

ID # 141 # paper title # Vision-Language Neural Graph Featurization for Extracting Retinal Lesions

Authors # Taimur Hassan · Anabia Sohail · Muzammal Naseer · Naoufel Werghi

ID # 142 # paper title # DepthSync: Diffusion Guidance-Based Depth Synchronization for Scale- and Geometry-Consistent Video Depth Estimation

Authors # Yue-Jiang Dong · Wang Zhao · Jiale Xu · Ying Shan · Song-Hai Zhang

ID # 143 # paper title # VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models

Authors # JIACHENG RUAN · Wenzhen Yuan · Xian Gao · Ye Guo · Daoxin Zhang · Zhe Xu · Yao Hu · Ting Liu · yuzhuo fu

ID # 144 # paper title # LDIP: Long Distance Information Propagation for Video Super-Resolution

Authors # Michael Bernasconi · Abdelaziz Djelouah · Yang Zhang · Markus Gross · Christopher Schroers

ID # 145 # paper title # Blind Noisy Image Deblurring Using Residual Guidance Strategy

Authors # heyan liu · Jianing Sun · Jun Liu · Xi-Le Zhao · Tingting WU · Tieyong Zeng

ID # 146 # paper title # Who is a Better Talker: Subjective and Objective Quality Assessment for AI-Generated Talking Heads

Authors # Yingjie Zhou · Jiezhang Cao · Zicheng Zhang · Farong Wen · Jiang Yanwei · Jun Jia · Xiaohong Liu · Xiongkuo Min · Guangtao Zhai

ID # 147 # paper title # Hierarchical Variational Test-Time Prompt Generation for Zero-Shot Generalization

Authors # Zhaoyang Wu · Fang Liu · Licheng Jiao · Shuo Li · Lingling Li · Xu Liu · Puhua Chen · wenping ma

ID # 148 # paper title # End-to-End Entity-Predicate Association Reasoning for Dynamic Scene Graph Generation

Authors # LiWei Wang · YanDuo Zhang · Tao Lu · Fang Liu · Huiqin Zhang · Jiayi Ma · Huabing Zhou

ID # 149 # paper title # TurboVSR: Fantastic Video Upscalers and Where to Find Them

Authors # Zhongdao Wang · Guodongfang Zhao · Jingjing Ren · bailan feng · Shifeng Zhang · Wenbo Li

ID # 150 # paper title # STDDNet: Harnessing Mamba for Video Polyp Segmentation via Spatial-aligned Temporal Modeling and Discriminative Dynamic Representation Learning

Authors # Guilian Chen · Huisi Wu · Jing Qin

ID # 151 # paper title # Where am I? Cross-View Geo-localization with Natural Language Descriptions

Authors # Junyan Ye · Honglin Lin · Leyan Ou · Dairong Chen · Zihao Wang · Qi Zhu · Conghui He · Weijia Li

ID # 152 # paper title # ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation

Authors # Guosheng Zhao · Xiaofeng Wang · Chaojun Ni · Zheng Zhu · Wenkang Qin · Guan Huang · Xingang Wang

ID # 153 # paper title # WorldScore: Unified Evaluation Benchmark for World Generation

Authors # Haoyi Duan · Hong-Xing Yu · Sirui Chen · Li Fei-Fei · Jiajun Wu

ID # 154 # paper title # Diffusion-based Source-biased Model for Single Domain Generalized Object Detection

Authors # Jiang Han · Wenfei Yang · Tianzhu Zhang · Yongdong Zhang

ID # 155 # paper title # EventUPS: Uncalibrated Photometric Stereo Using an Event Camera

Authors # Jinxiu Liang · Bohan Yu · Siqi Yang · Haotian Zhuang · Jieji Ren · Peiqi Duan · Boxin Shi

ID # 156 # paper title # The Curse of Conditions: Analyzing and Improving Optimal Transport for Conditional Flow-Based Generation

Authors # Ho Kei Cheng · Alex Schwing

ID # 157 # paper title # Riemannian-Geometric Fingerprints of Generative Models

Authors # Hae Jin Song · Laurent Itti

ID # 158 # paper title # Towards a 3D Transfer-based Black-box Attack via Critical Feature Guidance

Authors # Shuchao Pang · Zhenghan Chen · Shen Zhang · Liming Lu · Siyuan Liang · Anan Du · Yongbin Zhou

ID # 159 # paper title # Constraint-Aware Feature Learning for Parametric Point Cloud

Authors # Xi Cheng · Ruiqi Lei · Di Huang · Zhichao Liao · Fengyuan Piao · Yan Chen · Pingfa Feng · Long ZENG

ID # 160 # paper title # PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Authors # Kwanyoung Kim · Byeongsu Sim

ID # 161 # paper title # Taming the Untamed: Graph-Based Knowledge Retrieval and Reasoning for MLLMs to Conquer the Unknown

Authors # Bowen Wang · Zhouqiang Jiang · Yasuaki Susumu · Shotaro Miwa · Tianwei Chen · Yuta Nakashima

ID # 162 # paper title # CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval

Authors # Zelong Sun · Dong Jing · Zhiwu Lu

ID # 163 # paper title # V2M4: 4D Mesh Animation Reconstruction from a Single Monocular Video

Authors # Jianqi Chen · Biao Zhang · Xiangjun Tang · Peter Wonka

ID # 164 # paper title # PolarAnything: Diffusion-based Polarimetric Image Synthesis

Authors # Kailong Zhang · Youwei Lyu · Heng Guo · Si Li · Zhanyu Ma · Boxin Shi

ID # 165 # paper title # Polarimetric Neural Field with Unified Complex-Valued Wavefunction

Authors # Chu Zhou · Yixin Yang · Junda Liao · Heng Guo · Boxin Shi · Imari Sato

ID # 166 # paper title # RAGDiffusion: Faithful Cloth Generation via External Knowledge Assimilation

Authors # Yuhan Li · Xianfeng Tan · Wenxiang Shang · Yubo Wu · Jian Wang · Xuanhong Chen · Yi Zhang · Zhu Hangcheng · Bingbing Ni

ID # 167 # paper title # Exploring Multimodal Diffusion Transformers for Enhanced Prompt-based Image Editing

Authors # Joonghyuk Shin · Alchan Hwang · Yujin Kim · Daneul Kim · Jaesik Park

ID # 168 # paper title # Long-Context State-Space Video World Models

Authors # Ryan Po · Yotam Nitzan · Richard Zhang · Berlin Chen · Tri Dao · Eli Shechtman · Gordon Wetzstein · Xun Huang

ID # 169 # paper title # Open-ended Hierarchical Streaming Video Understanding with Vision Language Models

Authors # Hyolim Kang · YUNSU PARK · Youngbeom Yoo · Yeeun Choi · Seon Joo Kim

ID # 170 # paper title # Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection

Authors # Yupeng Hu · Changxing Ding · Chang Sun · Shaoli Huang · Xiangmin Xu

ID # 171 # paper title # DisenQ: Disentangling Q-Former for Activity-Biometrics

Authors # Shehreen Azad · Yogesh Rawat

ID # 172 # paper title # InterSyn: Interleaved Learning for Dynamic Motion Synthesis in the Wild

Authors # Yiyi Ma · Yuanzhi Liang · Xiu Li · Chi Zhang · Xuelong Li

ID # 173 # paper title # Epipolar Consistent Attention Aggregation Network for Unsupervised Light Field Disparity Estimation

Authors # Chen Gao · Shuo Zhang · Youfang Lin

ID # 174 # paper title # GARF: Learning Generalizable 3D Reassembly for Real-World Fractures

Authors # Sihang Li · Zeyu Jiang · Grace Chen · Chenyang Xu · Siqi Tan · Xue Wang · Irving Fang · Kristof Zyskowski · Shannon McPherron · Radu Iovita · Chen Feng · Jing Zhang

ID # 175 # paper title # AerialVG: A Challenging Benchmark for Aerial Visual Grounding by Exploring Positional Relations

Authors # Junli Liu · Qizhi Chen · Zhigang Wang · Yiwen Tang · Yiting Zhang · Chi Yan · Dong Wang · Xuelong Li · Bin Zhao

ID # 176 # paper title # Diving into the Fusion of Monocular Priors for Generalized Stereo Matching

Authors # Chengtang Yao · Lidong Yu · Zhidan Liu · Jiaxi Zeng · Yuwei Wu · Yunde Jia

ID # 177 # paper title # DICE: Staleness-Centric Optimizations for Parallel Diffusion MoE Inference

Authors # Jiajun Luo · Lizhuo Luo · Jianru Xu · Jiajun Song · Rongwei Lu · Chen Tang · Zhi Wang

ID # 178 # paper title # Emulating Self-attention with Convolution for Efficient Image Super-Resolution

Authors # Dongheon Lee · Seokju Yun · Youngmin Ro

ID # 179 # paper title # BlinkTrack: Feature Tracking over 80 FPS via Events and Images

Authors # Yichen Shen · Yijin Li · Shuo Chen · Guanglin Li · Zhaoyang Huang · Hujun Bao · Zhaopeng Cui · Guofeng Zhang

ID # 180 # paper title # GeoDistill: Geometry-Guided Self-Distillation for Weakly Supervised Cross-View Localization

Authors # Shaowen Tong · Zimin Xia · Alexandre Alahi · Xuming He · Yujiao Shi

ID # 181 # paper title # Image-Guided Shape-from-Template Using Mesh Inextensibility Constraints

Authors # Dinh-Vinh-Thuy Tran · Ruochen Chen · Shaifali Parashar

ID # 182 # paper title # Sequential Gaussian Avatars with Hierarchical Motion Context

Authors # Wangze Xu · Yifan Zhan · Zhihang Zhong · Xiao Sun

ID # 183 # paper title # Attention to the Burtiness in Visual Prompt Tuning!

Authors # Yuzhu Wang · Manni Duan · Shu Kong

ID # 184 # paper title # Federated Representation Angle Learning

Authors # Liping Yi · Han Yu · Gang Wang · xiaoguang Liu · Xiaoxiao Li

ID # 185 # paper title # InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models

Authors # Cong Wei · Yujie Zhong · yingsen zeng · Haoxian Tan · Yong Liu · Hongfa Wang · Yujiu Yang

ID # 186 # paper title # How To Make Your Cell Tracker Say “I dunno!”

Authors # Richard D Paul · Johannes Seiffarth · David Rügamer · Hanno Scharr · Katharina Nöh

ID # 187 # paper title # Visual Surface Wave Tomography: Revealing Subsurface Physical Properties via Visible Surface Waves

Authors # Alexander Ogren · Berthy Feng · Jihoon Ahn · Katherine Bouman · Chiara Daraio

ID # 188 # paper title # Mind the Cost of Scaffold! Benign Clients May Even Become Accomplices of Backdoor Attack

Authors # Xingshuo Han · Xuanye Zhang · Xiang Lan · Haozhao Wang · Shengmin Xu · Shen Ren · Jason Zeng · Ming Wu · Michael Heinrich · Tianwei Zhang

ID # 189 # paper title # YOLOE: Real-Time Seeing Anything

Authors # Ao Wang · Lihao Liu · Hui Chen · Zijia Lin · Jungong Han · Guiguang Ding

ID # 190 # paper title # SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation

Authors # Shiqi Huang · Shuting He · Huaiyuan Qin · Bihan Wen

ID # 191 # paper title # MinCD-PnP: Learning 2D-3D Correspondences with Approximate Blind PnP

Authors # Pei An · Jiaqi Yang · Muyao Peng · You Yang · Qiong Liu · Xiaolin Wu · Liangliang Nan

ID # 192 # paper title # SegmentDreamer: Towards High-fidelity Text-to-3D Synthesis with Segmented Consistency Trajectory Distillation

Authors # Jiahao Zhu · Zixuan Chen · Guangcong Wang · Xiaohua Xie · Yi Zhou

ID # 193 # paper title # CWNet: Causal Wavelet Network for Low-Light Image Enhancement

Authors # Tongshun Zhang · Pingping Liu · Yubing Lu · Mengen Cai · Zijian Zhang · Zhe Zhang · Qiuzhan Zhou

ID # 194 # paper title # MGSR: 2D/3D Mutual-boosted Gaussian Splatting for High-fidelity Surface Reconstruction under Various Light Conditions

Authors # Qingyuan Zhou · Yuehu Gong · Weidong Yang · Jiaze Li · Yeqi Luo · Baixin Xu · Shuhao Li · Ben Fei · Ying He

ID # 195 # paper title # Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling

Authors # Zenghao Niu · Weicheng Xie · Siyang Song · Zitong YU · Feng Liu · Linlin Shen

ID # 196 # paper title # Streamlining Image Editing with Layered Diffusion Brushes

Authors # Peyman Gholami · Robert Xiao

ID # 197 # paper title # GroundFlow: A Plug-in Module for Temporal Reasoning on 3D Point Cloud Sequential Grounding

Authors # Zijun Lin · Shuting He · Cheston Tan · Bihan Wen

ID # 198 # paper title # FedMVP: Federated Multimodal Visual Prompt Tuning for Vision-Language Models

Authors # Mainak Singha · Subhankar Roy · Sarthak Mehrotra · Ankit Jha · Moloud Abdar · Biplab Banerjee · Elisa Ricci

ID # 199 # paper title # QK-Edit: Revisiting Attention-based Injection in MM-DiT for Image and Video Editing

Authors # Tiancheng SHEN · Jun Hao Liew · Zilong Huang · Xiangtai Li · Zhijie Lin · Jiyang Liu · Yitong Wang · Jiashi Feng · Ming-Hsuan Yang

ID # 200 # paper title # MotionShot: Adaptive Motion Transfer across Arbitrary Objects for Text-to-Video Generation

Authors # Yanchen Liu · Yanan SUN · Zhening Xing · Junyao Gao · Kai Chen · Wenjie Pei

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值