来源: AINLPer公众号(每日干货分享!!)
编辑: ShuYini
校稿: ShuYini
时间: 2025-3-6
引言
AAAI2025会议于2025年2月25日至3月4日在美国宾夕法尼亚州费城拉开帷幕,为期8天(刚刚过去几天),本次会议共收到 12,957 篇有效投稿,录用 3,032 篇,录取率为 23.4%,其中 Oral 论文占比 4.6%。
下面是作者整理的论文接受列表,因平台限制不能给出每篇论文的连接。如果有需要,欢迎关注 AINLPer公众号 回复:AAAI2025 获取。
论文接受列表
1、Structural Entropy Guided Unsupervised Graph Out-Of-Distribution Detection
2、Path-Adaptive Matting for Efficient Inference Under Various Computational Cost Constraints
3、Out-of-Distribution Generalization on Graphs via Progressive Inference
4、Zero-Shot Complex Question-Answering on Long Scientific Documents
5、dyAb: Flow Matching for Flexible Antibody Design with AlphaFold-driven Pre-binding Antigen
6、Self-attention-based Diffusion Model for Time-series Imputation in Partial Blackout Scenarios
7、Stability-based Generalization Analysis of Randomized Coordinate Descent for Pairwise Learning
8、The Complexity of Extending Fair Allocations of Indivisible Goods
9、Enhancing Non-English Capabilities of English-Centric Large Language Models through Deep Supervision Fine-Tuning
10、Differentiable Information Enhanced Model-Based Reinforcement Learning
11、Beyond QA Pairs: Assessing Parameter-Efficient Fine-Tuning for Fact Embedding in LLMs
12、LLM-Fusion: A Novel Multimodal Fusion Model for Accelerated Material Discovery
13、IteRPrimE: Zero-shot Referring Image Segmentation with Iterative Grad-CAM Refinement and Primary Word Emphasis
14、Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking
15、Adversarial Attacks on Event-Based Pedestrian Detectors: A Physical Approach
16、Structured Reasoning for Fairness: A Multi-Agent Approach to Bias Detection in Textual Data
17、Robust Multi-Objective Preference Alignment with Online DPO
18、Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow
19、Few-Shot, No Problem: Descriptive Continual Relation Extraction
20、Walking the Web of Concept-Class Relationships in Incrementally Trained Interpretable Models
21、Efficient Gaussian Splatting for Monocular Dynamic Scene Rendering via Sparse Time-Variant Attribute Modeling
22、Constrained Generative Modeling with Manually Bridged Diffusion Models
23、QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects
24、Lightweight Contrastive Distilled Hashing for Online Cross-modal Retrieval
25、Noise-Injected Spiking Graph Convolution for Energy-Efficient 3D Point Cloud Denoising
26、Improving Representation Learning of Complex Critical Care Data with ICU-BERT
27、Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing
28、Langevin Multiplicative Weights Update with Applications in Polynomial Portfolio Management
29、Multi-Agent Security Tax: Trading Off Security and Collaboration Capabilities in Multi-Agent Systems
30、TabGLM: Tabular Graph Language Model for Learning Transferable Representations Through Multi-Modal Consistency Minimization
31、Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition
32、Mechanistic Understanding of Language Models in Syntactic Code Completion
33、Disrupt Your Research Using Generative AI Powered ScienceSage
34、TSKANMixer: Kolmogorov-Arnold Networks with MLP-Mixer Model for Time Series Forecasting
35、The Gradient of Algebraic Model Counting
36、MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning
37、Iterative Counterfactual Data Augmentation
38、Implicit Word Reordering with Knowledge Distillation for Cross-Lingual Dependency Parsing
39、Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing
40、A Framework for Evaluating Vision-Language Model Safety: Building Trust in AI for Public Sector Applications
41、TimePFN: Effective Multivariate Time Series Forecasting with Synthetic Data
42、SalM
2
^{2}
2: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention
43、BiDeV: Bilateral Defusing Verification for Complex Claim Fact-Checking
44、Destroy and Repair Using Hyper Graphs for Routing
45、Heterogeneous Multi-Agent Bandits with Parsimonious Hints
46、Together We Rise: Optimizing Real-Time Multi-Robot Task Allocation using Coordinated Heterogeneous Plays
47、A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier Models
48、GenAI at the Edge: Comprehensive Survey on Empowering Edge Devices
49、Spiking Point Transformer for Point Cloud Classification
50、Data Wrangling Task Automation Using Code-Generating Language Models
51、Evaluate with the Inverse: Efficient Approximation of Latent Explanation Quality Distribution
52、M2LADS Demo: A System for Generating Multimodal Learning Analytics Dashboards
53、GeoAggregator: An Efficient Transformer Model for Geo-Spatial Tabular Data
54、Interpreting Adversarial Attacks and Defences using Architectures with Enhanced Interpretability
55、A Socratic RAG Approach to Connect Natural Language Queries on Research Topics with Knowledge Organization Systems
56、Evaluating Precise Geolocation Inference Capabilities of Vision Language Models
57、Enhancing Portuguese Variety Identification with Cross-Domain Approaches
58、Tradutor: Building a Variety Specific Translation Model
59、SPRIG: Stackelberg Perception-Reinforcement Learning with Internal Game Dynamics
60、Retrieving Versus Understanding Extractive Evidence in Few-Shot Learning
61、DiffExp: Efficient Exploration in Reward Fine-tuning for Text-to-Image Diffusion Models
62、Rectified Lagrangian for Out-of-Distribution Detection in Modern Hopfield Networks
63、A Label-Free Heterophily-Guided Approach for Unsupervised Graph Fraud Detection
64、SpeHeatal: A Cluster-Enhanced Segmentation Method for Sperm Morphology Analysis
65、Fake It Till You Make It: Using Synthetic Data and Domain Knowledge for Improved Text-Based Learning for LGE Detection
66、TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
67、TimeCAP: Learning to Contextualize, Augment, and Predict Time Series Events with Large Language Model Agents
68、Open-Set Cross-Network Node Classification via Unknown-Excluded Adversarial Graph Domain Alignment
69、LLM-driven Knowledge Distillation for Dynamic Text-Attributed Graphs
70、REGNav: Room Expert Guided Image-Goal Navigation
71、Disentangle Nighttime Lens Flares: Self-supervised Generation-based Lens Flare Removal
72、Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering
73、A Multiagent Path Search Algorithm for Large-Scale Coalition Structure Generation
74、Navigating Label Ambiguity for Facial Expression Recognition in the Wild
75、MuDoC: An Interactive Multimodal Document-grounded Conversational AI System
76、Rolling Ahead Diffusion for Traffic Scene Simulation
77、Prior-Constrained Association Learning for Fine-Grained Generalized Category Discovery
78、Improve LLM-based Automatic Essay Scoring with Linguistic Features
79、Two-Stage Representation Learning for Analyzing Movement Behavior Dynamics in People Living with Dementia
80、Large Images are Gaussians: High-Quality Large Image Representation with Levels of 2D Gaussian Splatting
81、Generalized Class Discovery in Instance Segmentation
82、Counterexample Guided Program Repair Using Zero-Shot Learning and MaxSAT-based Fault Localization
83、Unsupervised Translation of Emergent Communication
84、Don’t Just Demo, Teach Me the Principles: A Principle-Based Multi-Agent Prompting Strategy for Text Classification
85、A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks
86、Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior
87、UniDemoiré: Towards Universal Image Demoiréing with Data Generation and Synthesis
88、K-ON: Stacking Knowledge On the Head Layer of Large Language Model
89、Towards Efficient and Intelligent Laser Weeding: Method and Dataset for Weed Stem Detection
90、Integrating Sequence and Image Modeling in Irregular Medical Time Series Through Self-Supervised Learning
91、Enhancing Document Key Information Localization Through Data Augmentation
92、Verifying Proportionality in Temporal Voting
93、Fast Omni-Directional Image Super-Resolution: Adapting the Implicit Image Function with Pixel and Semantic-Wise Spherical Geometric Priors
94、MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation
95、3CAD: A Large-Scale Real-World 3C Product Dataset for Unsupervised Anomaly
96、Exploring Visual Embedding Spaces Induced by Vision Transformers for Online Auto Parts Marketplaces
97、Efficient Reinforcement Learning Through Adaptively Pretrained Visual Encoder
98、Probabilistic Foundations for Metacognition via Hybrid-AI
99、Two-Player Zero-Sum Differential Games with One-Sided Information
100、The Phantom of the Elytra – Phylogenetic Trait Extraction from Images of Rove Beetles Using Deep Learning – Is the Mask Enough?
101、MultiFloodSynth: Multi-Annotated Flood Synthetic Dataset Generation
102、Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis
103、MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers
104、GHOST: Gaussian Hypothesis Open-Set Technique
105、Mitigating Language Bias in Cross-Lingual Job Retrieval: A Recruitment Platform Perspective
106、IAO Prompting: Making Knowledge Flow Explicit in LLMs through Structured Reasoning Templates
107、Maximizing the Position Embedding for Vision Transformers with Global Average Pooling
108、Medical Multimodal Model Stealing Attacks via Adversarial Domain Alignment
109、MaintaAvatar: A Maintainable Avatar Based on Neural Radiance Fields by Continual Learning
110、Hierarchical Consensus Network for Multiview Feature Learning
111、LLM-TA: An LLM-Enhanced Thematic Analysis Pipeline for Transcripts from Parents of Children with Congenital Heart Disease
112、Nearly Tight Bounds for Exploration in Streaming Multi-armed Bandits with Known Optimality Gap
113、Universal Post-Processing Networks for Joint Optimization of Modules in Task-Oriented Dialogue Systems
114、Scalable Framework for Classifying AI-Generated Content Across Modalities
115、OneBatchPAM: A Fast and Frugal K-Medoids Algorithm
116、The Pitfalls of “Security by Obscurity” And What They Mean for Transparent AI
117、A Video-grounded Dialogue Dataset and Metric for Event-driven Activities
118、MAMS: Model-Agnostic Module Selection Framework for Video Captioning
119、Tensor Completion for Surrogate Modeling of Material Property Prediction
120、VidSole: A Multimodal Dataset for Joint Kinetics Quantification and Disease Detection with Deep Learning
121、Task and Perception-aware Distributed Source Coding for Correlated Speech under Bandwidth-constrained Channels
122、Exploring Vision Language Models for Multimodal and Multilingual Stance Detection
123、Memorize and Rank: Elevating Large Language Models for Clinical Diagnosis Prediction
124、Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting
125、LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience
126、Sequential Decision Making in Stochastic Games with Incomplete Preferences over Temporal Objectives
127、Revisiting Projection-Free Online Learning with Time-Varying Constraints
128、Efficient Attention-Sharing Information Distillation Transformer for Lightweight Single Image Super-Resolution
129、Learning Complex Heterogeneous Multimodal Fake News via Social Latent Network Inference
130、A Training-free Synthetic Data Selection Method for Semantic Segmentation
131、PatentLMM: Large Multimodal Model for Generating Descriptions for Patent Figures
132、Graph-Based Cross-Domain Knowledge Distillation for Cross-Dataset Text-to-Image Person Retrieval
133、Controllable Protein Sequence Generation with LLM Preference Optimization
134、MISCON: A Mission-Driven Conversational Consultant for Pre-Venture Entrepreneurs in Food Deserts
135、FSTA-SNN:Frequency-based Spatial-Temporal Attention Module for Spiking Neural Networks
136、BrainGuard: Privacy-Preserving Multisubject Image Reconstructions from Brain Activities
137、Micro-macro Wavelet-based Gaussian Splatting for 3D Reconstruction from Unconstrained Images
138、Effective Defect Detection Using Instance Segmentation for NDI
139、Reinforcement Learning Platform for Adversarial Black-box Attacks with Custom Distortion Filters
140、Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change
141、Crossfire: An Elastic Defense Framework for Graph Neural Networks Under Bit Flip Attacks
142、GCAD: Anomaly Detection in Multivariate Time Series from the Perspective of Granger Causality
143、Debate Helps Weak-to-Strong Generalization
144、Multilinguality in LLM-Designed Reward Functions for Restless Bandits: Effects on Task Performance and Fairness
145、To Measure or Not: A Cost-Sensitive, Selective Measuring Environment for Agricultural Management Decisions with Reinforcement Learning
146、Exploring Unknown Social Networks for Discovering Hidden Nodes
147、How Does the Spatial Distribution of Pre-training Data Affect Geospatial Foundation Models?
148、Deploying Privacy Guardrails for LLMs: A Comparative Analysis of Real-World Applications
149、Scopes of Alignment
150、SMamba: Sparse Mamba for Event-based Object Detection
151、FNIN: A Fourier Neural Operator-based Numerical Integration Network for Surface-form-gradients
152、SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks
153、MASS: Overcoming Language Bias in Image-Text Matching
154、Federated Learning with Sample-level Client Drift Mitigation
155、Disentangled Modeling of Preferences and Social Influence for Group Recommendation
156、Towards Loss-Resilient Image Coding for Unstable Satellite Networks
157、Learning with Open-world Noisy Data via Class-independent Margin in Dual Representation Space
158、SMARTe-VR: Student Monitoring and Adaptive Response Technology for e-learning in Virtual Reality
159、TSVC:Tripartite Learning with Semantic Variation Consistency for Robust Image-Text Retrieval
160、Decoupling Appearance Variations with 3D Consistent Features in Gaussian Splatting
161、Differentiable Adversarial Attacks for Marked Temporal Point Processes
162、Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems
163、LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading
164、Confidence Estimation for Error Detection in Text-to-SQL Systems
165、Normal-NeRF: Ambiguity-Robust Normal Estimation for Highly Reflective Scenes
166、AugRefer: Advancing 3D Visual Grounding via Cross-Modal Augmentation and Spatial Relation-based Referring
167、A Simple Graph Contrastive Learning Framework for Short Text Classification
168、Boosting Short Text Classification with Multi-Source Information Exploration and Dual-Level Contrastive Learning
169、Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks
170、Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval
171、Computing Game Symmetries and Equilibria That Respect Them
172、ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind
173、SAIF: A Comprehensive Framework for Evaluating the Risks of Generative AI in the Public Sector
174、Normalize Then Propagate: Efficient Homophilous Regularization for Few-shot Semi-Supervised Node Classification
175、Densely Connected Parameter-Efficient Tuning for Referring Image Segmentation
176、DualOpt: A Dual Divide-and-Optimize Algorithm for the Large-scale Traveling Salesman Problem
177、PokerBench: Training Large Language Models to become Professional Poker Players
178、D
2
^2
2-DPM: Dual Denoising for Quantized Diffusion Probabilistic Models
179、Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models
180、AI Guide Dog: Egocentric Path Prediction on Smartphone
181、PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration
182、Performance Optimization of Ratings-Based Reinforcement Learning
183、Large Language Models for Interpretable Mental Health Diagnosis
184、Improving DeFi Accessibility through Efficient Liquidity Provisioning with Deep Reinforcement Learning
185、RbRL2.0: Integrated Reward and Policy Learning for Rating-based Reinforcement Learning
186、Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data
187、Foundation Models at Work: Fine-Tuning for Fairness in Algorithmic Hiring
188、Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion
189、Anomalous Agreement: How to find the Ideal Number of Anomaly Classes in Correlated, Multivariate Time Series Data
190、Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics
191、ACCon: Angle-Compensated Contrastive Regularizer for Deep Regression
192、A Weighted Similarity Metric for Community Detection in Sparse Data
193、Pareto Set Learning for Multi-Objective Reinforcement Learning
194、VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning
195、Multi-task Visual Grounding with Coarse-to-Fine Consistency Constraints
196、Comparing Few-Shot Prompting of GPT-4 LLMs with BERT Classifiers for Open-Response Assessment in Tutor Equity Training
197、Understanding How Paper Writers Use AI-Generated Captions in Figure Caption Writing
198、FedSA: A Unified Representation Learning via Semantic Anchors for Prototype-based Federated Learning
199、Uncertainty-aware Knowledge Tracing
200、TimeDP: Learning to Generate Multi-Domain Time Series with Domain Prompts
201、JAQ: Joint Efficient Architecture Design and Low-Bit Quantization with Hardware-Software Co-Exploration
202、Stream Aligner: Efficient Sentence-Level Alignment via Distribution Induction
203、Is Your Autonomous Vehicle Safe? Understanding the Threat of Electromagnetic Signal Injection Attacks on Traffic Scene Perception
204、CoDe: Communication Delay-Tolerant Multi-Agent Collaboration via Dual Alignment of Intent and Timeliness
205、FaceMe: Robust Blind Face Restoration with Personal Identification
206、State-Based Disassembly Planning
207、IPDN: Image-enhanced Prompt Decoding Network for 3D Referring Expression Segmentation
208、V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept Tokenizer
209、Battling the Non-stationarity in Time Series Forecasting via Test-time Adaptation
210、Targeted Adversarial Denoising Autoencoders (TADA) for Neural Time Series Filtration
211、Evaluating Interval-based Tokenization for Pitch Representation in Symbolic Music Analysis
212、FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature Consistency
213、Unified Coding for Both Human Perception and Generalized Machine Analytics with CLIP Supervision
214、A Plug-and-Play Bregman ADMM Module for Inferring Event Branches in Temporal Point Processes
215、Rethinking High-speed Image Reconstruction Framework with Spike Camera
216、TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning
217、Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study
218、Textualize Visual Prompt for Image Editing via Diffusion Bridge
219、Entropy-Guided Attention for Private LLMs
220、Align-Pro: A Principled Approach to Prompt Optimization for LLM Alignment
221、VOILA: Complexity-Aware Universal Segmentation of CT images by Voxel Interacting with Language
222、Enhanced Importance Sampling through Latent Space Exploration in Normalizing Flows
223、Rethinking Byzantine Robustness in Federated Recommendation from Sparse Aggregation Perspective
224、Revolutionizing Encrypted Traffic Classification with MH-Net: A Multi-View Heterogeneous Graph Model
225、Backdoor Token Unlearning: Exposing and Defending Backdoors in Pretrained Language Models
226、AIF-SFDA: Autonomous Information Filter-driven Source-Free Domain Adaptation for Medical Image Segmentation
227、CALM: Curiosity-Driven Auditing for Large Language Models
228、LOHA: Direct Graph Spectral Contrastive Learning Between Low-pass and High-pass Views
229、Offline-to-online hyperparameter transfer for stochastic bandits
230、HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation
231、Universal Features Guided Zero-Shot Category-Level Object Pose Estimation
232、Enhancing Trustworthiness of Graph Neural Networks with Rank-Based Conformal Training
233、Holistic Semantic Representation for Navigational Trajectory Generation
234、Sequence Complementor: Complementing Transformers For Time Series Forecasting with Learnable Sequences
235、Multispectral Pedestrian Detection with Sparsely Annotated Label
236、Multi-LLM Collaborative Caption Generation in Scientific Documents
237、Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection
238、Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine
239、MetaNeRV: Meta Neural Representations for Videos with Spatial-Temporal Guidance
240、KD-MSLRT: Lightweight Sign Language Recognition Model Based on Mediapipe and 3D to 1D Knowledge Distillation
241、Detecting Music Performance Errors with Transformers
242、Is Your Image a Good Storyteller?
243、Optimal bounds for dissatisfaction in perpetual voting
244、Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models
245、Adaptive Few-shot Prompting for Machine Translation with Pre-trained Language Models
246、The (Exact) Price of Cardinality for Indivisible Goods: A Parametric Perspective
247、Look Back for More: Harnessing Historical Sequential Updates for Personalized Federated Adapter Tuning
248、AI-Enabled Operations at Fermi Complex: Multivariate Time Series Prediction for Outage Prediction and Diagnosis
249、Enhancing Reasoning through Process Supervision with Monte Carlo Tree Search
250、Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension
251、Citations and Trust in LLM Generated Responses
252、Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging
253、SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization
254、Asymmetric Reinforcing against Multi-modal Representation Bias
255、Sparis: Neural Implicit Surface Reconstruction of Indoor Scenes from Sparse Views
256、DuMo: Dual Encoder Modulation Network for Precise Concept Erasure
257、Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning
258、MalCL: Leveraging GAN-Based Generative Replay to Combat Catastrophic Forgetting in Malware Classification
259、MMVA: Multimodal Matching Based on Valence and Arousal across Images, Music, and Musical Captions
260、HoneypotNet: Backdoor Attacks Against Model Extraction
261、Noise-Resilient Symbolic Regression with Dynamic Gating Reinforcement Learning
262、Bootstrapped Reward Shaping
263、A Novel Diffusion Model for Pairwise Geoscience Data Generation with Unbalanced Training Dataset
264、Population Aware Diffusion for Time Series Generation
265、Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation
266、Less is More: Token Context-aware Learning for Object Tracking
267、Foreground-Covering Prototype Generation and Matching for SAM-Aided Few-Shot Segmentation
268、SoundBrush: Sound as a Brush for Visual Scene Editing
269、CNC: Cross-modal Normality Constraint for Unsupervised Multi-class Anomaly Detection
270、SAM-Aware Graph Prompt Reasoning Network for Cross-Domain Few-Shot Segmentation
271、Predicate Invention from Pixels via Pretrained Vision-Language Models
272、Make Domain Shift a Catastrophic Forgetting Alleviator in Class-Incremental Learning
273、Seq2Seq Model-Based Chatbot with LSTM and Attention Mechanism for Enhanced User Interaction
274、Exploring and Controlling Diversity in LLM-Agent Conversation
275、Low-Light Image Enhancement via Generative Perceptual Priors
276、Frequency-Masked Embedding Inference: A Non-Contrastive Approach for Time Series Representation Learning
277、KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences
278、Diffgrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model
279、Planning, Living and Judging: A Multi-agent LLM-based Framework for Cyclical Urban Planning
280、Multimodal Variational Autoencoder: a Barycentric View
281、Tri-Ergon: Fine-grained Video-to-Audio Generation with Multi-modal Conditions and LUFS Control
282、EmoReg: Directional Latent Vector Modeling for Emotional Intensity Regularization in Diffusion-based Voice Conversion
283、Asynchronous Federated Clustering with Unknown Number of Clusters
284、Real-time Calibration Model for Low-cost Sensor in Fine-grained Time series
285、TradingAgents: Multi-Agents LLM Financial Trading Framework
286、ST
3
^3
3: Accelerating Multimodal Large Language Model by Spatial-Temporal Visual Token Trimming
287、Discrete Curvature Graph Information Bottleneck
288、Sharpening Neural Implicit Functions with Frequency Consolidation Priors
289、The Value of Recall in Extensive-Form Games
290、Diverse Rare Sample Generation with Pretrained GANs
291、Interacted Object Grounding in Spatio-Temporal Human-Object Interactions
292、Towards Open-Vocabulary Remote Sensing Image Semantic Segmentation
293、DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes
294、KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
295、On the Expressiveness and Length Generalization of Selective State-Space Models on Regular Languages
296、ViPCap: Retrieval Text-Based Visual Prompts for Lightweight Image Captioning
297、Time Series Foundational Models: Their Role in Anomaly Detection and Prediction
298、Virtual Nodes Can Help: Tackling Distribution Shifts in Federated Graph Learning
299、Learning Cross-Domain Representations for Transferable Drug Perturbations on Single-Cell Transcriptional Responses
300、Towards Better Spherical Sliced-Wasserstein Distance Learning with Data-Adaptive Discriminative Projection Direction
301、PlanLLM: Video Procedure Planning with Refinable Large Language Models
302、SUTrack: Towards Simple and Unified Single Object Tracking
303、Graph Mixture of Experts and Memory-augmented Routers for Multivariate Time Series Anomaly Detection
304、BSDB-Net: Band-Split Dual-Branch Network with Selective State Spaces Mechanism for Monaural Speech Enhancement
305、DAPoinTr: Domain Adaptive Point Transformer for Point Cloud Completion
306、CL-attack: Textual Backdoor Attacks via Cross-Lingual Triggers
307、Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization
308、Cross-PCR: A Robust Cross-Source Point Cloud Registration Framework
309、Improving Integrated Gradient-based Transferable Adversarial Examples by Refining the Integration Path
310、FedVCK: Non-IID Robust and Communication-Efficient Federated Learning via Valuable Condensed Knowledge for Medical Image Analysis
311、Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-Augmentation
312、Graph Structure Learning for Spatial-Temporal Imputation: Adapting to Node and Feature Scales
313、Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English
314、Extract Free Dense Misalignment from CLIP
315、Contrastive Representation for Interactive Recommendation
316、A Many Objective Problem Where Crossover is Provably Indispensable
317、Unveiling the Threat of Fraud Gangs to Graph Neural Networks: Multi-Target Graph Injection Attacks Against GNN-Based Fraud Detectors
318、Hypergraph Attacks via Injecting Homogeneous Nodes into Elite Hyperedges
319、Learning Generalized Residual Exchange-Correlation-Uncertain Functional for Density Functional Theory
320、FloNa: Floor Plan Guided Embodied Visual Navigation
321、Enhancing Multi-Robot Semantic Navigation Through Multimodal Chain-of-Thought Score Collaboration
322、NoiseHGNN: Synthesized Similarity Graph-Based Neural Network For Noised Heterogeneous Graph Representation Learning
323、AdaCo: Overcoming Visual Foundation Model Noise in 3D Semantic Segmentation via Adaptive Label Correction
324、Towards Macro-AUC oriented Imbalanced Multi-Label Continual Learning
325、ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation
326、PCM Selector: Penalized Covariate-Mediator Selection Operator for Evaluating Linear Causal Effects
327、Token Highlighter: Inspecting and Mitigating Jailbreak Prompts for Large Language Models
328、SongGLM: Lyric-to-Melody Generation with 2D Alignment Encoding and Multi-Task Pre-Training
329、The Unreasonable Effectiveness of Open Science in AI: A Replication Study
330、Graph Structure Refinement with Energy-based Contrastive Learning
331、Active Geospatial Search for Efficient Tenant Eviction Outreach
332、Cross-View Referring Multi-Object Tracking
333、GaussianPainter: Painting Point Cloud into 3D Gaussians with Normal Guidance
334、DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder
335、Hierarchical Vector Quantization for Unsupervised Action Segmentation
336、Kernel-Aware Graph Prompt Learning for Few-Shot Anomaly Detection
337、S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field
338、Retention Score: Quantifying Jailbreak Risks for Vision Language Models
339、Constructing Fair Latent Space for Intersection of Fairness and Explainability
340、BEE: Metric-Adapted Explanations via Baseline Exploration-Exploitation
341、An Evaluation Framework for Product Images Background Inpainting based on Human Feedback and Product Consistency
342、CALLIC: Content Adaptive Learning for Lossless Image Compression
343、Learning from Summarized Data: Gaussian Process Regression with Sample Quasi-Likelihood
344、Just What You Desire: Constrained Timeline Summarization with Self-Reflection for Enhanced Relevance
345、BrainMAP: Learning Multiple Activation Pathways in Brain Networks
346、Towards Intrinsic Self-Correction Enhancement in Monte Carlo Tree Search Boosted Reasoning via Iterative Preference Learning
347、Singular Value Scaling: Efficient Generative Model Compression via Pruned Weights Refinement
348、Interweaving Memories of a Siamese Large Language Model
349、FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation
350、Bi-Directional Multi-Scale Graph Dataset Condensation via Information Bottleneck
351、EcoSearch: A Constant-Delay Best-First Search Algorithm for Program Synthesis