Original: Adapting Vision-Language Models Without Labels A Comprehensive Survey
Paper Daily
2025-08-09 13:53:00
680
Original: LLaVA-RE Binary Image-Text Relevancy Evaluation with Multimodal Large Language Model
Paper Daily
2025-08-09 13:44:55
478
Original: MELLA Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs
Paper Daily
2025-08-09 13:36:48
431
Original: Follow-Your-Instruction A Comprehensive MLLM Agent for World Data Synthesis
Paper Daily
2025-08-09 13:28:18
369
Original: mKG-RAG Multimodal Knowledge Graph-Enhanced RAG for Visual Question Answering
Paper Daily
2025-08-09 13:20:05
456
Original: LumiGen An LVLM-Enhanced Iterative Framework for Fine-Grained Text-to-Image Generation
Paper Daily
2025-08-09 13:11:48
160
Original: Multimodal Causal-Driven Representation Learning for Generalizable Medical Image Segmentation
Paper Daily
2025-08-09 12:55:10
852
Original: Accelerating Conditional Prompt Learning via Masked Image Modeling for Vision-Language Models
Paper Daily
2025-08-09 12:46:39
523
Original: RCR-Router Efficient Role-Aware Context Routing for Multi-Agent LLM Systems with Structured Memory
Paper Daily
2025-08-09 12:38:22
595
Original: WeTok Powerful Discrete Tokenization for High-Fidelity Visual Reconstruction
Paper Daily
2025-08-09 10:12:57
601
Original: Textual Inversion for Efficient Adaptation of Open-Vocabulary Object Detectors Without Forgetting
Paper Daily
2025-08-09 09:59:05
472
Original: VFlowOpt A Token Pruning Framework for LMMs with Visual Information Flow-Guided Optimization
Paper Daily
2025-08-09 09:30:48
459
Original: CIVQLLIE Causal Intervention with Vector Quantization for Low-Light Image Enhancement
Paper Daily
2025-08-07 09:36:52
304
Original: MedBLINK Probing Basic Perception in Multimodal Language Models for Medicine
Paper Daily
2025-08-07 09:20:51
233
Original: Neutralizing Token Aggregation via Information Augmentation for Efficient Test-Time Adaptation
Paper Daily
2025-08-07 09:12:47
232
Original: SAVER Mitigating Hallucinations in Large Vision-Language Models via Style-Aware Visual Early Revisio
Paper Daily
2025-08-07 09:04:48
454
Original: Coherent Multimodal Reasoning with Iterative Self-Evaluation for Vision-Language Models
Paper Daily
2025-08-07 08:56:50
372
Original: UniEdit-I Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verif
Paper Daily
2025-08-07 08:48:53
255
Original: Enhancing Long Video Question Answering with Scene-Localized Frame Grouping
Paper Daily
2025-08-07 08:40:49
417
Original: VLMQ Efficient Post-Training Quantization for Large Vision-Language Models via Hessian Augmentation
Paper Daily
2025-08-07 08:32:49
581
Original: Less is More Token-Efficient Video-QA via Adaptive Frame-Pruning and Semantic Graph Integration
Paper Daily
2025-08-07 08:24:43
580
Original: Enhancing Long Video Question Answering with Scene-Localized Frame Grouping
Paper Daily
2025-08-06 23:45:08
631
Original: VLMQ Efficient Post-Training Quantization for Large Vision-Language Models via Hessian Augmentation
Paper Daily
2025-08-06 23:37:04
730
Original: Less is More Token-Efficient Video-QA via Adaptive Frame-Pruning and Semantic Graph Integration
Paper Daily
2025-08-06 23:29:01
265
Original: Point-Bind & Point-LLM Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, an
Paper Daily
2025-08-06 21:44:31
793
Original: Less is More Token-Efficient Video-QA via Adaptive Frame-Pruning and Semantic Graph Integration
Paper Daily
2025-08-06 14:51:13
1009
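Original: Paper Agent that integrates with Zotero or local preferences (open-sourced and ready to use)
An intelligent tool built for researchers, students, and tech enthusiasts. Tired of getting lost in a sea of information? Want to take your research efficiency to a new level? Visit our GitHub repository now: with just three simple configuration steps, you can have your own AI research assistant! Features: fetch recent papers from Zotero; query the latest arXiv papers; generate in-depth summaries with an LLM; generate a Markdown daily briefing.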
2025-07-16 17:10:28
869
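Computer Architecture course report: the past, present, and future of Moore's Law
2023-04-29
A back-end static resource template, essential for quickly building web pages
2023-04-06
HFUT Computer Networks review materials (1000 pages of PPT slides)
2023-03-31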
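When is a clock input needed when designing a CPU in Verilog?
2022-12-22
Reinforcement learning with Paddle
2022-11-07
Anaconda update fails with out of memory
2022-11-15
How to build an interactive exe file with ML-Agents
2022-11-19
How to connect an environment built with ML-Agents to your own Python code
2022-11-19
What parts make up a Windows shortcut under the hood?
2022-11-16
Why does MIPS define three instruction formats: I, J, and R?
2022-11-15
Aligning pseudocode in LaTeX
2022-11-12
Because libgcc_s_sjlj-1.dll cannot be found
2022-11-08
PyCharm pops up Modify Setup
2022-11-06
Cannot delete objects in 3ds Max
2022-10-06
Problem with TeXStudio
2022-09-14
How to draw tables in Word
2022-09-16
Plotting 3D scatter plots in MATLAB
2022-09-16
TeXStudio reports "Could not start Default" when running
2022-09-14
Conditional merging with pandas
2022-09-01
Druid connection pool error; changing the MySQL driver did not help
2022-08-15
Scrapy deployed on a server shows ERROR: Error downloading after running for a while
2022-08-09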
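Connecting Zotero to infini-cloud
2024-07-25
Removing keyboard shortcut settings in Windows 11
2024-07-02
Windows user folder disappeared
2024-06-27
Training anomaly in a text-based cross-modal person re-identification model
2024-05-24
Logs missing during training, but it still runs normally
2024-04-22
Copying ChatGPT formulas into Typora
2024-02-07
ChatGPT shows ParseError: KaTeX parse error
2024-02-04
Edge browser cannot search
2023-08-21
Exception when using JDBC with the Druid database connection pool
2023-06-19
PowerPoint cannot load AxGlyph.ppam
2023-04-14
How to sort MongoDB records by time in ascending order in Navicat?
2023-03-12
How does an operating system built from scratch compile code into machine code without a compiler?
2023-01-21
Creating threads in C on Windows
2023-01-16
Windows 10 pops up edgeupdater
2023-01-04
Setting a primary key in SQL Server
2023-01-03
Windows 10 pops up edgetaskUpdater
2023-01-03
.NET cannot find System.Web when importing it
2023-01-01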