自定义博客皮肤VIP专享

*博客头图:

格式为PNG、JPG,宽度*高度大于1920*100像素,不超过2MB,主视觉建议放在右侧,请参照线上博客头图

请上传大于1920*100像素的图片!

博客底图:

图片格式为PNG、JPG,不超过1MB,可上下左右平铺至整个背景

栏目图:

图片格式为PNG、JPG,图片宽度*高度为300*38像素,不超过0.5MB

主标题颜色:

RGB颜色,例如:#AFAFAF

Hover:

RGB颜色,例如:#AFAFAF

副标题颜色:

RGB颜色,例如:#AFAFAF

自定义博客皮肤

-+
  • 博客(66)
  • 收藏
  • 关注

原创 论文阅读笔记——ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback

ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback 论文阅读笔记

2025-06-09 10:58:24 667 1

原创 论文阅读笔记——Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing

Muffin: Testing Deep Learning Libraries via Neural Architecture Fuzzing 论文阅读笔记

2025-06-09 10:55:29 645 1

原创 论文阅读笔记——D3: Differential Testing of Distributed Deep Learning With Model Generation

D3: Differential Testing of Distributed Deep Learning With Model Generation 论文阅读笔记

2025-06-07 22:37:43 911 1

原创 论文阅读笔记——Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries

Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries 论文阅读笔记

2025-06-07 22:35:00 577 1

原创 论文阅读笔记——Large Language Models Are Zero-Shot Fuzzers

Large Language Models Are Zero-Shot Fuzzers: Fuzzing Deep-Learning Libraries via Large Language Models 论文阅读笔记

2025-06-04 15:48:37 1226 2

原创 论文阅读笔记——FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space

FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space 论文阅读笔记

2025-06-02 15:33:06 1187 1

原创 论文阅读笔记——Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset

Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset 论文阅读笔记

2025-06-01 12:54:58 735 1

原创 论文阅读笔记——FLOW MATCHING FOR GENERATIVE MODELING

FLOW MATCHING FOR GENERATIVE MODELING 论文阅读笔记

2025-05-30 19:26:47 1835 1

原创 论文阅读笔记——MetaMorph: Multimodal Understanding and Generation via Instruction Tuning

MetaMorph: Multimodal Understanding and Generation via Instruction Tuning 论文阅读笔记

2025-05-30 19:23:22 1006 1

原创 论文阅读笔记——In-Context Edit

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer 论文阅读笔记

2025-05-28 15:44:52 1330 1

原创 论文阅读笔记——Step1X-Edit: A Practical Framework for General Image Editing

Step1X-Edit: A Practical Framework for General Image Editing 论文阅读笔记

2025-05-27 23:47:46 1348 1

原创 论文阅读笔记——Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing

Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing 论文阅读笔记

2025-05-27 22:42:36 961 1

原创 论文阅读笔记——ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision

ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision 论文阅读笔记

2025-05-26 20:51:25 246 1

原创 论文阅读笔记——Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model 论文阅读笔记

2025-05-26 16:26:01 920 1

原创 论文阅读笔记——Janus,Janus Pro

Janus、Janus Pro 论文阅读笔记

2025-05-25 18:40:34 1390 1

原创 论文阅读笔记——Emerging Properties in Unified Multimodal Pretraining

Emerging Properties in Unified Multimodal Pretraining 论文阅读笔记

2025-05-24 19:08:26 1142 1

原创 论文阅读笔记——PixArt-α,PixArt-δ

PixArt-α,PixArt-δ 论文阅读笔记

2025-05-22 20:15:07 1026 1

原创 论文阅读笔记——双流网络

Two-Stream Convolutional Networks for Action Recognition in Videos 论文阅读笔记

2025-05-14 17:50:45 675 1

原创 论文阅读笔记——Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions

Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions 论文阅读笔记

2025-05-07 14:06:31 914 1

原创 论文阅读笔记——ROBOGROUND: Robotic Manipulation with Grounded Vision-Language Priors

ROBOGROUND: Robotic Manipulation with Grounded Vision-Language Priors 论文阅读笔记

2025-05-06 23:24:24 1334 1

原创 论文阅读笔记——STDArm

STDArm: Transferring Visuomotor Policies From Static Data Training to Dynamic Robot Manipulation 论文阅读笔记

2025-05-04 11:26:26 1574 1

原创 论文阅读笔记——TesserAct: Learning 4D Embodied World Models

TesserAct: Learning 4D Embodied World Models 论文阅读笔记

2025-05-02 13:08:02 1527 1

原创 论文阅读笔记——Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction

Phoenix: A Motion-based Self-Reflection Framework for Fine-grained Robotic Action Correction 论文阅读笔记

2025-04-30 10:32:22 870 1

原创 论文阅读笔记——ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping

ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping 论文阅读笔记

2025-04-25 16:59:39 1117 1

原创 论文阅读笔记——π0.5: a Vision-Language-Action Model with Open-World Generalization

π0.5: a Vision-Language-Action Model with Open-World Generalization 论文阅读笔记

2025-04-24 10:04:09 1643 1

原创 论文阅读笔记——A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation

A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation 论文阅读笔记,其核心创新在于将任务分解为**高层空间可操作性推理**与**底层动作执行**,通过跨平台的**具身无关可操作性表示**(Embodiment-Agnostic Affordance Representation)预测物体中心的接触点与轨迹,实现多机器人系统的泛化能力。

2025-04-21 12:00:00 1241 1

原创 论文阅读笔记——Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsit

Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity 论文阅读笔记

2025-04-20 12:00:00 1358 1

原创 论文阅读笔记——OPAL: Encoding Causal Understanding of Physical Systems for Robot Learning

OPAL: Encoding Causal Understanding of Physical Systems for Robot Learning 论文阅读笔记

2025-04-19 14:00:55 1059 1

原创 论文阅读笔记——Mixtral of Experts

Mixtral of Experts 论文阅读笔记

2025-04-18 11:14:03 1456 1

原创 论文阅读笔记——RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete

RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete论文阅读笔记

2025-04-17 12:14:54 593 1

原创 论文阅读笔记——Generating Long Sequences with Sparse Transformers

Generating Long Sequences with Sparse Transformers 论文阅读笔记

2025-04-14 19:00:00 1283 1

原创 论文阅读笔记——Reactive Diffusion Policy

Reactive Diffusion Policy: Slow-Fast Visual-Tactile Policy Learning for Contact-Rich Manipulation 论文阅读笔记

2025-04-13 14:57:04 1045 1

原创 论文阅读笔记——Multi-Token Attention

Multi-Token Attention 论文阅读笔记

2025-04-12 21:00:00 1393 1

原创 论文阅读笔记——GPT-1,GPT-2,GPT-3,InstructGPT

GPT-1,GPT-2,GPT-3,InstructGPT 论文阅读笔记

2025-04-09 12:00:00 1086 1

原创 论文阅读笔记——Deformable Radial Kernel Splatting

Deformable Radial Kernel Splatting 论文阅读笔记

2025-04-06 11:45:13 1172 1

原创 论文阅读笔记——RDT-1B: A DIFFUSION FOUNDATION MODEL FOR BIMANUAL MANIPULATION

RDT-1B: A DIFFUSION FOUNDATION MODEL FOR BIMANUAL MANIPULATION 论文阅读笔记

2025-04-05 16:54:44 1320 1

原创 论文阅读笔记——SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model

SpatialVLA 论文阅读笔记

2025-04-01 12:36:27 1119 1

原创 论文阅读笔记——PointVLA: Injecting the 3D World into Vision-Language-Action Models

PointVLA 论文阅读笔记

2025-03-30 23:19:05 521 1

原创 论文阅读笔记——ReconDreamer

ReconDreamer 论文阅读笔记

2025-03-29 17:41:16 1388 1

原创 论文阅读笔记——ST-4DGS,WideRange4D

ST-4DGS,WideRange4D 论文阅读笔记

2025-03-27 20:42:20 1394 1

空空如也

空空如也

TA创建的收藏夹 TA关注的收藏夹

TA关注的人

提示
确定要删除当前文章?
取消 删除