Open-GroundingDino 使用教程

宣聪麟

于 2024-08-16 07:44:26 发布

阅读量985

点赞数 27

本文链接：https://blog.csdn.net/gitblog_00276/article/details/141236785

版权

Open-GroundingDino 使用教程

Open-GroundingDinoThis is the third party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.项目地址:https://gitcode.com/gh_mirrors/op/Open-GroundingDino

1、项目介绍

Open-GroundingDino 是 Grounding DINO 的第三方实现，旨在通过结合 DINO 和文本预训练模型来实现开集目标检测。该项目允许用户输入文本提示并输出视觉目标的位置，实现了文本和图像的匹配。与传统的 OVD 算法相比，GroundingDino 在文本处理上具有更高的灵活性，能够处理单词、短语和句子等多种文本形式。

2、项目快速启动

环境安装

首先，克隆项目仓库并安装必要的依赖：

git clone https://github.com/longzw1997/Open-GroundingDino.git
cd Open-GroundingDino
pip install -r requirements.txt

编译 GroundingDino

接下来，编译 GroundingDino 模块：

cd models/GroundingDINO/ops
python setup.py build_ext --inplace

运行推理代码

使用以下命令运行推理代码：

python tools/inference_on_a_image.py \
  -c tools/GroundingDINO_SwinT_OGC.py \
  -p path/to/your/ckpt.pth \
  -i /figs/dog.jpeg \
  -t "dog" \
  -o output