CLIP-Count代码配置以及踩坑记录

行走的人参

已于 2023-11-01 10:39:04 修改

阅读量492

点赞数

文章标签： pytorch python

于 2023-10-31 17:04:23 首次发布

本文链接：https://blog.csdn.net/qq_42499501/article/details/134142755

版权

代码：https://github.com/songrise/CLIP-Count

下载好后安装一下特定的包，主要麻烦的是clip包，clip包不能用

~~pip install clip~~

这个是错的，下载好了运行会报错。下面是正确的按照方式

pip install ftfy # 这个包也要装
pip install git+https://github.com/openai/CLIP.git  # 装clip 的代码

如果电脑是内网环境，先下载CLIP的源码，然后解压，解压后进入clip目录。GitHub - openai/CLIP: CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

假设目录名为CLIP，如图。终端输入命令：

python -u setup.py install

然后就安装好了，装的是clip1.0

数据集配置没啥问题，train一定要 FSC-147

下载路径： https://drive.google.com/file/d/1ymDYrGs9DSRicfZbSCDiOu0ikGDh5k6S/view?usp=sharing

FSC-147注释路径：https://github.com/cvlab-stonybrook/LearningToCountEverything/tree/master/data

把data文件夹下下来，放在你的主文件路径下如下：

data
├─CARPK/
│  ├─Annotations/
│  ├─Images/
│  ├─ImageSets/
│
├─FSC/    
│  ├─gt_density_map_adaptive_384_VarV2/
│  ├─images_384_VarV2/
│  ├─FSC_147/
│  │  ├─ImageClasses_FSC147.txt
│  │  ├─Train_Test_Val_FSC_147.json
│  │  ├─ annotation_FSC147_384.json
│  
├─ShanghaiTech/
│  ├─part_A/
│  ├─part_B/

FSC下下来的和代码实际运行的还有点区别，自己改一下数据集里的名字就行了。

这样基本就可以train了。

如果内网环境，models/clip_count.py里的self.clip, clip_preprocess报错，改成本地路径下载好的pt即可。

        if backbone == "b16":
            # self.clip, clip_preprocess = clip.load("ViT-B/16")
            self.clip, clip_preprocess = clip.load("/path/CLIP-Count-main/ViT-B-16.pt")
            self.n_patches = 14*14
            self.clip_hidden_dim = 768
            self.clip_out_dim = 512

VIT-B-16.pt下载路径是：

https://openaipublic.azureedge.net/clip/models/5806e77cd80f8b59890b7e101eabd078d9fb84e6937f9e85e4ecb61988df416f/ViT-B-16.pt

最后，运行主目录下run.py即可。

python -u run.py

行走的人参

关注

0
点赞
踩
5

收藏

觉得还不错? 一键收藏
0
评论
复制链接

分享到 QQ

分享到新浪微博

扫一扫