Google Landmark Retrieval 2021 2nd Place Solution 使用教程

最新推荐文章于 2024-09-12 08:56:40 发布

侯深业Dorian

最新推荐文章于 2024-09-12 08:56:40 发布

阅读量406

点赞数 5

本文链接：https://blog.csdn.net/gitblog_00093/article/details/139542483

版权

Google Landmark Retrieval 2021 2nd Place Solution 使用教程

Google_Landmark_Retrieval_2021_2nd_Place_Solution 项目地址: https://gitcode.com/gh_mirrors/go/Google_Landmark_Retrieval_2021_2nd_Place_Solution

1. 项目介绍

本项目是2021年Google Landmark Retrieval竞赛的第二名解决方案。该项目主要用于地标图像的检索任务，通过深度学习模型对图像进行特征提取和匹配，从而实现高效的地标图像检索。项目使用了多种先进的深度学习模型，如ResNeXt101ibn、SEResNet101ibn等，并结合了多阶段的训练和推理策略，以达到最佳的检索效果。

2. 项目快速启动

环境准备

使用CUDA 11.1、Python 3.7、PyTorch 1.9.1和Torchvision 0.8.1进行训练和测试。
下载ImageNet预训练模型ResNeXt101ibn和SEResNet101ibn。
从官方网站下载GLDv2完整版本数据集。

数据准备

运行python tools/generate_gld_list.py生成训练数据列表。
验证数据来自GLDv2中的1129张图像。

快速训练

使用8个GPU进行训练。

快速训练脚本（适用于R50_256模型）：

python -m torch.distributed.run --standalone --nnodes=1 --nproc_per_node=8 --master_port 55555 --max_restarts 0 train.py --config_file configs/GLDv2/R50_256.yml

完整训练流程

使用SER101ibn骨干网络的完整训练流程：

python -m torch.distributed.run --standalone --nnodes=1 --nproc_per_node=8 --master_port 55555 --max_restarts 0 train.py --config_file configs/GLDv2/SER101ibn_384.yml
python -m torch.distributed.run --standalone --nnodes=1 --nproc_per_node=8 --master_port 55555 --max_restarts 0 train.py --config_file configs/GLDv2/SER101ibn_384_finetune.yml
python -m torch.distributed.run --standalone --nnodes=1 --nproc_per_node=8 --master_port 55555 --max_restarts 0 train.py --config_file configs/GLDv2/SER101ibn_512_finetune.yml
python -m torch.distributed.run --standalone --nnodes=1 --nproc_per_node=8 --master_port 55555 --max_restarts 0 train.py --config_file configs/GLDv2/SER101ibn_512_all.yml