Deep Isometric Learning for Visual Recognition

This repository is an official PyTorch implementation of the ICML paper:

Deep Isometric Learning for Visual Recognition

Haozhi Qi, Chong You, Xiaolong Wang, Yi Ma, Jitendra Malik

International Conference on Machine Learning (ICML), 2020

[Project Webpage], [arXiv]

Introduction

In this project, we argue that the notion of isometry is a central guiding principle for training deep ConvNets. In particular, we take a minimalist approach and show that a vanilla deep Isometric Network (ISONet), i.e., one without BN and shortcut connections, can be trained and achieves surprisingly good accuracy. We also show that, when combined with skip connections, such near-isometric networks (i.e., R-ISONet) achieve performance on par with the standard ResNet, even without any normalization.
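As a rough illustration of what "isometry" means here, the toy sketch below compares a norm-preserving linear map (a rotation) with one that is not (an anisotropic scaling). It uses only the standard library; the matrices and helper names are illustrative assumptions and are not taken from the ISONet codebase.

```python
import math


def apply(matrix, vec):
    """Multiply a 2x2 matrix (given as a list of rows) by a 2-vector."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]


def norm(vec):
    """Euclidean norm of a vector."""
    return math.sqrt(sum(v * v for v in vec))


theta = math.pi / 6
# A rotation is orthogonal, so it is an isometry: it preserves lengths.
rotation = [[math.cos(theta), -math.sin(theta)],
            [math.sin(theta), math.cos(theta)]]
# A non-uniform scaling is not an isometry: it stretches one axis
# and shrinks the other.
scaling = [[2.0, 0.0],
           [0.0, 0.5]]

x = [3.0, 4.0]
print("input norm:   ", norm(x))
print("after rotation:", norm(apply(rotation, x)))  # equal to the input norm up to rounding
print("after scaling: ", norm(apply(scaling, x)))   # differs from the input norm
```

A layer that behaves like the rotation passes signals through without amplifying or attenuating them, which is the property the text argues is helpful when many layers are stacked.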

Main results

Top-1 classification accuracy (%) on the ImageNet validation set:

| Method | Depth 18 | Depth 34 | Depth 50 | Depth 101 |
|---|---|---|---|---|
| ISONet | 68.10 | 70.90 | 71.20 | 71.01 |
| R-ISONet | 69.17 | 73.43 | 76.18 | 77.08 |
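For readers unfamiliar with the metric, top-1 accuracy counts a prediction as correct only when the highest-scoring class equals the ground-truth label. The following is a minimal pure-Python sketch of that definition; it is not the repository's evaluation code, and the scores and labels are made-up toy data.

```python
def top1_accuracy(scores, labels):
    """Return top-1 accuracy in percent.

    scores: list of per-sample lists of class scores
    labels: list of ground-truth class indices
    """
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not labels:
        raise ValueError("at least one sample is required")
    correct = 0
    for row, label in zip(scores, labels):
        # Index of the highest-scoring class for this sample.
        predicted = max(range(len(row)), key=row.__getitem__)
        if predicted == label:
            correct += 1
    return 100.0 * correct / len(labels)


# Toy data: 4 samples, 3 classes (illustrative only).
scores = [[0.1, 0.7, 0.2],
          [0.8, 0.1, 0.1],
          [0.3, 0.3, 0.4],
          [0.2, 0.5, 0.3]]
labels = [1, 0, 1, 1]
print(top1_accuracy(scores, labels))  # 75.0
```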

For more results and pretrained models, see Model Zoo.

Using ISONet

Data Preparation

The ImageNet train and val folders should be located at data/ILSVRC2012/train and data/ILSVRC2012/val, respectively.
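Before launching a long run, it can help to confirm that the expected layout is in place. The snippet below is a small sketch that only checks that the two directories named above exist; the function name `missing_imagenet_dirs` is hypothetical and not part of the repository.

```python
from pathlib import Path


def missing_imagenet_dirs(root="data/ILSVRC2012"):
    """Return the expected split directories that do not exist under root."""
    root = Path(root)
    return [str(root / split)
            for split in ("train", "val")
            if not (root / split).is_dir()]


missing = missing_imagenet_dirs()
if missing:
    print("Missing directories:", ", ".join(missing))
else:
    print("ImageNet layout looks fine.")
```

Run it from the repository root so that the relative path data/ILSVRC2012 resolves correctly.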

Installation

This codebase was developed and tested with Python 3.6, PyTorch 1.4, and CUDA 10.1, but newer versions should also work.

Here we give an example of installing ISONet in a conda virtual environment:

    git clone https://github.com/HaozhiQi/ISONet
    cd ISONet
    conda create -y -n isonet
    conda activate isonet
    # install pytorch according to https://pytorch.org/
    conda install -y pytorch==1.4 torchvision cudatoolkit=10.1 -c pytorch
    pip install yacs
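After installation, a quick sanity check can confirm that the packages installed above are visible to the active interpreter. The sketch below only tests whether each package can be located, without importing it; the helper name `check_environment` is hypothetical.

```python
import importlib.util
import sys

# Packages installed by the commands above.
REQUIRED = ("torch", "torchvision", "yacs")


def check_environment(packages=REQUIRED):
    """Map each top-level package name to True if it can be found."""
    return {name: importlib.util.find_spec(name) is not None
            for name in packages}


print("python", sys.version.split()[0])
for name, found in check_environment().items():
    print("{}: {}".format(name, "found" if found else "MISSING"))
```

If any entry is reported as MISSING, make sure the isonet environment is activated before running the script.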

Evaluation

You can download the pre-trained models from the links in Model Zoo.

For example, to evaluate R-ISONet 18, download its checkpoint from the Model Zoo and run the following command:

    # change config files if you are going to test other pre-trained models
    python test.py --cfg configs/IN1k-RISO18.yaml --gpus {GPU_ID} --ckpt RISO18.pt

Training

To train our model from scratch, use the following command:

    python train.py --cfg {CONFIG_FILE} --gpus {GPU_ID} --output {OUTPUT_NAME}
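The evaluation and training commands share the same flag pattern, so when scripting several runs it may be convenient to assemble them programmatically. The sketch below only builds and prints the command lines using the flags shown above (--cfg, --gpus, --ckpt, --output). The helper `build_command`, the GPU id 0, the output name riso18_run, and the reuse of configs/IN1k-RISO18.yaml for training are all assumptions made for illustration.

```python
import shlex


def build_command(script, cfg, gpus, python="python", **extra):
    """Assemble an argument list such as: python test.py --cfg ... --gpus ..."""
    cmd = [python, script, "--cfg", cfg, "--gpus", str(gpus)]
    # Extra keyword arguments become additional --flag value pairs.
    for flag, value in extra.items():
        cmd += ["--" + flag, str(value)]
    return cmd


# Evaluation command from the section above (GPU id 0 is an assumption).
eval_cmd = build_command("test.py", "configs/IN1k-RISO18.yaml", 0,
                         ckpt="RISO18.pt")
# Training command; the output name is a hypothetical placeholder.
train_cmd = build_command("train.py", "configs/IN1k-RISO18.yaml", 0,
                          output="riso18_run")

for cmd in (eval_cmd, train_cmd):
    # Quote each part so the printed line can be pasted into a shell.
    print(" ".join(shlex.quote(part) for part in cmd))
```

To actually launch a run, the list can be passed to `subprocess.run(cmd, check=True)` from the repository root.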

Citing ISONet

If you find ISONet or this codebase helpful in your research, please consider citing:

    @InProceedings{qi2020deep,
      author={Qi, Haozhi and You, Chong and Wang, Xiaolong and Ma, Yi and Malik, Jitendra},
      title={Deep Isometric Learning for Visual Recognition},
      booktitle={ICML},
      year={2020}
    }
