Fine-Grained Image Analysis
Method | Published | BBox | Part | Backbone | Image Resolution | Bird Acc |
---|---|---|---|---|---|---|
MaxEnt | NeurIPS 2018 | | | GoogLeNet | TBD | 74.4% |
PS-CNN | CVPR 2016 | √ | √ | CaffeNet | 454×454 | 76.6% |
MaxEnt | NeurIPS 2018 | | | VGG-16 | TBD | 77.0% |
Mask-CNN | PR 2016 | | √ | Alex-Net | 448×448 | 78.6% |
PC | ECCV 2018 | | | ResNet-50 | TBD | 80.2% |
DeepLAC | CVPR 2015 | √ | √ | Alex-Net | 227×227 | 78.6% |
MaxEnt | NeurIPS 2018 | | | ResNet-50 | TBD | 80.4% |
Triplet-A | CVPR 2016 | √ | | GoogLeNet | TBD | 80.7% |
Multi-grained | ICCV 2015 | | | VGG-19 | 224×224 | 81.7% |
Krause et al. | CVPR 2015 | √ | | CaffeNet | TBD | 82.0% |
Multi-grained | ICCV 2015 | √ | | VGG-19 | 224×224 | 83.0% |
TS | CVPR 2016 | | | VGGD+VGGM | 448×448 | 84.0% |
Bilinear CNN | ICCV 2015 | | | VGGD+VGGM | 448×448 | 84.1% |
STN | NeurIPS 2015 | | | GoogLeNet+BN | 448×448 | 84.1% |
LRBP | CVPR 2017 | | | VGG-16 | 224×224 | 84.2% |
PDFS | CVPR 2016 | | | VGG-16 | TBD | 84.5% |
Xu et al. | ICCV 2015 | √ | √ | CaffeNet | 224×224 | 84.6% |
Cai et al. | ICCV 2017 | | | VGG-16 | 448×448 | 85.3% |
RA-CNN | CVPR 2017 | | | VGG-19 | 448×448 | 85.3% |
MaxEnt | NeurIPS 2018 | | | Bilinear CNN | TBD | 85.3% |
PC | ECCV 2018 | | | Bilinear CNN | TBD | 85.6% |
CVL | CVPR 2017 | | | VGG | TBD | 85.6% |