Contribution
proposed an interpretable deep model for fine-grained visual recognition:
- 做细粒度分类,但同时output the segmentation of object parts and the identification of their contributions towards classification,增加了模型的可解释性
- 为了确认object parts,使用a simple prior (prior knowledge)。利用assumption:给定一张图,某个part出现的概率符合Beta distribution(beta distribution具体没懂,之后再了解)。
Methods
-
region-based part discovery and attribution
输入图片。输出类别,assignment map(区域分割),attention map(标出重要区域)。分三步:
- compare input feature X with part dictionary D, 得到soft part assignment map Q.
- 根据Q 和D,从X中pool出region features Z。再根据Z 进一步计算attention a。
- 用a reweigh