A 2020 Survey of Deep Learning Model Interpretability [with Code]


Lately my low-level vision training runs keep producing quality problems that only show up in subjective evaluation. It is hard to trace which aspects of the input data a failure corresponds to, so once a problem appears it is difficult to fix in a targeted way.

Which naturally raises the question: it is 2020 already, so how far has deep learning interpretability actually come?

For model interpretability, it is hard to justify every step the way one can when solving a math problem.

So I went through the literature on model interpretability, starting from 2012, with citation count as the main selection criterion.

The literature search proceeded roughly as follows (a minimal code sketch of this seed-queue loop appears after the list):

  1. Take the publication pages of Bolei Zhou (周博磊) and Quanshi Zhang (张拳石) as the initial seed queue;
  2. Pull the references of papers in the seed queue and add them to the queue;
  3. Add further papers found in the GitHub awesome lists and in Captum, the PyTorch model-interpretability library (a short Captum usage sketch appears just before the table below);
  4. In chronological order, from the classics to the newest work, search each paper in the seed queue on Google Scholar and mine its citing papers for additional relevant work to add to the queue;
  5. Repeat step 4 until the seed queue is empty.
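For concreteness, here is a minimal sketch of that seed-queue loop. It is not the exact script I used: `fetch_references` and `fetch_citing_papers` are hypothetical placeholders for whatever lookup you perform (Google Scholar by hand, in my case), and the venue whitelist is only illustrative.

```python
from collections import deque
from dataclasses import dataclass

# Illustrative only; in practice the "lookups" were manual Google Scholar searches.
TOP_VENUES = {"CVPR", "ICCV", "ECCV", "NeurIPS", "ICML", "ICLR", "AAAI"}

@dataclass(frozen=True)
class Paper:
    title: str
    venue: str
    year: int
    citations: int

def collect_papers(initial_seeds, fetch_references, fetch_citing_papers,
                   min_citations=100):
    """Seed-queue search over the citation graph, as described above.

    `fetch_references` / `fetch_citing_papers` are hypothetical callables
    that return a list of Paper objects for a given paper.
    """
    queue = deque(initial_seeds)            # step 1: initial seed queue
    seen, kept = set(), []

    while queue:                            # step 5: repeat until the queue is empty
        paper = queue.popleft()
        if paper.title in seen:
            continue
        seen.add(paper.title)

        # keep a paper if it is highly cited or from a top venue
        if paper.citations >= min_citations or paper.venue in TOP_VENUES:
            kept.append(paper)

        # steps 2 and 4: expand via references and citing papers
        for related in fetch_references(paper) + fetch_citing_papers(paper):
            if related.title not in seen:
                queue.append(related)

    # sort chronologically, then by venue, for the final table
    return sorted(kept, key=lambda p: (p.year, p.venue))
```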

High citation counts and top venues were the main filters. Citation statistics for ICCV and CVPR over the years show that only a small fraction of papers break 100 citations, so 100 citations serves as a rough yardstick for "highly cited".

Of course, a search like this is bound to have omissions and mistakes, so I also maintain an awesome-style list, awesome_deep_learning_interpretability, as a versioned record.

Zhihu does not render markdown tables, so the GitHub version is easier to read.


In the end, 161 relevant papers made the cut. They are organized below, sorted by year and venue. If you only care about the highly cited ones, the GitHub repo also has a version sorted by citation count.

The list below will be updated from time to time.
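Since Captum came up in step 3 above, here is a minimal sketch of trying one of the methods in the table, Integrated Gradients ("Axiomatic attribution for deep networks", ICML 2017), through it. This is a sketch, not my pipeline: it assumes `torch`, `torchvision`, and `captum` are installed, and the model and random input tensor are placeholders for a real checkpoint and a normalized image.

```python
import torch
from torchvision import models
from captum.attr import IntegratedGradients

# Placeholder model and input; in practice load pretrained weights
# and a real, normalized image batch.
model = models.resnet18().eval()
inputs = torch.randn(1, 3, 224, 224, requires_grad=True)

# Attribute the predicted class back onto the input pixels.
target = model(inputs).argmax(dim=1)
ig = IntegratedGradients(model)
attributions, delta = ig.attribute(
    inputs,
    baselines=torch.zeros_like(inputs),  # black-image baseline
    target=target,
    return_convergence_delta=True,
)

print(attributions.shape)  # same shape as the input: (1, 3, 224, 224)
print(delta)               # convergence diagnostic; should be close to 0
```

Several other methods from the table (Grad-CAM, DeepLIFT, SHAP-style attributions, occlusion) have similarly shaped interfaces in Captum, which makes it convenient for side-by-side comparisons.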

|Year|Publication|Paper|Citation|Code|
|:---:|:---:|:---:|:---:|:---:|
|2020|ICLR|Knowledge Isomorphism between Neural Networks|0|
|2020|ICLR|Interpretable Complex-Valued Neural Networks for Privacy Protection|0|
|2019|AI|Explanation in artificial intelligence: Insights from the social sciences|380|
|2019|NMI|Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead|54|
|2019|NeurIPS|This looks like that: deep learning for interpretable image recognition|35|[Pytorch](cfchen-duke/ProtoPNet)|
|2019|NeurIPS|A benchmark for interpretability methods in deep neural networks (same as arXiv:1806.10758)|3|
|2019|NeurIPS|Full-gradient representation for neural network visualization|2|
|2019|NeurIPS|On the (In) fidelity and Sensitivity of Explanations|2|
|2019|NeurIPS|Towards Automatic Concept-based Explanations|1|[Tensorflow](amiratag/ACE)|
|2019|NeurIPS|CXPlain: Causal explanations for model interpretation under uncertainty|1|
|2019|CVPR|Interpreting CNNs via Decision Trees|49|
|2019|CVPR|From Recognition to Cognition: Visual Commonsense Reasoning|44|[Pytorch](rowanz/r2c)|
|2019|CVPR|Attention branch network: Learning of attention mechanism for visual explanation|14|
|2019|CVPR|Interpretable and fine-grained visual explanations for convolutional neural networks|8|
|2019|CVPR|Learning to Explain with Complemental Examples|6|
|2019|CVPR|Revealing Scenes by Inverting Structure from Motion Reconstructions|5|[Tensorflow](francescopittaluga/invsfm)|
|2019|CVPR|Multimodal Explanations by Predicting Counterfactuality in Videos|1|
|2019|CVPR|Visualizing the Resilience of Deep Convolutional Network Interpretations|1|
|2019|ICCV|U-CAM: Visual Explanation using Uncertainty based Class Activation Maps|6|
|2019|ICCV|Towards Interpretable Face Recognition|6|
|2019|ICCV|Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded|5|
|2019|ICCV|Understanding Deep Networks via Extremal Perturbations and Smooth Masks|2|[Pytorch](facebookresearch/TorchRay)|
|2019|ICCV|Explaining Neural Networks Semantically and Quantitatively|1|
|2019|ICLR|Hierarchical interpretations for neural network predictions|15|[Pytorch](csinva/hierarchical-dnn-interpretations)|
|2019|ICLR|How Important Is a Neuron?|10|
|2019|ICLR|Visual Explanation by Interpretation: Improving Visual Feedback Capabilities of Deep Neural Networks|7|
|2018|ICML|Extracting Automata from Recurrent Neural Networks Using Queries and Counterexamples|47|[Pytorch](tech-srl/lstar_extraction)|
|2019|ICML|Towards A Deep and Unified Understanding of Deep Neural Models in NLP|4|[Pytorch](icml2019paper2428/Towards-A-Deep-and-Unified-Understanding-of-Deep-Neural-Models-in-NLP)|
|2019|ICAIS|Interpreting black box predictions using fisher kernels|7|
|2019|ACMFAT|Explaining explanations in AI|54|
|2019|AAAI|Interpretation of neural networks is fragile|63|[Tensorflow](amiratag/InterpretationFragility)|
|2019|AAAI|Classifier-agnostic saliency map extraction|4|
|2019|AAAI|Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval|0|
|2019|AAAIW|Unsupervised Learning of Neural Networks to Explain Neural Networks|9|
|2019|AAAIW|Network Transplanting|4|
|2019|CSUR|A Survey of Methods for Explaining Black Box Models|344|
|2019|JVCIR|Interpretable convolutional neural networks via feedforward design|16|[Keras](davidsonic/Interpretable_CNNs_via_Feedforward_Design)|
|2019|ExplainAI|The (Un)reliability of saliency methods|95|
|2019|ACL|Attention is not Explanation|57|
|2019|arxiv|Attention Interpretability Across NLP Tasks|4|
|2019|arxiv|Interpretable CNNs|3|[Pytorch](oyzh888/ICNN)|
|2018|ICLR|Towards better understanding of gradient-based attribution methods for deep neural networks|123|
|2018|ICLR|Learning how to explain neural networks: PatternNet and PatternAttribution|90|
|2018|ICLR|On the importance of single directions for generalization|81|[Pytorch](1Konny/class_selectivity_index)|
|2018|ICLR|Detecting statistical interactions from neural network weights|30|[Pytorch](mtsang/neural-interaction-detection)|
|2018|ICLR|Interpretable counting for visual question answering|21|[Pytorch](sanyam5/irlc-vqa-counting)|
|2018|CVPR|Interpretable Convolutional Neural Networks|154|
|2018|CVPR|Tell me where to look: Guided attention inference network|81|[Chainer](alokwhitewolf/Guided-Attention-Inference-Network)|
|2018|CVPR|Multimodal Explanations: Justifying Decisions and Pointing to the Evidence|78|[Caffe](Seth-Park/MultimodalExplanations)|
|2018|CVPR|Transparency by design: Closing the gap between performance and interpretability in visual reasoning|54|[Pytorch](davidmascharka/tbd-nets)|
|2018|CVPR|Net2vec: Quantifying and explaining how concepts are encoded by filters in deep neural networks|39|
|2018|CVPR|What have we learned from deep representations for action recognition?|20|
|2018|CVPR|Learning to Act Properly: Predicting and Explaining Affordances from Images|17|
|2018|CVPR|Teaching Categories to Human Learners with Visual Explanations|13|[Pytorch](macaodha/explain_teach)|
|2018|CVPR|What do Deep Networks Like to See?|9|
|2018|CVPR|Interpret Neural Networks by Identifying Critical Data Routing Paths|5|[Tensorflow](lidongyue12138/CriticalPathPruning)|
|2018|ECCV|Deep clustering for unsupervised learning of visual features|167|[Pytorch](asanakoy/deep_clustering)|
|2018|ECCV|Explainable neural computation via stack neural module networks|40|[Tensorflow](ronghanghu/snmn)|
|2018|ECCV|Grounding visual explanations|38|
|2018|ECCV|Textual explanations for self-driving vehicles|30|
|2018|ECCV|Interpretable basis decomposition for visual explanation|26|[Pytorch](CSAILVision/IBD)|
|2018|ECCV|Convnets and imagenet beyond accuracy: Understanding mistakes and uncovering biases|17|
|2018|ECCV|Vqa-e: Explaining, elaborating, and enhancing your answers for visual questions|12|
|2018|ECCV|Choose Your Neuron: Incorporating Domain Knowledge through Neuron-Importance|8|[Pytorch](ramprs/neuron-importance-zsl)|
|2018|ECCV|Diverse feature visualizations reveal invariances in early layers of deep neural networks|5|[Tensorflow](sacadena/diverse_feature_vis)|
|2018|ECCV|ExplainGAN: Model Explanation via Decision Boundary Crossing Transformations|0|
|2018|ICML|Interpretability beyond feature attribution: Quantitative testing with concept activation vectors (tcav)|110|[Tensorflow](fursovia/tcav_nlp)|
|2018|ICML|Learning to explain: An information-theoretic perspective on model interpretation|72|
|2018|ACL|Did the Model Understand the Question?|34|[Tensorflow](pramodkaushik/acl18_results)|
|2018|FITEE|Visual interpretability for deep learning: a survey|140|
|2018|NeurIPS|Sanity Checks for Saliency Maps|122|
|2018|NeurIPS|Explanations based on the missing: Towards contrastive explanations with pertinent negatives|35|[Tensorflow](IBM/Contrastive-Explanation-Method)|
|2018|NeurIPS|Towards robust interpretability with self-explaining neural networks|27|[Pytorch](raj-shah/senn)|
|2018|NeurIPS|Attacks meet interpretability: Attribute-steered detection of adversarial samples|26|
|2018|NeurIPS Workshop|Interpretable Convolutional Filters with SincNet|17|
|2018|NeurIPS|DeepPINK: reproducible feature selection in deep neural networks|15|[Keras](younglululu/DeepPINK)|
|2018|NeurIPS|Representer point selection for explaining deep neural networks|11|[Tensorflow](chihkuanyeh/Representer_Point_Selection)|
|2018|AAAI|Anchors: High-precision model-agnostic explanations|200|
|2018|AAAI|Improving the adversarial robustness and interpretability of deep neural networks by regularizing their input gradients|112|[Tensorflow](dtak/adversarial-robustness-public)|
|2018|AAAI|Deep learning for case-based reasoning through prototypes: A neural network that explains its predictions|67|[Tensorflow](OscarcarLi/PrototypeDL)|
|2018|AAAI|Interpreting CNN Knowledge via an Explanatory Graph|54|[Matlab](zqs1022/explanatoryGraph)|
|2018|AAAI|Examining CNN Representations with respect to Dataset Bias|24|
|2018|WACV|Grad-cam++: Generalized gradient-based visual explanations for deep convolutional networks|85|
|2018|IJCV|Top-down neural attention by excitation backprop|256|
|2018|TPAMI|Interpreting deep visual representations via network dissection|56|
|2018|DSP|Methods for interpreting and understanding deep neural networks|469|
|2018|Access|Peeking inside the black-box: A survey on Explainable Artificial Intelligence (XAI)|131|
|2018|JAIR|Learning Explanatory Rules from Noisy Data|90|[Tensorflow](ai-systems/DILP-Core)|
|2018|MIPRO|Explainable artificial intelligence: A survey|54|
|2018|AIES|Detecting Bias in Black-Box Models Using Transparent Model Distillation|27|
|2018|BMVC|Rise: Randomized input sampling for explanation of black-box models|30|
|2018|arxiv|Manipulating and measuring model interpretability|73|
|2018|arxiv|How convolutional neural network see the world-A survey of convolutional neural network visualization methods|27|
|2018|arxiv|Revisiting the importance of individual units in cnns via ablation|25|
|2018|arxiv|Computationally Efficient Measures of Internal Neuron Importance|1|
|2017|ICML|Understanding Black-box Predictions via Influence Functions|517|[Pytorch](nimarb/pytorch_influence_functions)|
|2017|ICML|Axiomatic attribution for deep networks|448|[Keras](hiranumn/IntegratedGradients)|
|2017|ICML|Learning Important Features Through Propagating Activation Differences|383|
|2017|ICLR|Visualizing deep neural network decisions: Prediction difference analysis|212|[Caffe](lmzintgraf/DeepVis-PredDiff)|
|2017|ICLR|Exploring LOTS in Deep Neural Networks|26|
|2017|NeurIPS|A Unified Approach to Interpreting Model Predictions|591|
|2017|NeurIPS|Real time image saliency for black box classifiers|111|[Pytorch](karanchahal/SaliencyMapper)|
|2017|NeurIPS|SVCCA: Singular Vector Canonical Correlation Analysis for Deep Learning Dynamics and Interpretability|97|
|2017|CVPR|Mining Object Parts from CNNs via Active Question-Answering|15|
|2017|CVPR|Network dissection: Quantifying interpretability of deep visual representations|373|
|2017|CVPR|Improving Interpretability of Deep Neural Networks with Semantic Information|43|
|2017|CVPR|MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network|86|[Torch](zizhaozhang/mdnet-cvpr2017)|
|2017|CVPR|Interpretable 3d human action analysis with temporal convolutional networks|106|
|2017|CVPR|Making the V in VQA matter: Elevating the role of image understanding in Visual Question Answering|393|
|2017|CVPR|Knowing when to look: Adaptive attention via a visual sentinel for image captioning|458|[Torch](jiasenlu/AdaptiveAttention)|
|2017|ICCV|Grad-cam: Visual explanations from deep networks via gradient-based localization|1333|[Pytorch](leftthomas/GradCAM)|
|2017|ICCV|Interpretable Explanations of Black Boxes by Meaningful Perturbation|284|[Pytorch](jacobgil/pytorch-explain-black-box)|
|2017|ICCV|Interpretable Learning for Self-Driving Cars by Visualizing Causal Attention|80|
|2017|ICCV|Understanding and comparing deep neural networks for age and gender classification|39|
|2017|ICCV|Learning to disambiguate by asking discriminative questions|10|
|2017|IJCAI|Right for the right reasons: Training differentiable models by constraining their explanations|102|
|2017|IJCAI|Understanding and improving convolutional neural networks via concatenated rectified linear units|35|[Caffe](chakkritte/CReLU)|
|2017|AAAI|Growing Interpretable Part Graphs on ConvNets via Multi-Shot Learning|26|[Matlab](zqs1022/partGraphForCNN)|
|2017|ACL|Visualizing and Understanding Neural Machine Translation|56|
|2017|EMNLP|A causal framework for explaining the predictions of black-box sequence-to-sequence models|64|
|2017|CVPRW|Looking under the hood: Deep neural network visualization to interpret whole-slide image analysis outcomes for colorectal polyps|14|
|2017|survey|Interpretability of deep learning models: a survey of results|49|
|2017|arxiv|SmoothGrad: removing noise by adding noise|212|
|2017|arxiv|Interpretable & explorable approximations of black box models|68|
|2017|arxiv|Distilling a neural network into a soft decision tree|126|[Pytorch](kimhc6028/soft-decision-tree)|
|2017|arxiv|Towards interpretable deep neural networks by leveraging adversarial examples|44|
|2017|arxiv|Explainable artificial intelligence: Understanding, visualizing and interpreting deep learning models|210|
|2017|arxiv|Contextual Explanation Networks|28|[Pytorch](alshedivat/cen)|
|2017|arxiv|Challenges for transparency|69|
|2017|ACMSOPP|Deepxplore: Automated whitebox testing of deep learning systems|302|
|2017|CEURW|What does explainable AI really mean? A new conceptualization of perspectives|64|
|2017|TVCG|ActiVis: Visual Exploration of Industry-Scale Deep Neural Network Models|113|
|2016|NeurIPS|Synthesizing the preferred inputs for neurons in neural networks via deep generator networks|251|[Caffe](Evolving-AI-Lab/synthesizing)|
|2016|NeurIPS|Understanding the effective receptive field in deep convolutional neural networks|310|
|2016|CVPR|Inverting Visual Representations with Convolutional Networks|266|
|2016|CVPR|Visualizing and Understanding Deep Texture Representations|83|
|2016|CVPR|Analyzing Classifiers: Fisher Vectors and Deep Neural Networks|82|
|2016|ECCV|Generating Visual Explanations|224|[Caffe](LisaAnne/ECCV2016)|
|2016|ECCV|Design of kernels in convolutional neural networks for image classification|11|
|2016|ICML|Understanding and improving convolutional neural networks via concatenated rectified linear units|216|
|2016|ICML|Visualizing and comparing AlexNet and VGG using deconvolutional layers|28|
|2016|EMNLP|Rationalizing Neural Predictions|247|[Pytorch](zhaopku/Rationale-Torch)|
|2016|IJCV|Visualizing deep convolutional neural networks using natural pre-images|216|[Matlab](aravindhm/nnpreimage)|
|2016|IJCV|Visualizing Object Detection Features|22|[Caffe](cvondrick/ihog)|
|2016|KDD|Why should i trust you?: Explaining the predictions of any classifier|2255|
|2016|TVCG|Visualizing the hidden activity of artificial neural networks|122|
|2016|TVCG|Towards better analysis of deep convolutional neural networks|184|
|2016|NAACL|Visualizing and understanding neural models in nlp|269|[Torch](jiweil/Visualizing-and-Understanding-Neural-Models-in-NLP)|
|2016|arxiv|Understanding neural networks through representation erasure|137|
|2016|arxiv|Grad-CAM: Why did you say that?|87|
|2016|arxiv|Investigating the influence of noise and distractors on the interpretation of neural networks|24|
|2016|arxiv|Attentive Explanations: Justifying Decisions and Pointing to the Evidence|41|
|2016|arxiv|The Mythos of Model Interpretability|951|
|2016|arxiv|Multifaceted feature visualization: Uncovering the different types of features learned by each neuron in deep neural networks|130|
|2015|ICLR|Striving for Simplicity: The All Convolutional Net|1762|[Pytorch](StefOe/all-conv-pytorch)|
|2015|CVPR|Understanding deep image representations by inverting them|929|[Matlab](aravindhm/deep-goggle)|
|2015|ICCV|Understanding deep features with computer-generated imagery|94|[Caffe](mathieuaubry/features_analysis)|
|2015|ICMLW|Understanding Neural Networks Through Deep Visualization|974|[Tensorflow](jiye-ML/Visualizing-and-Understanding-Convolutional-Networks)|
|2015|AAS|Interpretable classifiers using rules and Bayesian analysis: Building a better stroke prediction model|304|
|2014|ECCV|Visualizing and Understanding Convolutional Networks|8009|[Pytorch](huybery/VisualizingCNN)|
|2014|ICLR|Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps|2014|[Pytorch](huanghao-code/VisCNN_ICLR_2014_Saliency)|

A new year, a fresh start: the work of 2020 begins on day one with this survey of model interpretability.
