2020.ICDM.LP-Explain: Local Pictorial Explanation for Outliers

paper

pdf

main idea

tries to identify the set of best Local Pictorial explanations (defined as the scatter plots in the 2-D space of the feature pairs) that can Explain the behavior for cluster of outliers.

Different from lookout in:
lookout chose top subspaces represent a compromise among all the outliers, thus they may not include the optimal subspace for each individual outlier.
Ours: cluster outliers, then explain the behavior for each cluster.

contribution

1、We propose a new pictorial explanation method to provide visualized descriptions for clusters of outliers.
2、We design an outlier clustering method specifically for our pictorial explanation task. The method first quantifies the relationship among feature pairs, then leverages a proposed rank similarity method to measure the distance between top feature pairs of outliers.
3、We formulate the feature pairs selection problem as a multi-task learning problem where a hyperparameter indicating the localization level is adopted to provide explanation towards individual cluster or all the outliers.
4、We conduct experiments on six public datasets and demonstrate the effectiveness of the proposed LP-Explain by the explanation performance.

method

1、define an effective measure to quantify the similarity between outliers, and then cluster outliers into different groups based on their abnormal feature pairs.
2、weigh the importance of feature pairs within each cluster through a
multi-task learning framework to select the set of top feature
pairs that best explain various outlier clusters.
在这里插入图片描述

step 1:X, SQ Generation

each row in X represents the anomaly score detected on feature pairs of outlier i. For example, the first in the first row means the anomaly score of outlier 1 in feature pair 1(i.e. fp1).
在这里插入图片描述
Then for each outlier, the scores are sorted to get feature pair sequence.
在这里插入图片描述

step 2:Outlier Clustering

Relation Quantification of Feature Pairs

construct a fully connected graph G =(V,E) to indicate feature pairs relationship:
在这里插入图片描述
compute the weight of the edge:
the edge weight between two feature pairs is large when most of the outliers obtain similar outlier scores in these two 2-D spaces.
在这里插入图片描述
To measure the structural similarity between two feature pairs, we need to learn a vector representation V =(v1, v2, …, vn) of each node in the graph G.
在这里插入图片描述

Similarity of Outliers

a rank similarity method to quantitatively measure the relation between two outliers according to their ranked feature pair sequences.

Spectral Clustering on Outliers

use Self-Tuning Spectral Clustering method to produce clusters C

Feature Pair Selection

experiment

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值