Pattern Evaluation

Pattern Evaluation

@(Pattern Discovery in Data Mining)
本文介绍了数据挖掘中模式挖掘,评估所得模式与规则科学性的方法。

Limitation of Support-Confidence Framework

Pattern-mining will generate a large set of patterns/rules. However, not all the generated patterns/rules are interesting.

The interestingness measures: Objective vs. subjective
* Objective interestingness measures
* Support, confidence, correlation, …
* Subjective interestingness measures: One man’s trash could be another man’s treasure
* Query-based: Relevant to a user’s particular request
* Against one’s knowledge-base: unexpected, freshness, timeliness
* Visualization tools: Multi-dimensional, interactive examination

An example of limitations:

Interesting Measures: Lift and χ2

  1. Lift

    • Measure of dependent/correlated events: lift

      lift(B,C)=c(BC)s(C)=s(BC)s(B)×s(C)

    • Lift(B, C) may tell how B and C are correlated

    • Lift(B, C) = 1: B and C are independent
    • > 1: positively correlated
    • < 1: negatively correlated

Example:


Thus, B and C are negatively correlated since list < 1; But B and ¬C are positively correlated since lift > 1.

  1. χ2

    • Measure to test correlated events

      χ2=ObservedExpectedExpected

    • General rules:

    • χ2=0 , independent
    • χ2>0 , correlated, either positive or negative. So it needs additional test

Example:

  1. Null transaction( ¬A¬B )
    • Notion: Lift and χ2 are not always good measures

Null Invariance Measures

  • Null Invariance: Value does not change with the number of null-transactions.
  • Why is null invariance crucial for the analysis of massive transaction data? Because Many transactions may contain neither milk nor coffee!

Comparison of Null-invariance Measures

Use Imbalanced Ratio to measure the imbalance of two itemsets A and B in rule implications.

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值