在数据挖掘中,如何进行有约束地挖掘,如何对待挖掘数据进行条件约束与筛选,是本文探讨的话题。
Why do we use constraint-based pattern mining? Because we’d like to apply different pruning methods to constrain pattern mining process.
And for those reasons:
- Finding all the patterns in a dataset autonomously? — unrealistic!
- Too many patterns but not necessarily user-interested!
- Pattern mining should be an interactive process
- User directs what to be mined using a data mining query language (or a graphical user interface)
- Constraint-based mining
- User flexibility: provides constraints on what to be mined
- Optimization: explores such constraints for efficient mining
- Constraint-based mining: Constraint-pushing, similar to push selection first in DB query processing
Constraints in General Data Mining
A data mining query can be in the form of a meta-rule or with the following language primitives
* Knowledge type constraint:
* Ex.: classification, association, clustering, outlier finding, ….
* Data constraint —