1.2 What is Data Mining?
Data mining is also called "knowledge discovery from data", or KDD. Others view data mining as just one essential step in the knowledge discovery process.
The knowledge discovery process consists of the following steps:
1. Data cleaning (to remove noise and inconsistent data)
2. Data integration (where multiple data sources may be combined)
3. Data selection (where data relevant to the analysis task are retrieved from the
database)
4. Data transformation (where data are transformed and consolidated into forms
appropriate for mining by performing summary or aggregation operations)
5. Data mining (an essential process where intelligent methods are applied to extract
data patterns)
6. Pattern evaluation (to identify the truly interesting patterns representing knowledge
based on interestingness measures—see Section 1.4.6)
7. Knowledge presentation (where visualization and knowledge representation techniques
are used to present mined knowledge to users)
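The seven steps above can be sketched as a small pipeline. This is only an illustration under stated assumptions, not a reference implementation: the toy purchase records, the function names, the 0.5 support threshold, and the use of simple item-pair counting as a stand-in for the "intelligent methods" of step 5 are all hypothetical choices made for the example.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: (customer, item) purchase records from two sources.
store_a = [("c1", "milk"), ("c1", "bread"), ("c2", "milk"),
           ("c2", None), ("c1", "milk")]  # one missing value, one duplicate
store_b = [("c2", "bread"), ("c3", "milk"), ("c3", "eggs"),
           ("c3", "pen"), ("c4", "bread"), ("c4", "milk")]


def clean(records):
    """Step 1: drop records with missing values and exact duplicates."""
    seen, out = set(), []
    for rec in records:
        if None in rec or rec in seen:
            continue
        seen.add(rec)
        out.append(rec)
    return out


def integrate(*sources):
    """Step 2: combine several data sources into one record list."""
    return [rec for src in sources for rec in src]


def select(records, relevant_items):
    """Step 3: keep only the records relevant to the analysis task."""
    return [rec for rec in records if rec[1] in relevant_items]


def transform(records):
    """Step 4: aggregate records into one set of items per customer."""
    baskets = {}
    for customer, item in records:
        baskets.setdefault(customer, set()).add(item)
    return baskets


def mine(baskets):
    """Step 5: count how many baskets contain each pair of items."""
    counts = Counter()
    for items in baskets.values():
        counts.update(combinations(sorted(items), 2))
    return counts


def evaluate(counts, n_baskets, min_support):
    """Step 6: keep only pairs whose support reaches the threshold."""
    return {pair: c / n_baskets
            for pair, c in counts.items()
            if c / n_baskets >= min_support}


def present(patterns):
    """Step 7: render the surviving patterns as readable text lines."""
    return [f"{a} & {b}: support {s:.2f}"
            for (a, b), s in sorted(patterns.items())]


def run_pipeline(sources, relevant_items, min_support):
    cleaned = [clean(src) for src in sources]
    merged = integrate(*cleaned)
    selected = select(merged, relevant_items)
    baskets = transform(selected)
    counts = mine(baskets)
    patterns = evaluate(counts, len(baskets), min_support)
    return patterns, present(patterns)


patterns, report = run_pipeline([store_a, store_b],
                                {"milk", "bread", "eggs"}, 0.5)
for line in report:
    print(line)
```

With this toy data, the pair (bread, milk) appears in three of the four customer baskets, so it passes the 0.5 threshold, while (eggs, milk) appears in only one and is filtered out during pattern evaluation.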
In industry, in the media, and in research, the term "data mining" is often used to refer to the entire knowledge discovery process rather than to the mining step alone. Taking this broad view, data mining is the process of discovering interesting patterns and knowledge from large amounts of data. The data sources can include databases, data warehouses, the Web, other information repositories, or data that are streamed into the system dynamically.