数据挖掘、数据、大数据

What is Data Mining

Discovery of useful, possibly unexpected, patterns in data.

Data Mining Process

(Describe the steps involved in data mining when viewed as a process of knowledge discovery.)

  1. Data Cleaning 数据清理(消除噪声或不一致数据)
  2. Data Integration数据集成(多种数据源可以组合在一起)
  3. Data Selection 数据选择(从数据库中检索与分析任务相关的数据)
  4. Data transformation数据变换(数据变换或统一成适合挖掘的形式)
  5. Data Mining Method 挖掘方法(使用各种方法提取数据模式)
  6. Pattern Assessment 模式评估(使用某种度量,识别真正有价值的模式)
  7. Knowledge Representation 知识表示(使用可视化和知识表示技术,向用户提供挖掘的知识)

What is Data

Definition

“Data are pieces of information that represent the qualitative or quantitative attributes of a variable or set of variables.

Data are often viewed as the lowest level of abstraction from which information and knowledge are derived.”

Data Types

  • Continuous
  • Discrete
  • Symbolic

Storage

  • Physical
  • Logical

Major Issues

  • Transformation
  • Errors and Corruption

What is Big Data?

  • “Big data is high-volume, high-velocity and high-variety information assets that demand cost-effective, innovative forms of information processing for enhanced insight and decision making.” — Gartner
    大数据是高容量,高速度,多变的信息资产,需要经济高效的创新形式的信息处理方式,以增强洞察力和决策能力。”

  • “Big data refers to datasets whose size is beyond the ability of typical database software tools to capture, store, manage, and analyze.” —
    Mckinsey & Company

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值