数据仓库发展趋势

多种数据类型

数据仓库中一般存储的是结构化数据,大部分为数值数据。从这个角度看,决策支持系统分为两类:结构化数据的数据仓库和非结构化数据的知识管理系统。非结构化数据包含非结构化文档、图像、视频、音频和空间数据等。

 

数据可视化

可视化趋势:更多图表类型;交互可视化;庞杂结果可视化。

可视化技术的最大变化是从静态的表格到动态交互表达方式的变化,主要表现为表格的可操作性;数据可以向下钻取;可以进行高级的互动。

 

并行处理

数据仓库是以用户为中心和密集查询的环境,如果不能处理好复杂、大型查询,则数据仓库毫无价值而言,数据仓库性能非常重要,这就要求具有较强的并行处理能力。并行处理能力有并行处理硬件和软件构成。

采用并行处理的优点:1、提高查询处理、数据装在和索引创建性能;2、规模可扩展,在不改变现有应用程序的条件下,允许增加CPU和内存模块;3、容错能力,在一些并行处理器出现错误时数据库仍然可用;

 

查询工具

(待补充)

 

浏览工

发展趋势如下:允许定义任何类型的数据或信息对象;内含开放的应用程序接口(API);提供几种类型的浏览功能,可以区别不同的用户群体;允许用户浏览数据字典或元数据,查找感兴趣的信息对象,并能进一步利用相关参数运行合适的查询工具;提供Web浏览和查找技术来浏览信息目录。

 

数据融合工具的发展

 

软件代理发展

 

企业数据组合

 

整合ERP和数据仓库

使用现成的ERP数据仓库;自行开发数据仓库;对ERP数据库使用第三方工具进行扩展。

 

数据仓库和知识管理的发展

 

建设能够进行客户关系管理的数据仓库

 

建立活跃数据仓库

一对一服务,这就是需要建设的活跃数据仓库提供的服务。

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论
Chapter 1 overviews open research problems concerning building data warehouses for integrating and analyzing various complex types of data, dealing with temporal aspects of data, handling imprecise data, and ensuring privacy in DWs. Chapter 2 discusses challenges in designing ETL processes for real-time (or near real-time) data warehouses and proposes an architecture of a real-time DW system. Chapter 3 discusses data warehouse modeling techniques, based on multidimensional modeling. In particular, the chapter covers conceptual, logical, and physical modeling. Chapter 4 proposes an approach to personalizing a multidimensional database. The authors present a model and a language that allow to define user preferences on schema elements. User preferences are expressed by means of weights associated with schema elements and they express user interest in data. Chapter 5 covers designing spatial (geographical) data warehouses. It proposes a metamodel for the support of the design of spatial dimensional schemas. Chapter 6 presents a technique for approximate answering range-sum queries on data cubes. To this end, the authors propose tree-based data structures for separately storing sampled data and outliers data. The proposed technique assures a good quality of approximate answers. Chapter 7 addresses a problem of summarizing multidimensional search spaces, called data cubes. The authors propose the concept of the so-called closed cube, which is a cover for a data cube. The authors show that the closed cube is smaller than its competitor, i.e. a quotient cube, and it can be used for deriving a quotient cube. Chapter 8 analyzes multiple index structures for multiversion data warehouses. In particular, the paper describes how to extend index structures designed for data with linear evolution in order to handle data with branched evolution and it provides an analytical model for comparing various index structures for multiversion DWs. Chapter 9 discusses the application of WAH compressed bitmap indexes to indexing text data for full-text search. The chapter also presents performance characteristics of the proposed indexing technique in three systems, namely MySQL, FastBit, and MonetDB. Chapter 10 proposes the optimization of OLAP queries by means of applying horizontal partitioning of tables and bitmap join indexes. The partitioning schema viii Preface and the set of bitmap indexes are selected by means of genetic and greedy algorithms. The proposed optimization techniques are validated experimentally. Chapter 11 discusses the application of the x-BR-tree index to spatial data. The chapter provides an analytical cost model of spatial queries with the support of the x-BR-tree index, followed by its experimental evaluation. Chapter 12 proposes a formal model for representing spatio-temporal and nonspatial data about moving objects. Based on the model, the authors propose a query language allowing to query such data. The main idea is based on replacing object trajectories by sequences of object stops and moves. Chapter 13 addresses the problem of data mining in a multidimensional space in a data warehouse. The authors propose a compact representation of sequential patterns, called closed multidimensional sequential patterns, which allows to reduce the search space. The proposed representation and mining algorithms are followed by experimental evaluation. Chapter 14 presents issues on modeling and querying temporal semistructured data warehouses. Such a DW is modeled as a graph with labeled nodes and edges. The temporal aspect is added to the graph by means of labels denoting validity times. The model is supported with a query language based on path expressions. Chapter 15 contributes a multidimensional data model of a data warehouse, called "galaxy", that supports the analysis of XML documents. The authors also propose a technique and a tool for integrating XML documents and loading them into a DW.

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值