[数据仓库工具箱][1].目录

Acknowledgments xv
Introduction xvii
Chapter 1 Dimensional Modeling Primer 1
Different Information Worlds 2
Goals of a Data Warehouse 2
The Publishing Metaphor 4
Components of a Data Warehouse 6
Operational Source Systems 7
Data Staging Area 8
Data Presentation 10
Data Access Tools 13
Additional Considerations 14
Dimensional Modeling Vocabulary 16
Fact Table 16
Dimension Tables 19
Bringing Together Facts and Dimensions 21
Dimensional Modeling Myths 24
Common Pitfalls to Avoid 26
Summary 27
Chapter 2 Retail Sales 29
Four-Step Dimensional Design Process 30
Retail Case Study 32
Step 1. Select the Business Process 33
Step 2. Declare the Grain 34
Step 3. Choose the Dimensions 35
Step 4. Identify the Facts 36
Dimension Table Attributes 38
Date Dimension 38
Product Dimension 42
Store Dimension 45
Promotion Dimension 46
Degenerate Transaction Number Dimension 50
Retail Schema in Action 51
Retail Schema Extensibility 52
Resisting Comfort Zone Urges 54
Dimension Normalization (Snowflaking) 55
Too Many Dimensions 57
Surrogate Keys 58
Market Basket Analysis 62
Summary 65
Chapter 3 Inventory 67
Introduction to the Value Chain 68
Inventory Models 69
Inventory Periodic Snapshot 69
Inventory Transactions 74
Inventory Accumulating Snapshot 75
Value Chain Integration 76
Data Warehouse Bus Architecture 78
Data Warehouse Bus Matrix 79
Conformed Dimensions 82
Conformed Facts 87
Summary 88
Chapter 4 Procurement 89
Procurement Case Study 89
Procurement Transactions 90
Multiple- versus Single-Transaction Fact Tables 91
Complementary Procurement Snapshot 93
vi C o n t e n t s
Slowly Changing Dimensions 95
Type 1: Overwrite the Value 95
Type 2: Add a Dimension Row 97
Type 3: Add a Dimension Column 100
Hybrid Slowly Changing Dimension Techniques 102
Predictable Changes with Multiple Version Overlays 102
Unpredictable Changes with Single Version Overlay 103
More Rapidly Changing Dimensions 105
Summary 105
Chapter 5 Order Management 107
Introduction to Order Management 108
Order Transactions 109
Fact Normalization 109
Dimension Role-Playing 110
Product Dimension Revisited 111
Customer Ship-To Dimension 113
Deal Dimension 116
Degenerate Dimension for Order Number 117
Junk Dimensions 117
Multiple Currencies 119
Header and Line Item Facts with Different Granularity 121
Invoice Transactions 122
Profit and Loss Facts 124
Profitability—The Most Powerful Data Mart 126
Profitability Words of Warning 127
Customer Satisfaction Facts 127
Accumulating Snapshot for the Order Fulfillment Pipeline 128
Lag Calculations 130
Multiple Units of Measure 130
Beyond the Rear-View Mirror 132
Fact Table Comparison 132
Transaction Fact Tables 133
Periodic Snapshot Fact Tables 134
Accumulating Snapshot Fact Tables 134
Contents vii
Designing Real-Time Partitions 135
Requirements for the Real-Time Partition 136
Transaction Grain Real-Time Partition 136
Periodic Snapshot Real-Time Partition 137
Accumulating Snapshot Real-Time Partition 138
Summary 139
Chapter 6 Customer Relationship Management 141
CRM Overview 142
Operational and Analytical CRM 143
Packaged CRM 145
Customer Dimension 146
Name and Address Parsing 147
Other Common Customer Attributes 150
Dimension Outriggers for a Low-Cardinality Attribute Set 153
Large Changing Customer Dimensions 154
Implications of Type 2 Customer Dimension Changes 159
Customer Behavior Study Groups 160
Commercial Customer Hierarchies 161
Combining Multiple Sources of Customer Data 168
Analyzing Customer Data from Multiple Business Processes 169
Summary 170
Chapter 7 Accounting 173
Accounting Case Study 174
General Ledger Data 175
General Ledger Periodic Snapshot 175
General Ledger Journal Transactions 177
Financial Statements 180
Budgeting Process 180
Consolidated Fact Tables 184
Role of OLAP and Packaged Analytic Solutions 185
Summary 186
viii C o n t e n t s
Chapter 8 Human Resources Management 187
Time-Stamped Transaction Tracking in a Dimension 188
Time-Stamped Dimension with Periodic Snapshot Facts 191
Audit Dimension 193
Keyword Outrigger Dimension 194
AND/OR Dilemma 195
Searching for Substrings 196
Survey Questionnaire Data 197
Summary 198
Chapter 9 Financial Services 199
Banking Case Study 200
Dimension Triage 200
Household Dimension 204
Multivalued Dimensions 205
Minidimensions Revisited 206
Arbitrary Value Banding of Facts 207
Point-in-Time Balances 208
Heterogeneous Product Schemas 210
Heterogeneous Products with Transaction Facts 215
Summary 215
Chapter 10 Telecommunications and Utilities 217
Telecommunications Case Study 218
General Design Review Considerations 220
Granularity 220
Date Dimension 222
Degenerate Dimensions 222
Dimension Decodes and Descriptions 222
Surrogate Keys 223
Too Many (or Too Few) Dimensions 223
Draft Design Exercise Discussion 223
Geographic Location Dimension 226
Location Outrigger 226
Leveraging Geographic Information Systems 227
Summary 227
Contents ix
Chapter 11 Transportation 229
Airline Frequent Flyer Case Study 230
Multiple Fact Table Granularities 230
Linking Segments into Trips 233
Extensions to Other Industries 234
Cargo Shipper 234
Travel Services 235
Combining Small Dimensions into a Superdimension 236
Class of Service 236
Origin and Destination 237
More Date and Time Considerations 239
Country-Specific Calendars 239
Time of Day as a Dimension or Fact 240
Date and Time in Multiple Time Zones 240
Summary 241
Chapter 12 Education 243
University Case Study 244
Accumulating Snapshot for Admissions Tracking 244
Factless Fact Tables 246
Student Registration Events 247
Facilities Utilization Coverage 249
Student Attendance Events 250
Other Areas of Analytic Interest 253
Summary 254
Chapter 13 Health Care 255
Health Care Value Circle 256
Health Care Bill 258
Roles Played By the Date Dimension 261
Multivalued Diagnosis Dimension 262
Extending a Billing Fact Table to Show Profitability 265
Dimensions for Billed Hospital Stays 266
x C o n t e n t s
TEAMFLY
Complex Health Care Events 267
Medical Records 269
Fact Dimension for Sparse Facts 269
Going Back in Time 271
Late-Arriving Fact Rows 271
Late-Arriving Dimension Rows 273
Summary 274
Chapter 14 Electronic Commerce 277
Web Client-Server Interactions Tutorial 278
Why the Clickstream Is Not Just Another Data Source 281
Challenges of Tracking with Clickstream Data 282
Specific Dimensions for the Clickstream 287
Clickstream Fact Table for Complete Sessions 292
Clickstream Fact Table for Individual Page Events 295
Aggregate Clickstream Fact Tables 298
Integrating the Clickstream Data Mart into the
Enterprise Data Warehouse 299
Electronic Commerce Profitability Data Mart 300
Summary 303
Chapter 15 Insurance 305
Insurance Case Study 306
Insurance Value Chain 307
Draft Insurance Bus Matrix 309
Policy Transactions 309
Dimension Details and Techniques 310
Alternative (or Complementary) Policy
Accumulating Snapshot 315
Policy Periodic Snapshot 316
Conformed Dimensions 316
Conformed Facts 316
Heterogeneous Products Again 318
Multivalued Dimensions Again 318
Contents xi
More Insurance Case Study Background 319
Updated Insurance Bus Matrix 320
Claims Transactions 322
Claims Accumulating Snapshot 323
Policy/Claims Consolidated Snapshot 324
Factless Accident Events 325
Common Dimensional Modeling Mistakes to Avoid 326
Summary 330
Chapter 16 Building the Data Warehouse 331
Business Dimensional Lifecycle Road Map 332
Road Map Major Points of Interest 333
Project Planning and Management 334
Assessing Readiness 334
Scoping 336
Justification 336
Staffing 337
Developing and Maintaining the Project Plan 339
Business Requirements Definition 340
Requirements Preplanning 341
Collecting the Business Requirements 343
Postcollection Documentation and Follow-up 345
Lifecycle Technology Track 347
Technical Architecture Design 348
Eight-Step Process for Creating the Technical Architecture 348
Product Selection and Installation 351
Lifecycle Data Track 353
Dimensional Modeling 353
Physical Design 355
Aggregation Strategy 356
Initial Indexing Strategy 357
Data Staging Design and Development 358
Dimension Table Staging 358
Fact Table Staging 361
xii C o n t e n t s
Lifecycle Analytic Applications Track 362
Analytic Application Specification 363
Analytic Application Development 363
Deployment 364
Maintenance and Growth 365
Common Data Warehousing Mistakes to Avoid 366
Summary 369
Chapter 17 Present Imperatives and Future Outlook 371
Ongoing Technology Advances 372
Political Forces Demanding Security and Affecting Privacy 375
Conflict between Beneficial Uses and Insidious Abuses 375
Who Owns Your Personal Data? 376
What Is Likely to Happen? Watching the Watchers . . . 377
How Watching the Watchers Affects Data
Warehouse Architecture 378
Designing to Avoid Catastrophic Failure 379
Catastrophic Failures 380
Countering Catastrophic Failures 380
Intellectual Property and Fair Use 383
Cultural Trends in Data Warehousing 383
Managing by the Numbers
across the Enterprise 383
Increased Reliance on Sophisticated Key
Performance Indicators 384
Behavior Is the New Marquee Application 385
Packaged Applications Have Hit Their High Point 385
Application Integration Has to Be Done by Someone 386
Data Warehouse Outsourcing Needs a Sober Risk Assessment 386
In Closing 387
Glossary 389
Index 419
Contents xii

--翻译

--致谢XV
引进XVII
第1章三维建模入门1
不同的信息世界2
数据仓库的目标2
出版隐喻4
数据仓库的组成部分6
运作的源系统7
第8区的数据分段
数据显示10
数据访问工具13
14其他注意事项
三维建模词汇16
事实数据表16
维度表19
汇集事实和维度21
三维建模的神话24
常见的陷阱,以避免26
总结27
第2章29零售
四步三维设计过程30
零售业为例32
第1步。选择业务流程33
第2步。申报的粮食34
第3步。选择尺寸35
第4步。查明的事实36
维度表属性38
日期维度38
产品尺寸42
Store维度45
推广尺寸46
退化成交宗数尺寸50
零售模式在行动51
零售模式扩展52
抗舒适区敦促54
维正常化(Snowflaking)的55
太多的尺寸57
代理键58
市场篮分析62
总结65
第3章库存67
价值链68
存货模型69
库存周期快照69
库存交易74
库存累积快照75
价值链整合76
数据仓库总线结构78
数据仓库总线矩阵79
82个符合尺寸
87个符合事实
总结88
第四章采购89
采购为例89
采购交易90
多与单交易事实表91
互补性采购快照93
6 C O N T E N T S
慢慢地改变尺寸95
类型1:覆盖的价值95
类型2:添加维度行97
类型3:添加维度列100
混合渐变尺寸技术102
与102多个版本覆盖可预见的变化
变化莫测,单版本覆盖103
更迅速地改变尺寸105
摘要105
第5章订单管理107
订单管理108简介
订单交易109
事实上正常化109
尺寸角色扮演110
产品尺寸再访111
顾客船维度113
处理尺寸116
退化的订单编号117的尺寸
垃圾尺寸117
119多币种
头和行项目事实与不同粒度121
122发票交易
损益事实124
盈利能力的最强大的数据沃尔玛126
盈利警告127词
顾客满意的事实127
积累的订单履行管道128快照
滞计算130
测量130多个单位
超出后视镜132
事实表比较132
交易情况统计表133
定期快照事实表134
累积快照事实表134
目录vii
设计实时分区135
要求实时分区136
实时交易粮食分区136
定期快照的实时分区137
累积快照的实时分区138
摘要139
第6章客户关系管理141
CRM概述142
业务和分析型CRM 143
打包的CRM 145
客户维度146
名称及地址解析147
其他普通客户属性150
尺寸支腿为低基数属性设置153
大客户不断变化的尺寸154
2型客户尺寸变化159的启示
客户行为的研究组160
商业客户层次161
结合168多个客户资料来源
来自169多个业务流程分析客户数据
摘要170
第7章会计173
会计案例研究174
总帐数据175
总帐定期快照175
总帐管理系统 - 交易177
180财务报表
预算编制过程180
综合情况表184
OLAP和包装185解析解中的作用
摘要186
VIII C O N T E N T S
第8章人力资源管理187
时间戳记在尺寸188交易跟踪
时间戳记的尺寸与周期快照事实191
审计尺寸193
关键字支腿尺寸194
和/或困境195
寻找子串196
调查问卷的数据197
摘要198
第9章199金融服务
银行为例200
维分流200
家用尺寸204
多值尺寸205
minidimensions再访206
捆扎事实207的任意值
在时间点余额208
异构产品架构210
与交易事实的异质产品215
摘要215
第10章电讯和公用事业217
电信案例研究218
一般设计审查注意事项220
粒度220
日期维度222
退化尺寸222
维解码和说明222
代理键223
太多(或太少)尺寸223
草案设计工作的探讨223
地理位置尺寸226
地点支腿226
利用地理信息系统227
总结227
内容IX
第11章运输229
航空公司飞行常客为例230
多个事实表粒度230
链接到旅游233段
向其他行业扩展234
货物托运人234
旅行服务235
小尺寸结合成一个Superdimension 236
一流的服务236
原产地和目的地237
更多的日期和时间的思考239
特定国家的日历239
作为一个维度或事实240天的时间
在240多个时区的日期和时间
摘要241
第12章教育243
大学的案例研究244
积累了跟踪244招生快照
factless事实表246
学生注册事件247
设施利用覆盖249
学生出席活动250
分析利息253其他地区
摘要254
第13章卫生保健255
保健价值256圈
保健条例草案258
日期261尺寸所扮演的角色
多值诊断尺寸262
扩展记帐事实表显示盈利265
嘴医院停留的尺寸266
x C O N T E N T S
TEAMFLY
复杂的医疗保健活动267
医疗记录269
事实上尺寸为269稀疏的事实
回去时271
迟到的事实行271
迟到维度行273
摘要274
第14章电子商业277
Web客户端 - 服务器交互教程278
为什么点击流是不是只是另一个数据源281
跟踪282点击流数据的挑战
点击流287的具体尺寸
292完整会话的点击流事实表
单独的页面事件的点击流事实表295
聚集点击流事实表298
点击流数据集市纳入
企业数据仓库299
电子商业盈利数据集市300
摘要303
第15章保险305
保险案例研究306
保险价值链307
草案保险总线矩阵309
政策交易309
详细尺寸和技术310
替代(或补充)的政策
累积快照315
政策周期快照316
316符合尺寸
316符合事实
异质产品再次318
同样318多值尺寸
内容第十一
更多保险为例背景319
更新的保险总线矩阵320
债权交易322
索赔累积快照323
政策/索赔合并快照324
factless事故事件325
常见的三维建模错误,避免326
摘要330
第16章建立数据仓库331
业务维生命周期路线图332
路线图的主要景点333点
项目规划和管理334
评估准备334
作用域336
理由336
人员编制337
开发和维护项目计划339
业务需求定义340
要求预先计划341
收集业务需求343
postcollection文件和后续行动345
生命周期技术跟踪347
技术架构设计348
八步流程创建的技术体系结构348
产品选型和安装351
生命周期的数据轨道353
三维建模353
物理设计355
聚合策略356
初始索引策略357
数据暂存的设计与开发358
尺寸表分期358
事实表分期361
第十二届C O N T E N T S
生命周期分析应用程序跟踪362
分析应用规范363
分析应用开发363
部署364
维护和增长365
常见的数据仓库避免366错误
摘要369
第17章现状的紧迫性和未来展望371
持续的技术进展372
政治势力苛刻的安全和私隐有影响的375
有益用途,阴险的滥用和375之间的冲突
谁拥有你的个人资料? 376
什么是有可能发生吗?看着看守。 377
如何看看守影响数据
仓库建筑378
设计以避免灾难性的失败379
380灾难性故障
打击灾难性故障380
知识产权和合理使用383
在数据仓库383文化思潮
由数字管理
整个企业383
更多地依赖于复杂的重点
性能指标384
行为是新的跑马灯应用385
打包的应用程序都击中了他们的高点385
应用集成,必须完成386人
数据仓库外包需要一个清醒的风险评估386
在关闭387
词汇表389
指数419
内容第十三

转载于:https://www.cnblogs.com/sanpoye/archive/2012/04/28/2475272.html

  • 0
    点赞
  • 0
    收藏
    觉得还不错? 一键收藏
  • 0
    评论

“相关推荐”对你有帮助么?

  • 非常没帮助
  • 没帮助
  • 一般
  • 有帮助
  • 非常有帮助
提交
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值