研究新得
hjy3789759
这个作者很懒,什么都没留下…
展开
-
Optimization of ETL Execution by pipelining method(ETL执行的流水线优化)
摘要:ETL工具是构建和维护数据仓库的基本构件,由于它处理的是海量数据,如何有效地加快响应时间成为值得研究的问题。本文提出了ETL过程的“主表衍生”模式,并针对这种模式采用流水线算法来提高并行性从而加快ETL过程的响应时间,理论分析和实验表明具有好的效果。原创 2005-06-21 20:24:00 · 749 阅读 · 0 评论 -
Automatic Accuracy Assessment via Hashing in Multiple-Source Environment
Accuracy is a most important data quality dimension and its assessment is a key issue in data management. Most of current studies focus on how to qualitatively analyze accuracy dimension and the a原创 2009-09-09 07:39:00 · 667 阅读 · 0 评论 -
Simulated Annealing
The book is published by IT-TECH, whichis on the theory and application of simulated annealing. ISBN: 978-953-7619-07-7. (David Jiang, Jingyu Han finishes the chapter of"Real time multiagent decision原创 2008-11-21 08:47:00 · 857 阅读 · 0 评论 -
Thought of Participation in DASFAA 2008
Share the ideas, get a full understanding; Follow the master, see through the truth!原创 2008-03-27 20:03:00 · 1318 阅读 · 0 评论 -
基于网络受限移动对象数据库的交通流统计分析模型
原创 2008-01-01 12:54:00 · 1625 阅读 · 1 评论 -
数据质量研究综述
http://www.paper.edu.cn/paper.php?serial_number=200701-103原创 2007-01-20 22:09:00 · 1650 阅读 · 0 评论 -
to follow
to follow the path:(沿着这样一条道路:)look to the master,(寻找大师,) follow the master,(跟随大师,) walk with the master,(与大师通行,) see through the master,(洞察大师,) become the master.(成为大师。)原创 2006-04-21 22:05:00 · 815 阅读 · 0 评论 -
A Multiple-Depth Structural Index for Branching Query (http://dx.doi.org/10.1016/j.infsof.2005.12.003)
information & software technology , Elsevier Science,Volume 48,Issue 9,september 2006原创 2005-12-29 11:26:00 · 742 阅读 · 0 评论 -
一种大数据量的相似记录检测算法
to process the duplicate records !原创 2005-12-15 18:20:00 · 735 阅读 · 0 评论