BigData
文章平均质量分 76
Jack_F
唉生活唉社交
展开
-
分布式系统领域经典论文翻译集
http://duanple.blog.163.com/blog/static/709717672011330101333271/转载 2013-07-10 09:32:02 · 997 阅读 · 0 评论 -
data-intensive text processing with mapreduce-Inverted Indexing for Text Retrieval
Inverted Indexing for Text Retrieval原创 2013-11-16 20:19:33 · 1174 阅读 · 0 评论 -
data-intensive text processing with mapreduce-Graph Algorithms
Graph Algorithms原创 2013-11-16 20:20:31 · 3098 阅读 · 0 评论 -
data-intensive text processing with mapreduce-EM Algorithms for Text Processing
EM Algorithms for Text Processing原创 2013-11-16 20:21:10 · 1010 阅读 · 0 评论 -
Hadoop 二次排序 Secondary Sort
转自:http://blog.csdn.net/heyutao007/article/details/5890103mr自带的例子中的源码SecondarySort,我重新写了一下,基本没变。这个例子中定义的map和reduce如下,关键是它对输入输出类型的定义:(java泛型编程) public static class Map extends Mapper publ转载 2013-10-10 00:04:47 · 5560 阅读 · 2 评论 -
MapReduce Design Patterns-chapter 7
CHAPTER 7:Input and Output PatternsCustomizing Input and Output in HadoopHadoop allows you to modify the way data is loaded on disk in two major ways: configuring how contiguous chunks of input ar原创 2013-09-25 23:30:42 · 1090 阅读 · 0 评论 -
MapReduce Design Patterns-chapter 6
CHAPTER 6:Metapatterns**Oozie**# Job Chaining #CombineFileInputFormat takessmaller blocks and lumps them together to make a larger input splitbefore being processed by the mapper.You原创 2013-09-25 09:17:24 · 1491 阅读 · 0 评论 -
MapReduce Design Patterns-chapter 5
CHAPTER 5:Join PatternsA Refresher on JoinsINNER JOINWith this type of join, records from both A and B that contain identical values for a given foreign key f are brought together, such that all原创 2013-09-23 18:38:44 · 963 阅读 · 0 评论 -
MapReduce Design Patterns-chapter 4
CHAPTER 4:Data Organization PatternsStructured to HierarchicalProblem: Given a list of posts and comments, create a structured XML hierarchy to nest comments with their related post.public原创 2013-09-22 17:08:42 · 1697 阅读 · 0 评论 -
MapReduce Design Patterns-chapter 3
CHAPTER 3:Filtering PatternsThere are a couple of reasons why map-only jobs are efficient.• Since no reducers are needed, data never has to be transmitted between the mapand reduce phase. Most o原创 2013-09-22 10:02:00 · 1385 阅读 · 0 评论 -
[Hadoop源码解读](一)MapReduce篇之InputFormat
http://blog.csdn.net/posa88/article/details/7897963目录(?)[-]InputSplitInputFormatFileInputFormatTextInputFormatNLineInputFormat 平时我们写MapReduce程序的时候,在设置输入格式的时候转载 2013-09-21 10:08:21 · 1067 阅读 · 0 评论 -
MapReduce Design Patterns-chapter 2
啊原创 2013-09-21 09:56:28 · 1350 阅读 · 1 评论 -
找工作面试备忘录
Data StructureJava1.Java HashMap的工作原理2.Java应用程序中的内存泄漏及内存管理3.Java垃圾回收精粹Hadoop原创 2014-04-08 16:54:33 · 1511 阅读 · 0 评论