Storm
ylzhjlinux
这个作者很懒,什么都没留下…
展开
-
Strom: mongdb spout /bolt trending topics
Referenceshttps://github.com/nathanmarz/storm-contribhttp://eugenedvorkin.com/implementing-top-10-most-popular-articles-in-real-time-with-storm-and-mongodb/https://github.com...原创 2015-01-04 18:57:11 · 77 阅读 · 0 评论 -
Storm: DRPC
https://storm.apache.org/documentation/Distributed-RPC.htmlhttps://github.com/mithunsatheesh/php-drpchttp://php.sabscape.com/blog/?p=482原创 2015-03-17 17:35:16 · 145 阅读 · 0 评论 -
Strom: Trident Fields and tuples
https://storm.apache.org/documentation/Trident-tutorial.html The Trident data model is the TridentTuple which is a named list of values. During a topology, tuples are incrementally built up throu...原创 2015-04-28 10:14:54 · 83 阅读 · 0 评论 -
Strom: Trident-ML realtime ML
https://github.com/pmerienne/trident-ml原创 2015-05-26 14:25:49 · 116 阅读 · 0 评论 -
Storm: Introduction 1
https://storm.apache.org/documentation/Tutorial.htmlComponents of a Storm cluster A Storm cluster is superficially similar to a Hadoop cluster. Whereas on Hadoop you run "MapReduce jobs", on S...原创 2014-12-01 11:46:28 · 90 阅读 · 0 评论 -
Storm: compile storm source code and run storm-starter
https://github.com/apache/storm/tree/master/examples/storm-starter 1 down load the source codes# git clone git://github.com/apache/storm.git2 Build and install Storm jars locally# mvn cle...原创 2014-12-01 15:49:07 · 135 阅读 · 0 评论 -
What makes a running topology: worker processes, executors and tasks
https://storm.apache.org/documentation/Understanding-the-parallelism-of-a-Storm-topology.html Storm distinguishes between the following three main entities that are used to actually run a topolog...原创 2014-12-01 17:10:45 · 66 阅读 · 0 评论 -
concepts of Storm
https://storm.apache.org/documentation/Concepts.html This page lists the main concepts of Storm and links to resources where you can find more information. The concepts discussed are:Topologie...原创 2014-12-01 17:19:24 · 92 阅读 · 0 评论 -
Storm integrate with kafka and hdfs
https://storm.apache.org/2014/11/25/storm093-released.html https://github.com/apache/storm/blob/v0.9.3/external/storm-kafka/README.mdhttps://github.com/apache/storm/tree/v0.9.3/external/storm-h...原创 2014-12-02 09:56:01 · 78 阅读 · 0 评论 -
storm integrate with mysql
https://github.com/wilbinsc/storm-mysql原创 2014-12-02 13:17:21 · 73 阅读 · 0 评论 -
Big Data Counting: How to count a billion distinct objects using only 1.5KB of
http://highscalability.com/blog/2012/4/5/big-data-counting-how-to-count-a-billion-distinct-objects-us.html This is a guest post by Matt Abrams (@abramsm), from Clearspring, discussing how they a...原创 2015-01-27 10:13:49 · 141 阅读 · 0 评论 -
Streaming algorithm
http://en.wikipedia.org/wiki/Streaming_algorithm原创 2015-01-26 16:26:02 · 502 阅读 · 0 评论 -
Count-Min sketch
“Sketching” data structures store a summary of a data set in situations where the whole data would be prohibitively costly to store (at least in a fast-access place like the memory as opposed to ...原创 2015-01-26 14:30:20 · 990 阅读 · 0 评论 -
Implementing Real-Time Trending Topics With a Distributed Rolling Count Algorith
http://www.michael-noll.com/blog/2013/01/18/implementing-real-time-trending-topics-in-storm/ A common pattern in real-time data workflows is performing rolling counts of incoming data points, al...原创 2015-01-04 20:09:07 · 492 阅读 · 0 评论 -
Storm: related high qulity posts
http://www.drdobbs.com/go-parallel/article/print?articleId=240143874&siteSectionName=原创 2015-01-04 20:34:37 · 194 阅读 · 0 评论 -
storm: storm-kafka spout
package inok.storm.kafka.sample;import java.io.FileInputStream;import java.io.IOException;import java.util.Arrays;import java.util.HashMap;import java.util.Iterator;import jav...原创 2015-01-09 20:49:00 · 131 阅读 · 0 评论 -
Storm based realtime recommendation algorithm
Referenceshttp://www.wentrue.net/blog/?p=1181http://blog.csdn.net/huilixiang/article/details/38441203 [1] Davidson, J. and Liebald, B. and Liu, J. The YouTu...原创 2015-01-11 13:34:19 · 91 阅读 · 0 评论 -
Transactional Topologies
https://storm.apache.org/documentation/Transactional-topologies.html NOTE: Transactional topologies have been deprecated -- use the Trident framework instead. Storm guarantees data processing...原创 2015-01-12 16:28:30 · 88 阅读 · 0 评论 -
Trident Tutorial
Trident is a high-level abstraction for doing realtime computing on top of Storm. It allows you to seamlessly intermix high throughput (millions of messages per second), stateful stream processing wi...原创 2015-01-12 16:30:54 · 123 阅读 · 0 评论 -
Guaranteeing Message Processing
Storm guarantees that each message coming off a spout will be fully processed. This page describes how Storm accomplishes this guarantee and what you have to do as a user to benefit from Storm's reli...原创 2015-01-16 15:22:08 · 118 阅读 · 0 评论 -
Content based and collaborative filtering based recommendation and personalizati
Referenceshttps://github.com/pranab/sifarish原创 2015-01-21 15:53:59 · 156 阅读 · 0 评论 -
Setting up a Storm Cluster
https://storm.apache.org/documentation/Setting-up-a-Storm-cluster.htmlThis page outlines the steps for getting a Storm cluster up and running. If you're on AWS, you should check out the storm-deplo...原创 2015-01-22 14:44:12 · 94 阅读 · 0 评论 -
Storm: books
Big Data Analytics Beyond Hadoop: Real-Time Applications with Storm, Spark, and More Hadoop Alternatives Storm Applied: Strategies for Real-time Event Processing Storm源码分析 Storm Real-T...原创 2014-12-09 16:09:28 · 96 阅读 · 0 评论