第1关：函数、过滤器、地图和平面地图、窥视操作-CSDN博客

本文链接：https://blog.csdn.net/m0_62103032/article/details/134573065

任务描述

本关任务：使用 Storm Trident 完成一个过滤器的实现。

编程要求

根据提示，在右侧编辑器补充代码，编写一个过滤器的操作和实现，要求过滤出 tuple 中值为 the 的 tuple 。

测试说明

平台会对你编写的代码进行测试：

预计输入：

the cow jumped over the moon
the man went to the store and bought some candy
four score and seven years ago
how many apples can you eat

预计输出：

emit word is :the
Filter word is:the and return type is:true
emit word is :cow
emit word is :jumped
emit word is :over
emit word is :the
Filter word is:the and return type is:true
emit word is :moon
emit word is :the
Filter word is:the and return type is:true
emit word is :man
emit word is :went
emit word is :to
emit word is :the
Filter word is:the and return type is:true
emit word is :store
emit word is :and
emit word is :bought
emit word is :some
emit word is :candy
emit word is :four
emit word is :score
emit word is :and
emit word is :seven
emit word is :years
emit word is :ago
emit word is :how
emit word is :many
emit word is :apples
emit word is :can
emit word is :you
emit word is :eat

代码如下：

import org.apache.storm.Config;

import org.apache.storm.LocalCluster;

import org.apache.storm.generated.StormTopology;

import org.apache.storm.trident.TridentTopology;

import org.apache.storm.trident.operation.BaseFilter;

import org.apache.storm.trident.operation.BaseFunction;

import org.apache.storm.trident.operation.TridentCollector;

import org.apache.storm.trident.testing.FixedBatchSpout;

import org.apache.storm.trident.tuple.TridentTuple;

import org.apache.storm.tuple.Fields;

import org.apache.storm.tuple.Values;

/*  部分输出

emit word is :the

Filter word is:the  and return type is:true

emit word is :cow

emit word is :jumped

emit word is :over

emit word is :the

Filter word is:the  and return type is:true

emit word is :moon

* */

public class FunctionFilter {

    public static void main(String[] args){

        TridentTopology topology = new TridentTopology();

        FixedBatchSpout spout = new FixedBatchSpout(new Fields("sentence"), 1,

                new Values("the cow jumped over the moon"),

                new Values("the man went to the store and bought some candy"),

                new Values("four score and seven years ago"),

                new Values("how many apples can you eat"));

        //不循环发送数据

        spout.setCycle(false);



        //****请根据提示补全Topology程序****//

        /*********begin*********/



        topology.newStream("spout1", spout)

        //topology的newStream 方法从输入源中读取数据, 并在 topology 中创建一个新的数据流 batch-spout

        .each(new Fields("sentence"), new Split(), new Fields("word"))

        .groupBy(new Fields("word"));

         //使用.each()方法，sentence tuple经过split()方法后输出word tuple  

         //使用.each()方法，new Fields()保留setence tuple和word tuple ,经过WordFilter() 过滤 单词 the

                       

        /*********end*********/





        StormTopology stormTopology = topology.build();

        LocalCluster cluster = new LocalCluster();

        Config conf = new Config();

        cluster.submitTopology("soc", conf,stormTopology);

    }

    //Filter过滤器

    public static class WordFilter extends BaseFilter {

        String actor;

        public WordFilter(String actor) {

            this.actor = actor;

        }

        @Override

        public boolean isKeep(TridentTuple tuple) {

            //如果元组的值和 actor 相等（这里的actor是“the”）

            if(tuple.getString(1).equals(actor)){

                //输出 Filter word is:the  and return type is:true

                System.out.println("Filter word is:"+tuple.getString(1) + "  and return type is:"+tuple.getString(1).equals(actor));

            }

            return tuple.getString(1).equals(actor);

        }

    }

    // Function函数

    public static class Split extends BaseFunction {

        public void execute(TridentTuple tuple, TridentCollector collector) {

            String sentence = tuple.getString(0);

            //把句子以空格切分为单词

            //每一个 sentence tuple 可能会被转换成多个 word tuple,

            //比如说 "the cow jumped over the moon" 这个句子会被转换成 6 个 "word" tuples

            for(String word: sentence.split(" ")) {

                System.out.println("emit word is :"+word);

                collector.emit(new Values(word));

            }

        }

   

}

}