Endeca Segmenter.jar Study

 First I give the architecture of the Segmentation and  Translit aftering analyzing the classes in the Segmenter.jar written by  endeca.com group.

 

 

Segmentation process:


 

Translit process:


 

We can see the two process are just use the same design pattern .Use adapter to modify the interface that can be suitable to be wrapped into the Runnable thread, and in the inner implementation ,it delegate the AdapterHandler to handle it .

 

Segmenter process  is used for segmentation ABCDE to A B C D E.

Translit process is used for read the feed file and make dictionary.

AdapterConfig ,( I cannot see the source code or decompiled code ,because ENDECA is not open sourced),but just guess from it ,I think it is a configuration wrapper class which can be mapped to some adapter configuration in the form of xml.

AdapterHandler, (still I cannot see the source code) .I think it can wrap some callback method and be used in the Segmenter  & Translit  process.

 

Some functions I cannot totally understand ,which prevent me from understanding the whole process .

1.        The first() method in the AdapterConfig class , I know it is used to retrieve maybe the first element which map the element tagged in the parameter ,but I do not know which file it parses.

2.       The emit() method in the AdapterHandler class





本文转自 charles_wang888 51CTO博客,原文链接:http://blog.51cto.com/supercharles888/834188,如需转载请自行联系原作者

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值