Java IKAnalyzer configuration

 

Download and use IKAnalyzer:

Add the library JAR to the project as usual.

In MyEclipse:

Choose Project -> Build Path -> Configure Build Path -> Add External JARs.

Using stop words:

The configuration file (IKAnalyzer.cfg.xml):

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE properties SYSTEM "http://java.sun.com/dtd/properties.dtd">  
<properties>  

    <comment>IK Analyzer extension configuration</comment>
    <!-- Users can configure their own extension dictionaries here -->
    <entry key="ext_dict">/dicdata/use.dic.dic;/dicdata/googlepy.dic</entry>
    <!-- Users can configure their own extension stop word dictionaries here -->
    <entry key="ext_stopwords">/dicdata/ext_stopword.dic</entry>

</properties>

Add IKAnalyzer.cfg.xml to the root of the source folder, that is, directly under src/.
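
To check the placement, the following is a minimal sketch that looks for IKAnalyzer.cfg.xml at the classpath root and lists the dictionary paths it declares. It uses only the JDK: the file follows the standard Java properties DTD, so Properties.loadFromXML can read it. The class IkConfigCheck and its methods are hypothetical helpers, not part of IKAnalyzer, and the lookup assumes the dictionary paths are resolved against the classpath root.

```java
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.Properties;

// Hypothetical helper class for illustration; not part of IKAnalyzer.
class IkConfigCheck {

    // Parses a properties-format XML stream (the format of IKAnalyzer.cfg.xml).
    static Properties loadConfig(InputStream in) throws IOException {
        Properties props = new Properties();
        props.loadFromXML(in); // standard JDK call; closes the stream when done
        return props;
    }

    // Splits the value of one entry on ';' and drops empty items.
    static List<String> paths(Properties props, String key) {
        List<String> result = new ArrayList<>();
        String value = props.getProperty(key);
        if (value != null) {
            for (String item : value.split(";")) {
                String trimmed = item.trim();
                if (!trimmed.isEmpty()) {
                    result.add(trimmed);
                }
            }
        }
        return result;
    }

    public static void main(String[] args) throws IOException {
        // Files placed directly under src/ end up at the classpath root after a build.
        InputStream in = IkConfigCheck.class.getResourceAsStream("/IKAnalyzer.cfg.xml");
        if (in == null) {
            System.out.println("IKAnalyzer.cfg.xml is not at the classpath root");
            return;
        }
        Properties props = loadConfig(in);
        for (String key : new String[] {"ext_dict", "ext_stopwords"}) {
            for (String path : paths(props, key)) {
                // Assumption: a leading '/' means "relative to the classpath root".
                boolean found = IkConfigCheck.class.getResource(path) != null;
                System.out.println(key + ": " + path + (found ? " (found)" : " (missing)"));
            }
        }
    }
}
```

Running it from inside the project prints one line per configured dictionary, which makes a wrong folder or a misspelled file name easy to spot.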

stopword.dic (named ext_stopword.dic in the configuration above) holds the stop words you define, and it must be located at the path specified in IKAnalyzer.cfg.xml.

Use Notepad++ as the file editor, because it can save stopword.dic in the required encoding: UTF-8 without BOM.
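
If you are unsure how a dictionary file was saved, the following minimal sketch checks for the UTF-8 byte order mark (bytes EF BB BF) and removes it. A BOM left at the start of the file would be read as part of the first word, so that word may never match. The class BomCheck and the default path are assumptions for illustration; they are not part of IKAnalyzer.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper class for illustration; not part of IKAnalyzer.
class BomCheck {

    // True if the data starts with the UTF-8 byte order mark EF BB BF.
    static boolean hasUtf8Bom(byte[] data) {
        return data.length >= 3
                && (data[0] & 0xFF) == 0xEF
                && (data[1] & 0xFF) == 0xBB
                && (data[2] & 0xFF) == 0xBF;
    }

    // Returns the data without a leading BOM; returns it unchanged if there is none.
    static byte[] stripUtf8Bom(byte[] data) {
        return hasUtf8Bom(data) ? Arrays.copyOfRange(data, 3, data.length) : data;
    }

    public static void main(String[] args) throws IOException {
        // The default path is an assumption; pass your own file as the first argument.
        Path dic = Paths.get(args.length > 0 ? args[0] : "src/dicdata/ext_stopword.dic");
        byte[] data = Files.readAllBytes(dic);
        if (hasUtf8Bom(data)) {
            Files.write(dic, stripUtf8Bom(data)); // overwrite the file without the BOM
            System.out.println("BOM removed from " + dic);
        } else {
            System.out.println("No BOM in " + dic);
        }
    }
}
```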

Finally, save all the files and refresh the whole project; IKAnalyzer.cfg.xml and stopword.dic should then appear in your project.

