download and use the IKAnalyzer:
add library jar to the project as usual
in MyEclispse:
choose project->build path->configure build path->add ext jars
using stop words:
the configuration file<IKAnalyzer.cfg.xml>:
<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE properties SYSTEM "http://java.sun.com/dtd/properties.dtd">
<properties>
<comment>IK Analyzer 扩展配置</comment>
<!-- 用户可以在这里配置自己的扩展字典 -->
<entry key="ext_dict">/dicdata/use.dic.dic;/dicdata/googlepy.dic</entry>
<!-- 用户可以在这里配置自己的扩展停止词字典 -->
<entry key="ext_stopwords">/dicdata/ext_stopword.dic</entry>
</properties>
add
IKAnalyzer.cfg.xml to the root path of src, that's to say: src/
stopword.dic is the stop words you defined, and shall be in the certain path ruled in IKAnalyzer.cfg.xml
use NotePad++ as file editor for you can save the stopwords.dic as demanded utf8 without BOM.
At last, save all the file and refresh the whole project, you can see the IKAnalyzer.cfg.xml and stopword.dic in your project then.