File name: DeleteStopWord
Development tool: Java
File size: 5398 KB
Upload date: 2014-09-19
Downloads: 3
Uploader: 傅颖
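Description: This source code is mainly intended for Chinese text preprocessing. It first segments the text into words, then filters the stop words out of the segmented text.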
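The filtering step described above can be sketched roughly as follows. This is a minimal illustration, not the package's actual code: it assumes the segmenter has already produced whitespace-separated tokens and that the stop word list holds one word per line. The class name `StopWordFilterSketch` and the methods `loadStopWords` and `filter` are hypothetical, and the ICTCLAS50 segmentation call itself is not shown.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch: names and input format are assumptions, not taken from the package.
class StopWordFilterSketch {

    // Build a stop-word set from the lines of a word list (assumed: one word per line).
    static Set<String> loadStopWords(List<String> lines) {
        Set<String> stopWords = new HashSet<>();
        for (String line : lines) {
            String word = line.trim();
            if (!word.isEmpty()) {
                stopWords.add(word);
            }
        }
        return stopWords;
    }

    // Drop every token of an already-segmented line that appears in the stop-word set.
    // Assumed: tokens are separated by whitespace.
    static String filter(String segmentedLine, Set<String> stopWords) {
        List<String> kept = new ArrayList<>();
        for (String token : segmentedLine.trim().split("\\s+")) {
            if (!token.isEmpty() && !stopWords.contains(token)) {
                kept.add(token);
            }
        }
        return String.join(" ", kept);
    }

    public static void main(String[] args) {
        Set<String> stopWords = loadStopWords(Arrays.asList("的", "了"));
        System.out.println(filter("我 的 书 丢 了", stopWords)); // prints "我 书 丢"
    }
}
```

A hash set keeps each lookup at roughly constant time, which matters when the stop word list is long and every token of every document is checked against it.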
File list:
DeleteStopWord\DeleteStopWord\.classpath
..............\..............\.project
..............\..............\.settings\org.eclipse.core.resources.prefs
..............\..............\.........\org.eclipse.jdt.core.prefs
..............\..............\bin\ICTCLAS\I3S\AC\ICTCLAS50.class
..............\..............\...\similarityCompution\CSV.class
..............\..............\...\...................\CSV_handler.class
..............\..............\...\...................\FileExcludeStopWord.class
..............\..............\...\...................\SortDocsTopics.class
..............\..............\Configure.xml
..............\..............\Data\BiWord.big
..............\..............\....\character.idx
..............\..............\....\character.type
..............\..............\....\CoreDict.pdat
..............\..............\....\CoreDict.pos
..............\..............\....\CoreDict.unig
..............\..............\....\FieldDict.pdat
..............\..............\....\FieldDict.pos
..............\..............\....\GranDict.pdat
..............\..............\....\GranDict.pos
..............\..............\....\ICTCLAS30.ctx
..............\..............\....\ICTCLAS_First.map
..............\..............\....\ICTPOS.map
..............\..............\....\nr.ctx
..............\..............\....\nr.fsa
..............\..............\....\nr.role
..............\..............\....\PKU.map
..............\..............\....\PKU_First.map
..............\..............\destFile\newtrain.csv
..............\..............\........\newtrain.txt
..............\..............\ICTCLAS.log
..............\..............\ICTCLAS50.dll
..............\..............\ICTCLAS50.h
..............\..............\ICTCLAS50.lib
..............\..............\ICTCLAS_I3S_AC_ICTCLAS50.h
..............\..............\src\ICTCLAS\I3S\AC\ICTCLAS50.java
..............\..............\...\similarityCompution\CSV.java
..............\..............\...\...................\CSV_handler.java
..............\..............\...\...................\FileExcludeStopWord.java
..............\..............\...\...................\SortDocsTopics.java
..............\..............\...File\oldtrain.txt
..............\..............\.......\StopWordTable.txt
..............\..............\user.lic
..............\..............\bin\ICTCLAS\I3S\AC
..............\..............\src\ICTCLAS\I3S\AC
..............\..............\bin\ICTCLAS\I3S
..............\..............\src\ICTCLAS\I3S
..............\..............\bin\ICTCLAS
..............\..............\...\similarityCompution
..............\..............\src\ICTCLAS
..............\..............\...\similarityCompution
..............\..............\.settings
..............\..............\bin
..............\..............\Data
..............\..............\destFile
..............\..............\src
..............\..............\srcFile
..............\DeleteStopWord
DeleteStopWord
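[Camel_CH_SDK.rar] - Java sample code for calling a network camera SDK, together with the project's jar packages
[DB_test2.zip] - Java Swing + MySQL: a small database management system, the "National Information Query System". Users enter a country name in the interface to look up its basic information, including a geographic map; administrators can maintain the information.
[TextCategorizer.zip] - A self-implemented Chinese word segmenter and Bayesian text classifier, with a segmentation dictionary and a Chinese stop word list, for data mining study and exchange.
Developed with Visual Studio 2010
[tingyongci.rar] - A small text-processing program for stop words: it uses a common stop word list to remove stop words from text.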