jackrabbit1.3.1 英文全文检索可以 中文无效请帮助 谢谢
下边是 jackrabbit的配置 workspace.xml和这个类似
[code]
<SearchIndex
class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
<param name="path" value="${wsp.home}/index" />
<param name="textFilterClasses"
value="org.apache.jackrabbit.extractor.MsExcelTextExtractor,
org.apache.jackrabbit.extractor.MsPowerPointTextExtractor,
org.apache.jackrabbit.extractor.MsWordTextExtractor,
org.apache.jackrabbit.extractor.PdfTextExtractor,
org.apache.jackrabbit.extractor.HTMLTextExtractor,
org.apache.jackrabbit.extractor.XMLTextExtractor,
org.apache.jackrabbit.extractor.RTFTextExtractor,
org.apache.jackrabbit.extractor.OpenOfficeTextExtractor" />
<!-- These are all default values. You can change them if you want -->
<param name="useCompoundFile" value="true" />
<param name="minMergeDocs" value="100" />
<param name="volatileIdleTime" value="3" />
<param name="maxMergeDocs" value="100000" />
<param name="mergeFactor" value="10" />
<param name="bufferSize" value="10" />
<param name="cacheSize" value="1000" />
<param name="forceConsistencyCheck" value="false" />
<param name="autoRepair" value="true" />
<!-- <param name="analyzer"
value="org.apache.lucene.analysis.standard.StandardAnalyzer" />-->
<!-- <param name="analyzer"
value="org.apache.lucene.analysis.cjk.CJKAnalyzer" />-->
<param name="analyzer"
value="org.mira.lucene.analysis.IK_CAnalyzer" />
<param name="queryClass"
value="org.apache.jackrabbit.core.query.QueryImpl" />
<param name="maxIdleTime" value="-1" />
<!-- end of default values -->
<param name="respectDocumentOrder" value="false" />
</SearchIndex>
[/code]
Query query = qm.createQuery("select * FROM nt:resource where CONTAINS( . , sb.toString()+"')",
当参数是英文的时候可以检索到 ,中文失效了.
我用的jackrabbit版本是1.3.1
下边是 jackrabbit的配置 workspace.xml和这个类似
[code]
<SearchIndex
class="org.apache.jackrabbit.core.query.lucene.SearchIndex">
<param name="path" value="${wsp.home}/index" />
<param name="textFilterClasses"
value="org.apache.jackrabbit.extractor.MsExcelTextExtractor,
org.apache.jackrabbit.extractor.MsPowerPointTextExtractor,
org.apache.jackrabbit.extractor.MsWordTextExtractor,
org.apache.jackrabbit.extractor.PdfTextExtractor,
org.apache.jackrabbit.extractor.HTMLTextExtractor,
org.apache.jackrabbit.extractor.XMLTextExtractor,
org.apache.jackrabbit.extractor.RTFTextExtractor,
org.apache.jackrabbit.extractor.OpenOfficeTextExtractor" />
<!-- These are all default values. You can change them if you want -->
<param name="useCompoundFile" value="true" />
<param name="minMergeDocs" value="100" />
<param name="volatileIdleTime" value="3" />
<param name="maxMergeDocs" value="100000" />
<param name="mergeFactor" value="10" />
<param name="bufferSize" value="10" />
<param name="cacheSize" value="1000" />
<param name="forceConsistencyCheck" value="false" />
<param name="autoRepair" value="true" />
<!-- <param name="analyzer"
value="org.apache.lucene.analysis.standard.StandardAnalyzer" />-->
<!-- <param name="analyzer"
value="org.apache.lucene.analysis.cjk.CJKAnalyzer" />-->
<param name="analyzer"
value="org.mira.lucene.analysis.IK_CAnalyzer" />
<param name="queryClass"
value="org.apache.jackrabbit.core.query.QueryImpl" />
<param name="maxIdleTime" value="-1" />
<!-- end of default values -->
<param name="respectDocumentOrder" value="false" />
</SearchIndex>
[/code]
Query query = qm.createQuery("select * FROM nt:resource where CONTAINS( . , sb.toString()+"')",
当参数是英文的时候可以检索到 ,中文失效了.
我用的jackrabbit版本是1.3.1