2、构建索引

最新推荐文章于 2021-09-20 12:58:14 发布

王小工

最新推荐文章于 2021-09-20 12:58:14 发布

阅读量495

点赞数

分类专栏：搜索引擎文章标签： exception 文档 class

本文链接：https://blog.csdn.net/mqiqe/article/details/7404899

版权

搜索引擎专栏收录该内容

2 篇文章 0 订阅

订阅专栏

本文介绍了索引过程的主要操作步骤，并通过一个基本的Lucene索引demo展示了如何建立索引，包括创建RAMDirectory对象、使用IndexWriter对象进行索引操作以及使用IndexReader进行读取。

摘要由CSDN通过智能技术生成

索引过程

主要操作步骤：

1、将原始文档转换成文本

2、分析文本

3、将分析好的文本保存至索引中

基本索引demo

 package com.lucene;

import java.io.IOException;

import org.apache.lucene.analysis.WhitespaceAnalyzer;
import org.apache.lucene.document.Document;
import org.apache.lucene.document.Field;
import org.apache.lucene.index.CorruptIndexException;
import org.apache.lucene.index.IndexReader;
import org.apache.lucene.index.IndexWriter;
import org.apache.lucene.store.Directory;
import org.apache.lucene.store.RAMDirectory;
import org.junit.Test;

public class IndexingTest {
	private Directory directory;
	private IndexWriter writer;

	@Test
	public void creasteIndex() throws  Exception {
		directory = new RAMDirectory();
		try {
			writer = getWriter();
			Document doc = new Document();
			doc.add(new Field("str", "hello,Wrold!", Field.Store.YES,
					Field.Index.ANALYZED));
			writer.addDocument(doc);
		} catch (IOException e) {
			e.printStackTrace();
		}finally{
			writer.close();
		}
	}
	@Test
	public  void IndexReader() throws Exception{
		creasteIndex();
		IndexReader reader=IndexReader.open(directory);
		System.out.println("文档数："+reader.maxDoc());
		reader.clone();
	}
	@SuppressWarnings("deprecation")
	private IndexWriter getWriter() throws IOException {
		return new IndexWriter(directory, new WhitespaceAnalyzer(),
				IndexWriter.MaxFieldLength.UNLIMITED);
	}
	
}