Java中的搜索技术Lucene--Field域和索引维护技术详细解析！(1)-CSDN博客

本文链接：https://blog.csdn.net/2401_86984420/article/details/141980223

如果需要商品描述，则根据搜索出的商品ID去数据库中查询，然后显示出商品描述信息即可。

2.Field常用类型

开发中常用的Filed类型，注意Field的属性，根据需求选择：

Field常用类型.PNG

3.Field改进代码

图书id：

是否分词：不用分词，因为不会根据商品id来搜索商品
是否索引：不索引，因为不需要根据图书ID进行搜索
是否存储：要存储，因为查询结果页面需要使用id这个值。

图书名称：

是否分词：要分词，因为要将图书的名称内容分词索引，根据关键搜索图书名称抽取的词。
是否索引：要索引。
是否存储：要存储。

图书价格：

是否分词：要分词，lucene对数字型的值只要有搜索需求的都要分词和索引，
因为lucene对数字型的内容要特殊分词处理，本例子可能要根据价格范围搜索，需要分词和索引。
是否索引：要索引
是否存储：要存储

图书图片地址：

是否分词：不分词
是否索引：不索引
是否存储：要存储

图书描述：

是否分词：要分词
是否索引：要索引
是否存储：因为图书描述内容量大，不在查询结果页面直接显示，不存储。
不存储是来不在lucene的索引文件中记录，节省lucene的索引文件空间，如果要在详情页面显示描述，
思路：
从lucene中取出图书的id，根据图书的id查询关系数据库中book表得到描述信息。

代码：

@Test
public void createIndex() throws Exception {
// 采集数据
BookDao dao = new BookDaoImpl();
List list = dao.queryBooks();

// 将采集到的数据封装到Document对象中
List docList = new ArrayList<>();
Document document;
for (Book book : list) {
document = new Document();
// store:如果是yes，则说明存储到文档域中
// 图书ID
// 不分词、索引、存储 StringField
Field id = new StringField(“id”, book.getId().toString(), Store.YES);
// 图书名称
// 分词、索引、存储 TextField
Field name = new TextField(“name”, book.getName(), Store.YES);
// 图书价格
// 分词、索引、存储但是是数字类型，所以使用FloatField
Field price = new FloatField(“price”, book.getPrice(), Store.YES);
// 图书图片地址
// 不分词、不索引、存储 StoredField
Field pic = new StoredField(“pic”, book.getPic());
// 图书描述
// 分词、索引、不存储 TextField
Field description = new TextField(“description”,
book.getDescription(), Store.NO);

// 设置boost值
if (book.getId() == 4)
description.setBoost(100f);

// 将field域设置到Document对象中
document.add(id);
document.add(name);
document.add(price);
document.add(pic);
document.add(description);

docList.add(document);
}

二、索引维护

需求：

管理人员通过电商系统更改图书信息，这时更新的是数据库，如果使用lucene搜索图书信息需要在数据库表book信息变化时及时更新lucene索引库。

1.添加索引

调用 indexWriter.addDocument（doc）添加索引。

@Test
public void createIndex() throws Exception {
// 采集数据
BookDao dao = new BookDaoImpl();
List list = dao.queryBooks();

// 将采集到的数据封装到Document对象中
List docList = new ArrayList<>();
Document document;
for (Book book : list) {
document = new Document();
// store:如果是yes，则说明存储到文档域中
// 图书ID
Field id = new TextField(“id”, book.getId().toString(), Store.YES);
// 图书名称
Field name = new TextField(“name”, book.getName(), Store.YES);
// 图书价格
Field price = new TextField(“price”, book.getPrice().toString(),
Store.YES);
// 图书图片地址
Field pic = new TextField(“pic”, book.getPic(), Store.YES);
// 图书描述
Field description = new TextField(“description”,
book.getDescription(), Store.YES);

// 将field域设置到Document对象中
document.add(id);
document.add(name);
document.add(price);
document.add(pic);
document.add(description);

docList.add(document);
}

// 创建分词器，标准分词器
Analyzer analyzer = new StandardAnalyzer();

// 创建IndexWriter
IndexWriterConfig cfg = new IndexWriterConfig(Version.LUCENE_4_10_3,
analyzer);
// 指定索引库的地址
File indexFile = new File(“E:\11-index\hm19\”);
Directory directory = FSDirectory.open(indexFile);
IndexWriter writer = new IndexWriter(directory, cfg);

// 通过IndexWriter对象将Document写入到索引库中
for (Document doc : docList) {
writer.addDocument(doc);
}

// 关闭writer
writer.close();
}

2.删除索引

1)删除指定索引

根据Term项删除索引，满足条件的将全部删除。

Term是索引域中最小的单位。根据条件删除时，建议根据唯一键来进行删除。在solr中就是根据ID来进行删除和修改操作的。

@Test
public void deleteIndex() throws Exception {
// 创建分词器，标准分词器
Analyzer analyzer = new StandardAnalyzer();

// 创建IndexWriter
IndexWriterConfig cfg = new IndexWriterConfig(Version.LUCENE_4_10_3,
analyzer);
Directory directory = FSDirectory
.open(new File(“E:\11-index\hcx\”));
// 创建IndexWriter
IndexWriter writer = new IndexWriter(directory, cfg);

// Terms
writer.deleteDocuments(new Term(“id”, “1”));

writer.close();
}

2)删除全部索引（慎用）

将索引目录的索引信息全部删除，直接彻底删除，无法恢复。慎用！

// 删除索引
@Test
public void deleteIndex() throws Exception {
// 1、指定索引库目录
Directory directory = FSDirectory.open(new File(“E:\11-index\0720”));
// 2、创建IndexWriterConfig
IndexWriterConfig cfg = new IndexWriterConfig(Version.LATEST,
new StandardAnalyzer());
// 3、创建IndexWriter
IndexWriter writer = new IndexWriter(directory, cfg);
// 4、通过IndexWriter来删除索引
// a)、删除全部索引