Solr: stripping HTML and excluding an HTML class from indexing

I'm indexing a knowledge base with Solr. The problem is that the navigation menu is indexed as well, so searching for a term that appears in the menu returns every page.

Can I somehow tell Solr to exclude a specific HTML class from indexing?

HTML tags are removed during indexing, so I can't identify the element afterwards.

EDIT:

I added a short sample of what I want to achieve.

That is, to exclude certain HTML nodes (like my navigation) from being indexed.

Sample html:

  • topic-1
  • topic-2
  • topic-3

Topic-1

Lorem ipsum dolor sit ament...

What I currently get in my index from that:

topic-1

topic-2

topic-3

Topic-1

lorem ipsum dolor sit ament...

What I want to get in my index from that:

Topic-1

lorem ipsum dolor sit ament...
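
To make the desired result concrete, here is a minimal sketch of one possible workaround: dropping the unwanted nodes from the HTML before the document is sent to Solr. It is not a Solr feature, only a preprocessing step using Python's standard `html.parser`. The class name `navigation`, the sample markup, and the helper names `ClassExcludingExtractor` and `extract_text` are assumptions for illustration, since the original markup is not shown above. It also assumes reasonably well-formed HTML (every non-void element inside the excluded node has a closing tag).

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class ClassExcludingExtractor(HTMLParser):
    """Collects text, skipping any element that carries the excluded class."""

    def __init__(self, excluded_class):
        super().__init__(convert_charrefs=True)
        self.excluded_class = excluded_class  # hypothetical class name to drop
        self.skip_depth = 0                   # > 0 while inside an excluded element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            # Already inside an excluded element: track nesting only.
            self.skip_depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        if self.excluded_class in classes:
            self.skip_depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.skip_depth:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-empty text found outside excluded elements.
        if not self.skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_text(html, excluded_class="navigation"):
    """Return the text of `html` without elements marked with `excluded_class`."""
    parser = ClassExcludingExtractor(excluded_class)
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


# Assumed markup for the sample above; the real page structure may differ.
sample = """
<ul class="navigation">
  <li>topic-1</li>
  <li>topic-2</li>
  <li>topic-3</li>
</ul>
<h1>Topic-1</h1>
<p>Lorem ipsum dolor sit ament...</p>
"""

print(extract_text(sample))
# Topic-1
# Lorem ipsum dolor sit ament...
```

The resulting text would then be sent to Solr in place of the raw page, so the menu entries never reach the index.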
