Inverted Index in practice

 I always think that inverted-index is applied in search-engine.So fool......

Today a problem is that if there are millions of words,give you several  words that get most similar one .For examle,to a 3-len words.The request is that hits the target 2/3 in words at least.

First ,I thought of top(K) and then I learned a new way to sovle it "Inverted-Index".

In the first loop,make a new Inverted-index table that includes all the single character which appeared in the words.

To the new words if it is 3 char length,the request is as above.Divide it 2 char for a group and make intersecion.

At last the most similar words is got.It is a good way.

 

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值