Using the pysparnn module for similar-sentence recall

高颜值的杀生丸

Article link: https://blog.csdn.net/u010970956/article/details/104586577

import pysparnn.cluster_index as ci
from sklearn.feature_extraction.text import TfidfVectorizer

data = [
    "hello world",
    "oh hello there",
    "Play it",
    "Play it again Sam",
]


tv = TfidfVectorizer()
tv.fit(data)
# Feature vectors: sparse TF-IDF matrix, one row per sentence
features_vec = tv.transform(data)

# Build the search index; each row of features_vec is paired with its original sentence
cp = ci.MultiClusterIndex(features_vec, data)

# Query sentences to look up in the index
search_data = [
    "oh there",
    "Play it again Frank"
]

search_feature_vec = tv.transform(search_data)

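# k is the number of results returned per query;
# k_clusters is the number of clusters searched at each level (larger values raise recall at some cost in speed)
print(cp.search(search_feature_vec, k=1, k_clusters=2, return_distance=False))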

[['oh hello there'], ['Play it again Sam']]
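
To sanity-check what the index returns, the same lookup can be done exhaustively. The sketch below is not part of pysparnn: it assumes the index is using cosine distance (which, as far as I know, is its default) and compares every query against every sentence with scikit-learn's `cosine_similarity`. The helper name `brute_force_top1` is made up for illustration.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


def brute_force_top1(corpus, queries):
    """Return, for each query, the corpus sentence with the highest cosine similarity.

    Hypothetical helper for comparison only; it scans the whole corpus,
    which is exactly the work an approximate index tries to avoid.
    """
    vectorizer = TfidfVectorizer()
    corpus_vec = vectorizer.fit_transform(corpus)   # shape: (n_sentences, n_terms)
    query_vec = vectorizer.transform(queries)       # shape: (n_queries, n_terms)

    # Dense (n_queries, n_sentences) matrix of cosine similarities
    sims = cosine_similarity(query_vec, corpus_vec)

    # Index of the most similar sentence for each query
    best = sims.argmax(axis=1)
    return [corpus[i] for i in best]


data = [
    "hello world",
    "oh hello there",
    "Play it",
    "Play it again Sam",
]
search_data = [
    "oh there",
    "Play it again Frank",
]

print(brute_force_top1(data, search_data))
```

On this four-sentence corpus the exhaustive search picks the same two sentences as the index. With a large corpus the two can disagree occasionally, since the cluster index only searches a subset of the data; raising `k_clusters` narrows that gap.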
