python最长匹配_最大匹配算法进行分词前向后向 python实现

最新推荐文章于 2021-11-10 16:40:33 发布

weixin_39900736

最新推荐文章于 2021-11-10 16:40:33 发布

阅读量183

点赞数

文章标签： python最长匹配

# 先定义个词典

word_dict = ['我们', '经常', '有','有意见','意见','分歧']

# 滑动窗口的大小

max_len = 5

# 用户的输入

user_input = '我们经常有意见分歧'

len(user_input)

结果：

前向最大匹配算法的实现

# 前向最大匹配算法

result = []

i = 0

while i < len(user_input):

matched = False

pos = i + max_len if i + max_len < len(user_input) else len(user_input)

while user_input[i:pos] not in word_dict and i < pos:

print(user_input[i:pos])

pos -= 1

if i < pos:

matched = True

result.append(user_input[i:pos])

i = pos if matched == True else i + max_len

print(result)

输出结果：

我们经常有

我们经常

我们经

经常有意见

经常有意

经常有

有意见分歧

有意见分

['我们', '经常', '有意见', '分歧']

后向最大匹配算法的实现

# 后向最大匹配算法

result = []

i = len(user_input)

while i > 0:

matched = False

pos = i - max_len if i - max_len > 0 else 0

while user_input[pos:i] not in word_dict and i > pos:

print(user_input[pos:i])

pos += 1

if i > pos:

matched = True

result.insert(0, user_input[pos:i])

i = pos if matched == True else i - max_len

print(result)

输出结果：

有意见分歧

意见分歧

见分歧

经常有意见

常有意见

我们经常

们经常

['我们', '经常', '有意见', '分歧']

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

weixin_39900736

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
python最长匹配_最大匹配算法进行分词前向后向 python实现

# 先定义个词典word_dict = ['我们', '经常', '有','有意见','意见','分歧']# 滑动窗口的大小max_len = 5# 用户的输入user_input = '我们经常有意见分歧'len(user_input)结果：9前向最大匹配算法的实现# 前向最大匹配算法result = []i = 0while i < len(user_input):matched = F...
复制链接

扫一扫