维特比算法实现词性标注

最新推荐文章于 2021-11-12 00:02:47 发布

水瓶座千里光

最新推荐文章于 2021-11-12 00:02:47 发布

阅读量740

点赞数 3

分类专栏： NLP 文章标签： nlp 词性标注

本文链接：https://blog.csdn.net/qq_25006671/article/details/102545022

版权

参照贪心科技的视频，按照其中的教学一步一步写出的代码，经过测试，可以运行，写出来供大家参考学习之。

import numpy as np

tag2id, id2tag = {
   }, {
   }
word2id, id2word = {
   }, {
   }
for line in open('traindata.txt'):  # 抽取单词和词性
    items = line.split('/')
    word, tag = items[0], items[1].rstrip()
    if word not in word2id:
        word2id[word] = len(word2id)
        id2word[len(id2word)] = word
    if tag not in tag2id:
        tag2id[tag] = len(tag2id)
        id2tag[len(id2tag)] = tag
M = len(word2id)  # 词典的大小
N = len(tag2id)  # 词性种类个数
# print(M, N)
# print(id2tag)

# 构建 pi,A,B
pi = np.zeros(N)  # 每个单词出现在句子第一个位置的概率
A = np.zeros((N, M))  # A[i

最低0.47元/天解锁文章

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

水瓶座千里光

关注关注

3
点赞
踩
6

收藏

觉得还不错? 一键收藏
1
评论
维特比算法实现词性标注

句子的词性标注简单实现参照贪心科技的视频，按照其中的教学一步一步写出的代码，经过测试，可以运行，写出来供大家参考学习之。import numpy as nptag2id, id2tag = {}, {}word2id, id2word = {}, {}for line in open('traindata.txt'): # 抽取单词和词性 items = line.split...
复制链接

扫一扫