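I have been working on natural language processing recently, and for a named entity recognition (NER) task I needed the radical and the pinyin of each Chinese character.
I tried the cnradical library and it worked well for this. It is open source and can be installed from a terminal with: pip install cnradical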
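from cnradical import Radical, RunOption

radical = Radical(RunOption.Radical)  # converter that returns radicals
pinyin = Radical(RunOption.Pinyin)  # converter that returns pinyin

# Named `text` rather than `input` to avoid shadowing the Python built-in
text = '你好,今天早上吃饭了吗'

# Convert one character at a time; each call returns a single result
radical_out = [radical.trans_ch(ch) for ch in text]
pinyin_out = [pinyin.trans_ch(ch) for ch in text]
print(radical_out)
print(pinyin_out)

# Convert the whole string in one call; the result is a string
radical_out = radical.trans_str(text)
pinyin_out = pinyin.trans_str(text)
print(radical_out)
print(pinyin_out)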
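The output is as follows. Characters that are not Chinese, such as the comma, come back as None in the per-character lists and are passed through unchanged in the string results. Pinyin is looked up per character without context, so 了 and 吗 come out as liǎo and má rather than the neutral-tone le and ma used in this sentence.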
['亻', '女', None, '人', '大', '日', '一', '口', '饣', '乙', '口']
['nǐ', 'hǎo', None, 'jīn', 'tiān', 'zǎo', 'shàng', 'chī', 'fàn', 'liǎo', 'má']
亻女,人大日一口饣乙口
nǐ hǎo , jīn tiān zǎo shàng chī fàn liǎo má
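Since the per-character lists contain None for punctuation, they usually need a little cleanup before they can be fed to an NER model as features. Below is a minimal sketch of one way to do that, assuming the radical list printed above is used as-is. The names build_vocab, encode, PAD and UNK are hypothetical helpers for illustration, not part of cnradical, and the list is hard-coded so the snippet runs without the library installed.

```python
from typing import Dict, List, Optional, Sequence

PAD = "<PAD>"  # hypothetical token reserved for sequence padding (id 0)
UNK = "<UNK>"  # hypothetical token for None or unseen entries (id 1)


def build_vocab(sequences: Sequence[Sequence[Optional[str]]]) -> Dict[str, int]:
    """Assign an integer id to every distinct radical (or pinyin) seen."""
    vocab = {PAD: 0, UNK: 1}
    for seq in sequences:
        for item in seq:
            key = UNK if item is None else item
            if key not in vocab:
                vocab[key] = len(vocab)
    return vocab


def encode(seq: Sequence[Optional[str]], vocab: Dict[str, int]) -> List[int]:
    """Map a sequence to ids; None and unknown entries fall back to UNK."""
    return [vocab.get(UNK if item is None else item, vocab[UNK]) for item in seq]


# The per-character radical list copied from the output above
radical_out = ['亻', '女', None, '人', '大', '日', '一', '口', '饣', '乙', '口']

radical_vocab = build_vocab([radical_out])
radical_ids = encode(radical_out, radical_vocab)
print(radical_ids)  # [2, 3, 1, 4, 5, 6, 7, 8, 9, 10, 8]
```

The id sequence has the same length as the input sentence, so it can be aligned one-to-one with the character ids and fed to a separate embedding layer. The same two functions work for the pinyin list.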