NLP之路-warm up

最新推荐文章于 2022-10-27 22:53:31 发布

j-o-l-i-n

最新推荐文章于 2022-10-27 22:53:31 发布

阅读量1.1k

点赞数

分类专栏： NLP Python 原创

本文链接：https://blog.csdn.net/jolinxia/article/details/39644081

版权

原创同时被 3 个专栏收录

65 篇文章 0 订阅

订阅专栏

Python

28 篇文章 0 订阅

订阅专栏

NLP

24 篇文章 0 订阅

订阅专栏

今天继续做了一些小的尝试，算作技术铺垫。

from nltk.book import *
print("*****import nltk.book OK")

print(sorted([w for w in set(text7) if '-'in w and 'index' in w]))
print('\n')
print(sorted([wd for wd in set(text3) if wd.istitle() and len(wd)> 10]))
print('\n')
print(sorted([w for w in set(sent7) if not w.islower()]))
print('\n')
print(sorted([t for t in set(text2) if 'cie' in t or 'cei'in t]))
print('\n')

for xyzzy in sent1:
     if xyzzy.endswith('l'):
         print xyzzy 

for token in sent1:
    if token.islower():
        print token, 'is a lowercase word'
    elif token.istitle():
        print token, 'is a titlecase word'
    else:
        print token, 'is punctuation' 

#请注意 print 语句结尾处的逗号，它告诉 Python在同一行输出。
tricky = sorted([w for w in set(text2) if 'cie' in w or 'cei' in w])
for word in tricky:
     print word,

和机器人对话