实现:
import string
def str_count(str):
'''找出字符串中的中英文、空格、数字、标点符号个数'''
count_en = 0
count_dg = 0
count_sp = 0
count_zh = 0
count_pu = 0
for s in str:
# 英文
if s in string.ascii_letters:
count_en += 1
# 数字
elif s.isdigit():
count_dg += 1
# 空格
elif s.isspace():
count_sp += 1
# 中文
elif s.isalpha():
count_zh += 1
# 特殊字符
else:
count_pu += 1
return count_zh
def str_count2(str):
for s in str:
# 中文字符范围
if '\u4e00' <= s <= '\u9fff':
print(s, end="\t")
示例:去掉文件中单个汉字词
with open('chs.txt', 'r') as open_file:
for line in open_file:
if str_count(line) > 1:
print(line)
参见:
https://blog.csdn.net/xiamoyanyulrq/article/details/81504114