InternLM (书生·浦语) Large Model Practical Camp, Session 3: Python Basics Island Check-in
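Task requirements:
Task 1:
Implement a wordcount function that counts how many times each word appears in an English string. Return a dictionary whose keys are the words and whose values are the corresponding counts.
TIPS: Remember to strip the punctuation first, then convert every word to lowercase. You don't need to handle many kinds of punctuation; covering what appears in the sample input is enough.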
Eg:
Input:
"""Hello world!
This is an example.
Word count is fun.
Is it fun to count words?
Yes, it is fun!"""
Output:
{'hello': 1, 'world': 1, 'this': 1, 'is': 4, 'an': 1, 'example': 1, 'word': 1, 'count': 2,
'fun': 3, 'it': 2, 'to': 1, 'words': 1, 'yes': 1}
text = """
Got this panda plush toy for my daughter's birthday,
who loves it and takes it everywhere. It's soft and
super cute, and its face has a friendly look. It's
a bit small for what I paid though. I think there
might be other options that are bigger for the
same price. It arrived a day earlier than expected,
so I got to play with it myself before I gave it
to her.
"""
def wordcount(text):
    pass
Function definition:
import re
from collections import Counter

def wordcount(text):
    """
    Counts the occurrences of each word in the given text.
    Args:
    - text (str): The input text string to analyze.
    Returns:
    - dict: A dictionary where keys are words and values are the counts of those words.
    """
    # Convert the text to lowercase
    text = text.lower()
    # Use a regular expression to extract the words from the text
    # \w matches letters, digits, and underscores
    words = re.findall(r'\w+', text)
    # Use Counter to count the occurrences of each word
    word_counts = Counter(words)
    return dict(word_counts)
Run result:
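One way to check the function is to run it on the sample input from Task 1 and compare the result with the expected dictionary. The sketch below is self-contained and assumes the same lowercase-then-`\w+` logic as the function above. It also shows a side effect of `\w+`: an apostrophe acts as a separator, so "It's" is counted as "it" and "s". The helper `wordcount_keep_apostrophes` and its pattern are hypothetical additions for illustration, not part of the original solution.

```python
import re
from collections import Counter


def wordcount(text):
    # Same logic as above: lowercase, extract runs of word characters, count.
    words = re.findall(r'\w+', text.lower())
    return dict(Counter(words))


def wordcount_keep_apostrophes(text):
    # Hypothetical variant: keeps contractions such as "it's" as one word
    # by allowing an apostrophe between runs of letters or digits.
    words = re.findall(r"[a-z0-9]+(?:'[a-z0-9]+)*", text.lower())
    return dict(Counter(words))


sample = """Hello world!
This is an example.
Word count is fun.
Is it fun to count words?
Yes, it is fun!"""

expected = {'hello': 1, 'world': 1, 'this': 1, 'is': 4, 'an': 1, 'example': 1,
            'word': 1, 'count': 2, 'fun': 3, 'it': 2, 'to': 1, 'words': 1, 'yes': 1}

print(wordcount(sample) == expected)  # True

# The apostrophe splits "It's" into "it" and "s" with the original pattern.
print(wordcount("It's soft. It's cute."))
# {'it': 2, 's': 2, 'soft': 1, 'cute': 1}

print(wordcount_keep_apostrophes("It's soft. It's cute."))
# {"it's": 2, 'soft': 1, 'cute': 1}
```

The sample input contains no apostrophes, so the original pattern already satisfies the task; the variant only matters for text like the panda review, where "daughter's" and "It's" appear.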
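Task 2: Use VS Code on your local machine to connect to the remote development machine, debug the wordcount function you wrote above on that machine to experience the full debug workflow, and write up a set of debug notes (screenshots required).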
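Debug notes:
Setting breakpoints:
Run to the first breakpoint and check the sidebar; only the text variable is present:
Run to the second breakpoint: text has been converted to lowercase and displays correctly.
Run to the third breakpoint: word_counts is a Counter object. Counter is a subclass of dict and can be used just like an ordinary dictionary, but because the task asks for a dictionary to be returned, word_counts is converted to a plain dict at the next breakpoint before being returned.
Run to the fourth breakpoint: the final word_counts dictionary that is returned is visible.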
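The observation from the third breakpoint, that Counter is a dict subclass and can be converted to a plain dict, can be verified outside the debugger. This is a minimal sketch with a made-up word list, not the values from the actual debug session.

```python
from collections import Counter

# Made-up input for illustration only.
word_counts = Counter(['it', 'is', 'fun', 'it'])

# Counter inherits from dict, so dict operations work on it directly.
is_dict_subclass = isinstance(word_counts, dict)
print(is_dict_subclass)  # True

# Unlike a plain dict, a Counter returns 0 for a missing key
# instead of raising KeyError, and it does not insert the key.
missing = word_counts['panda']
print(missing)  # 0

# dict() produces a plain dictionary with the same key/value pairs.
plain = dict(word_counts)
print(type(plain) is dict)  # True
print(plain)  # {'it': 2, 'is': 1, 'fun': 1}
```

Returning `dict(word_counts)` therefore changes only the type, not the counts, which matches what the fourth breakpoint shows.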