Q:
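Given a paragraph (paragraph) and a list of banned words (banned), return the most frequent word that is not in the banned list. It is guaranteed that at least one word is not banned and that the answer is unique.
Words in the banned list are given in lowercase and contain no punctuation. Words in the paragraph are case-insensitive. The answer is in lowercase.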
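Example:
Input: paragraph = "Bob hit a ball, the hit BALL flew far after it was hit.", banned = ["hit"]
Output: "ball"
Explanation: "hit" occurs 3 times, but it is a banned word. "ball" occurs 2 times (and no other word occurs 2 times), so it is the most frequent non-banned word in the paragraph. Note that words in the paragraph are case-insensitive and punctuation is ignored (even when it is adjacent to a word, as in "ball,"). "hit" is not the answer even though it occurs more often, because it is in the banned list.
Link: https://leetcode-cn.com/problems/most-common-word/description/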
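To make the counts in the example concrete, here is a minimal sketch that tallies the sample paragraph. It assumes a word is any run of the letters a-z after lowercasing; the variable names are illustrative and not part of the problem statement.

```python
import re
from collections import Counter

paragraph = "Bob hit a ball, the hit BALL flew far after it was hit."
banned = ["hit"]

# Assumption: a "word" is a maximal run of letters once the text is lowercased,
# so punctuation such as the comma in "ball," is dropped.
words = re.findall(r"[a-z]+", paragraph.lower())
counts = Counter(words)

print(counts["hit"])   # 3
print(counts["ball"])  # 2

# Drop banned words, then pick the word with the highest remaining count.
allowed = {word: n for word, n in counts.items() if word not in banned}
best = max(allowed, key=allowed.get)
print(best)  # ball
```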
Approach: brute force
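class Solution:
    def mostCommonWord(self, paragraph, banned):
        """
        :type paragraph: str
        :type banned: List[str]
        :rtype: str
        """
        import re
        dic = {}
        # Split on the punctuation marks and spaces that can separate words.
        for words in re.split('[!?\',;. ]', paragraph):
            # These replace calls are no-ops: the split above already removed these characters.
            words = words.replace('!', '')
            words = words.replace('?', '')
            words = words.replace("'", "")
            words = words.replace(',', '')
            words = words.replace('.', '')
            words = words.replace(';', '')
            words = words.replace(' ', '')
            words = words.lower()
            # Count only the words that are not banned.
            if words not in banned:
                if words in dic:
                    dic[words] += 1
                else:
                    dic[words] = 1
        # Walk the words from most to least frequent.
        for key in sorted(dic.items(), key=lambda d: d[1], reverse=True):
            # Skip the empty string produced by adjacent separators such as ", ".
            if key[0]:
                return key[0]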
The runtime is a bit painful, though -_-||
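As a rough sketch of how the same brute-force idea could be trimmed: the version below tokenizes with a single re.findall call, looks banned words up in a set, and takes the maximum in one pass instead of sorting every entry. The function name most_common_word is hypothetical, and whether this actually changes the judged runtime has not been measured here.

```python
import re

def most_common_word(paragraph, banned):
    # Assumption: a set gives faster membership tests than the original list.
    banned_set = set(banned)
    counts = {}
    # Assumption: a word is a run of letters a-z after lowercasing.
    for word in re.findall(r"[a-z]+", paragraph.lower()):
        if word not in banned_set:
            counts[word] = counts.get(word, 0) + 1
    # A single max() pass replaces sorting all (word, count) pairs.
    return max(counts, key=counts.get)

print(most_common_word(
    "Bob hit a ball, the hit BALL flew far after it was hit.", ["hit"]
))  # ball
```

Because findall never produces empty strings, no separate check for empty tokens is needed in this variant.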
The top-ranked solution:
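import collections
import re

class Solution:
    def mostCommonWord(self, paragraph, banned):
        """
        :type paragraph: str
        :type banned: List[str]
        :rtype: str
        """
        # Lowercase the text and replace punctuation with spaces, so that words
        # joined only by punctuation (e.g. "b,b") are still split apart.
        tokens = re.sub('[!?\',;.]', ' ', paragraph.lower()).split()
        counter = collections.Counter(tokens)
        # most_common() yields (word, count) pairs from most to least frequent.
        for i in counter.most_common():
            if i[0] not in banned:
                return i[0]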
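
The loop above relies on how collections.Counter.most_common() orders its result. A small sketch with a hand-made token list (illustrative data, not taken from the judge) shows that ordering and why the first non-banned entry is the answer:

```python
from collections import Counter

# Illustrative tokens: "hit" three times, "ball" twice, "far" once.
counter = Counter(["hit", "ball", "hit", "ball", "hit", "far"])

# most_common() returns (word, count) pairs sorted by count, highest first.
ranked = counter.most_common()
print(ranked)  # [('hit', 3), ('ball', 2), ('far', 1)]

# The first word in that order that is not banned is the result.
banned = {"hit"}
answer = next(word for word, _ in ranked if word not in banned)
print(answer)  # ball
```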