leetcode-472. Concatenated Words

Article link: https://blog.csdn.net/sinat_41679123/article/details/106959956

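This post describes an algorithm that finds, in a given word list, every concatenated word made up of at least two words from that list. Using sets and dictionaries, it checks whether each word can be assembled from several of the remaining words.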


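Problem

Given a list of words without duplicates, write a program that returns all the concatenated words in the list.

A concatenated word is a string made up entirely of at least two words from the given array.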

Example:

Input: ["cat","cats","catsdogcats","dog","dogcatsdog","hippopotamuses","rat","ratcatdogcat"]

Output: ["catsdogcats","dogcatsdog","ratcatdogcat"]

Explanation: "catsdogcats" is made up of "cats", "dog" and "cats";
     "dogcatsdog" is made up of "dog", "cats" and "dog";
     "ratcatdogcat" is made up of "rat", "cat", "dog" and "cat".
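
To make the explanation concrete, here is a minimal sketch that recovers one decomposition of a word. The helper name `find_split` is hypothetical and not part of the original solution. It tries the shortest matching prefix first and is recursive, so very long words could hit Python's recursion limit.

```python
from functools import lru_cache
from typing import List, Optional, Tuple


def find_split(word: str, words: List[str]) -> Optional[List[str]]:
    """Return one way to write `word` as >= 2 words from `words`, or None."""
    if not word:
        return None
    # Exclude the word itself so that any split found has at least two pieces.
    others = {w for w in words if w and w != word}

    @lru_cache(maxsize=None)
    def split_from(start: int) -> Optional[Tuple[str, ...]]:
        # Returns the pieces covering word[start:], or None if no split exists.
        if start == len(word):
            return ()
        for end in range(start + 1, len(word) + 1):
            piece = word[start:end]
            if piece in others:
                rest = split_from(end)
                if rest is not None:
                    return (piece,) + rest
        return None

    pieces = split_from(0)
    return list(pieces) if pieces is not None else None


words = ["cat", "cats", "catsdogcats", "dog", "dogcatsdog",
         "hippopotamuses", "rat", "ratcatdogcat"]
print(find_split("catsdogcats", words))   # ['cats', 'dog', 'cats']
print(find_split("hippopotamuses", words))  # None
```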

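Notes:

The given array contains at most 10000 elements.
The total length of the elements in the given array does not exceed 600000.
All input strings contain only lowercase letters.
The order of the output does not matter.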

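Approach

This follows the same idea as Word Break and Word Break II: for each word, take all the other words as wordDict, then check whether the current word can be cut into at least two words from that dict. Because the word itself is excluded and the list has no duplicates, any complete split necessarily uses at least two words.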
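
The "at least two words" condition can also be checked by counting pieces directly. The sketch below is a variant, not the code from this post: instead of rebuilding the dictionary without the current word, it keeps the full set and computes the largest number of pieces a word can be cut into. The names `max_pieces` and `concatenated_words` are hypothetical.

```python
from typing import List, Set


def max_pieces(word: str, word_dict: Set[str]) -> int:
    """Largest number of dictionary words `word` can be cut into, or -1 if none."""
    # best[i] = max number of pieces covering word[:i]; -1 means unreachable.
    best = [-1] * (len(word) + 1)
    best[0] = 0
    for end in range(1, len(word) + 1):
        for begin in range(end):
            if best[begin] >= 0 and word[begin:end] in word_dict:
                best[end] = max(best[end], best[begin] + 1)
    return best[len(word)]


def concatenated_words(words: List[str]) -> List[str]:
    word_dict = set(words)
    # The word itself stays in the dictionary; requiring >= 2 pieces
    # rules out the trivial one-piece split.
    return [w for w in words if max_pieces(w, word_dict) >= 2]
```

Note that a repeated piece still counts: with "cat" in the list, "catcat" has two pieces even though it uses only one distinct word.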

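The version I first submitted got one test case wrong. LeetCode does not show all the results, so I could not tell where it went wrong. A likely culprit is a repeated-word case such as "catcat" with "cat" in the list: it is made of two words but only one distinct word, so any check that counts distinct pieces rejects it. Some of the posted solutions say this is a trie (prefix tree) problem, so it is worth reading up on tries and then trying again.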
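
The post does not include a trie solution. For reference, here is one common shape such a solution can take, as a sketch under my own assumptions rather than the approach used in the referenced solutions. The names `TrieNode`, `insert`, `can_form` and `concatenated_words_trie` are hypothetical. Words are processed from shortest to longest, so when a word is checked the trie holds only other words that are no longer than it, and any split found has at least two pieces.

```python
from typing import Dict, List


class TrieNode:
    def __init__(self) -> None:
        self.children: Dict[str, "TrieNode"] = {}
        self.is_word = False


def insert(root: TrieNode, word: str) -> None:
    node = root
    for ch in word:
        node = node.children.setdefault(ch, TrieNode())
    node.is_word = True


def can_form(root: TrieNode, word: str) -> bool:
    """True if `word` splits entirely into words already stored in the trie."""
    n = len(word)
    if n == 0:
        return False
    # reachable[i] is True when word[:i] splits into stored words.
    reachable = [False] * (n + 1)
    reachable[0] = True
    for start in range(n):
        if not reachable[start]:
            continue
        # Walk the trie along word[start:], marking every end of a stored word.
        node = root
        for end in range(start, n):
            node = node.children.get(word[end])
            if node is None:
                break
            if node.is_word:
                reachable[end + 1] = True
    return reachable[n]


def concatenated_words_trie(words: List[str]) -> List[str]:
    root = TrieNode()
    result = []
    for word in sorted(words, key=len):
        if not word:
            continue  # an empty string is neither a piece nor a concatenation
        if can_form(root, word):
            result.append(word)
        insert(root, word)  # inserted only after the check, so it cannot match itself
    return result
```

The result comes back ordered by word length rather than input order, which the problem statement allows.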

Code

from typing import List


class Solution:
    def can_break(self, word: str, word_set: set) -> bool:
        # An empty string is never a concatenated word.
        if not word:
            return False
        # reachable lists every index i such that word[:i] can be cut into
        # words from word_set. Only reachability is tracked; storing every
        # segmentation can blow up exponentially on inputs like "aaaa...".
        reachable = [0]
        for end_index in range(1, len(word) + 1):
            if any(word[begin_index:end_index] in word_set for begin_index in reachable):
                reachable.append(end_index)
        # word_set never contains the word itself, so any complete split has
        # at least two pieces. Repeated pieces count: "catcat" = "cat" + "cat".
        return reachable[-1] == len(word)

    def findAllConcatenatedWordsInADict(self, words: List[str]) -> List[str]:
        word_set = set(words)  # set membership is O(1) on average
        result = []
        for each_word in words:
            # Temporarily remove the current word so it cannot match itself.
            word_set.discard(each_word)
            if self.can_break(each_word, word_set):
                result.append(each_word)
            word_set.add(each_word)
        return result