Leetcode | 30. Substring with Concatenation of All Words

Latest recommended article published 2020-09-30 14:38:55

HW_WY

Category: leetcode

Article link: https://blog.csdn.net/zhang15953709913/article/details/84963377

You are given a string s and a list of words, words, that are all of the same length. Find all starting indices of substrings in s that are a concatenation of each word in words exactly once, without any intervening characters.

Example 1:

Input:
  s = "barfoothefoobarman",
  words = ["foo","bar"]
Output: [0,9]
Explanation: Substrings starting at index 0 and 9 are "barfoo" and "foobar" respectively.
The output order does not matter, returning [9,0] is fine too.

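Approach: use two HashMaps. First, store every word in a HashMap, with the word as the key and its number of occurrences as the value (the given words may repeat, so the count can be 1, 2, or more). Then scan the substring one word at a time. If the current word is in the first HashMap, add it to a second HashMap and check whether its count there exceeds its count in the first HashMap. If it does, the substring is not one we are looking for, so we move on to the next substring. If it does not, we continue with the next word. When the scan finishes and every word in the substring has passed the check, the substring is one of the answers. Here is a concrete example.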
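
The first map described above can be sketched with the standard library's collections.Counter. This is only an illustration: the variable name need and the sample word list are assumptions made for the example, not part of the original post.

```python
from collections import Counter

# Sample input chosen for illustration; "foo" is deliberately repeated.
words = ["foo", "bar", "foo"]

# Key: the word itself. Value: how many times it appears in `words`.
need = Counter(words)

print(need["foo"])  # 2, because "foo" is listed twice
print(need["bar"])  # 1
print(need["baz"])  # 0, a Counter returns 0 for a key it does not hold
```

A plain dict with an explicit membership test, as in the solution further down, behaves the same way; Counter only shortens the counting loop.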

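We store words in a HashMap, called HashMap1 here. In this example, foo has the value 2 in HashMap1.

Then we iterate over each word of the substring.

The first word is in HashMap1, so we store foo in HashMap2 and compare its value there with its value in HashMap1: 1 < 2, so we keep scanning.

The second word is also in HashMap1, so we store foo in HashMap2 again. It is already there, so its value is updated to 2. Comparing with the value in HashMap1 gives 2 <= 2, so we go on to the next word.

The third word is also in HashMap1, so we store foo in HashMap2 once more and its value becomes 3. Comparing with the value in HashMap1 gives 3 > 2, so this substring does not qualify, and we move on to the next substring.

All of the cases above involve a word that is in HashMap1. If a word is not in HashMap1, things are even simpler: the current substring cannot qualify, so we go straight to the next substring.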

(The explanation above and its figures are taken from https://leetcode.windliang.cc/leetCode-30-Substring-with-Concatenation-of-All-Words.html)
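
The per-substring check from the walk-through can be isolated into a small helper. The sketch below is one possible way to write it; the function name window_matches and its parameters are hypothetical and do not appear in the original post or in the solution that follows.

```python
from collections import Counter


def window_matches(s, start, k, need):
    """Check one window of s that begins at `start`.

    `k` is the length of every word and `need` maps each word to the
    number of times it must appear (the role of HashMap1).
    """
    total = sum(need.values())  # number of words in a full window
    seen = Counter()            # plays the role of HashMap2
    for j in range(total):
        word = s[start + j * k : start + (j + 1) * k]
        if word not in need:
            return False        # unknown word: the window cannot match
        seen[word] += 1
        if seen[word] > need[word]:
            return False        # word used more often than allowed
    return True


need = Counter(["foo", "foo", "bar"])
print(window_matches("foofoofoo", 0, 3, need))  # False: the third foo gives 3 > 2
print(window_matches("foobarfoo", 0, 3, need))  # True
```

Because a window holds exactly as many words as the word list, a window in which no word exceeds its required count must use every word exactly the required number of times, so no separate equality check is needed.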

class Solution:
    def findSubstring(self, s, words):
        """
        :type s: str
        :type words: List[str]
        :rtype: List[int]
        """
        if not s or not words or not words[0]:
            return []
        
        k = len(words[0])
        w = len(words)
        n = len(s)
        
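        # map1: required count of each word in `words`
        map1 = {}
        # map2: count of each word seen in the current window
        map2 = {}
        
        for word in words:
            if word in map1:
                map1[word] += 1
            else:
                map1[word] = 1
        
        res = []
        # A window spans w*k characters, so valid starts are 0 .. n-w*k
        boundary = n-w*k+1
        for i in range(0,boundary):
            start = i
            while start < i+w*k:
                term = s[start:start+k]
                # Unknown word: this window cannot match
                if term not in map1:
                    break
                if term in map2:
                    map2[term] += 1
                else:
                    map2[term] = 1
                # Word used more often than required: this window fails
                if map2[term]>map1[term]:
                    break
                start += k
            # The loop reached the end of the window, so every word matched
            if start >= i+w*k:
                res.append(i)
            # Reset the per-window counts before trying the next start
            map2.clear()
        return res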
            
            
# s = Solution()
# s.findSubstring( "wordgoodgoodgoodbestword",["word","good","best","good"])