916. 单词子集

最新推荐文章于 2022-08-04 21:02:43 发布

cs_zh_xi

最新推荐文章于 2022-08-04 21:02:43 发布

阅读量212

点赞数

分类专栏： LeetCode 文章标签：算法 leetcode

本文链接：https://blog.csdn.net/SuWanWorld/article/details/104270941

版权

LeetCode 专栏收录该内容

24 篇文章 0 订阅

订阅专栏

我们给出两个单词数组 A 和 B。每个单词都是一串小写字母。

现在，如果 b 中的每个字母都出现在 a 中，包括重复出现的字母，那么称单词 b 是单词 a 的子集。例如，“wrr” 是 “warrior” 的子集，但不是 “world” 的子集。

如果对 B 中的每一个单词 b，b 都是 a 的子集，那么我们称 A 中的单词 a 是通用的。

你可以按任意顺序以列表形式返回 A 中所有的通用单词。

来源：力扣（LeetCode）
链接：https://leetcode-cn.com/problems/word-subsets

解法

如果将A中的每个单词都和B中的每个单词进行比较，效率比较低。

但如果我们可以先计算出B的特征，然后使用这个特征和A中的单词进行比较，效率则比较高。

具体做法是，对B中的每个单词，统计其中每个字母出现的次数，记录每个字母在一个单词中的最大出现次数。

比如对于B = ["eo","oo"]，e在一个单词中的最大出现次数是1，而o则是2，将这些结果记录在cnt中，cnt[i]表示字母i + 'a'在一个单词中的最大出现次数。由于小写字母共26个，所以cnt长度为26. cnt就是我们所采集的特征。

对A中的单词a，统计a中每个字母出现的次数，结果记录在cntA中，自然cntA的长度也是26，如果对每个cntA[i]有cntA[i] >= cnt[i]，则a是通用的。

代码：

class Solution {
public:
    vector<string> wordSubsets(vector<string>& A, vector<string>& B) {
        vector<int> cnt(26, 0);
        for (const string& s : B)
        {
            vector<int> tmp(26, 0);
            _getCount(s, tmp);
            for (size_t i = 0; i < tmp.size(); ++i)
            {
                if (cnt[i] < tmp[i])
                    cnt[i] = tmp[i];
            }
        }

        vector<string> ans;
        for (const string& s : A)
        {
            vector<int> cntA(26, 0);
            _getCount(s, cntA);

            size_t i = 0;
            for (; i < cnt.size(); ++i)
            {
                if (cnt[i] > cntA[i])
                    break;
            }
            
            if (i == cnt.size())
                ans.push_back(s);
        }

        return ans;
    }

private:
    void _getCount(const string& s, vector<int> &cnt)
    {
        for (char c : s)
        {
            ++cnt[c - 'a'];
        }
    }
};

欢迎关注【CPP笔记】