1078 Bigram 分词_bigram的分词方法例子-CSDN博客

本文链接：https://blog.csdn.net/weixin_44171872/article/details/108614598

题目描述：
给出第一个词 first 和第二个词 second，考虑在某些文本 text 中可能以 “first second third” 形式出现的情况，其中 second 紧随 first 出现，third 紧随 second 出现。
对于每种这样的情况，将第三个词 “third” 添加到答案中，并返回答案。

示例 1：
输入：text = “alice is a good girl she is a good student”, first = “a”, second = “good”
输出：[“girl”,“student”]

示例 2：
输入：text = “we will we will rock you”, first = “we”, second = “will”
输出：[“we”,“rock”]

提示：
1 <= text.length <= 1000
text 由一些用空格分隔的单词组成，每个单词都由小写英文字母组成
1 <= first.length, second.length <= 10
first 和 second 由小写英文字母组成

方法1：
主要思路：
（1）先将给出的两个单词拼接成一个短句，然后在给出的句子中的每个单词起始位置，匹配该短句，若是该短句匹配上了，则在该短句后面的部分，搜索是否存在可能的新的单词，若存在，则压入到结果中；

class Solution {
public:
    vector<string> findOcurrences(string text, string first, string second) {
        string sub_str=first+" "+second;//生成新的短句
        int len=sub_str.size();//短句的长度
        int end_pos=text.size()-len+1;//新的终止位置
        vector<string> res;
        //遍历句子的字符
        for(int i=0;i<end_pos;++i){
        	//若匹配上了
            if(text.substr(i,len)==sub_str){
            	//找短句的后面是否有单词存在
                int pos=i+len+1;
                string tmp;
                while(pos<text.size()&&text[pos]!=' '){
                    ++pos;
                }
                if(i+len+1<text.size()&&pos!=i+len+1){
                    res.push_back(text.substr(i+len+1,pos-i-len-1));
                }
            }
            //找下个单词的位置
            while(i<end_pos&&text[i]!=' '){
                ++i;
            }
        }
        return res;
    }
};