Censor (Hashing)

deebcjrb

Category: Hashing algorithms. Tags: hash

Article link: https://blog.csdn.net/weixin_43791787/article/details/99327568

Original problem:

frog is now an editor to censor so-called sensitive words (敏感词).

She has a long text p. Her job is relatively simple -- just to find the first occurrence of sensitive word w and remove it.

frog repeats over and over again. Help her do the tedious work.

Input

The input consists of multiple tests. For each test:

The first line contains 1 string w. The second line contains 1 string p.

(1 ≤ length of w, p ≤ 5⋅10^6; w and p consist of only lowercase letters)

Output

For each test, write 1 string which denotes the censored text.

Sample Input

    abc
    aaabcbc
    b
    bbb
    abc
    ab

Sample Output

    a
    
    ab

Summary:

Given a word string w and a page string p, repeatedly delete occurrences of w from p until none remain.

Output the final result (e.g. w=abc p=aaabcbc => ans=a).

#include<cstdio>
#include<cstring>
using namespace std;
typedef unsigned long long ULL;// unsigned overflow wraps around, so hashes are implicitly taken modulo 2^64
const int maxn = 5e6 + 50;
char w[maxn], p[maxn], ans[maxn];
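struct Hash
{
	ULL hash=131;// hash base
	ULL hash_b[maxn];// powers of the base: hash_b[i] = hash^i; a substring hash is p_hash[i] - p_hash[i-len] * hash_b[len]
	ULL w_hash;// hash of w
	ULL w_len;// length of w
	ULL p_hash[maxn];// prefix hashes of the current ans string (1-indexed)
	void init()// reset per-test state; the power table is built only once
	{
		w_hash=0;
		p_hash[0]=0;
		if(hash_b[0]==1) return;// power table already built by an earlier test
		hash_b[0]=1;
		for(int i=1;i<maxn;i++)// cover every index up to the maximum length
		hash_b[i]=hash_b[i-1]*hash;
	}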
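	bool check(int pos)
	{
		// true if the last w_len characters of ans[0..pos) hash to the same value as w
		// (hash equality only; collisions are not verified)
		if(pos >= w_len&&p_hash[pos] - p_hash[pos - w_len] * hash_b[w_len] == w_hash)
		return true;
		return false;
	}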
	void solve()
	{
		w_len=strlen(w);
		for(int i=0;i<(int)w_len;i++)// compute the hash of w
		w_hash=w_hash*hash+(w[i]-'a'+1);
		int top=0,p_len=strlen(p);// top is the current length of ans
		for(int i=0;i<p_len;i++)// ans is used as a stack; p_hash[k] is the hash of ans[0..k)
		{
			ans[top++]=p[i];
			p_hash[top]=p_hash[top-1]*hash+(p[i]-'a'+1);
			if(check(top))
			top-=w_len;// suffix equals w: pop it, which also discards its prefix hashes
		}
		ans[top]='\0';
		puts(ans);// print the result and a newline in one call
	}
}H;
int main()
{
	while(scanf("%s%s",w,p)==2)// pass the arrays directly; &w has the wrong type for %s
	{
		H.init();
		H.solve();
	}
	return 0;
}

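Approach:

Keep an ans string and append the characters of p to it one at a time. After each append, check whether the tail of ans equals w; if it does, delete that tail right away.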
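To make the idea concrete, here is a minimal sketch of the same stack approach without hashing: it compares the tail of ans against w character by character. The function name censor_naive is my own and not from the original solution. It can take O(|p|·|w|) time in the worst case, so it is only meant as a reference checker on small inputs, not as a replacement for the hashed version.

```cpp
#include <string>

// Hypothetical helper (not part of the original post): same stack idea,
// but the suffix is compared directly instead of by hash.
std::string censor_naive(const std::string& w, const std::string& p) {
    std::string ans;  // used as a stack of kept characters
    for (char c : p) {
        ans.push_back(c);
        // Does ans currently end with w?
        if (ans.size() >= w.size() &&
            ans.compare(ans.size() - w.size(), w.size(), w) == 0) {
            ans.erase(ans.size() - w.size());  // pop the matched suffix
        }
    }
    return ans;
}
```

On the three sample tests this should give "a", an empty string, and "ab".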

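Honestly, this is the first time I have seen functions placed inside a struct. I did not know a struct could contain functions; damn, structs turn out to be this powerful.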

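I feel the hard part of this problem is deciding whether w matches the substring of length w_len that ends at position pos of the text.

That is the condition in the check function: if(pos >= w_len&&p_hash[pos] - p_hash[pos - w_len] * hash_b[w_len] == w_hash). Here p_hash[pos] - p_hash[pos - w_len] * hash_b[w_len] is the hash of the last w_len characters, because multiplying the shorter prefix hash by hash^w_len lines it up with the longer one before subtracting.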
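The subtraction can be checked in isolation. The sketch below is an illustration only: PrefixHash and range are names I made up, and it hashes a fixed string rather than the changing ans stack used in the solution. It uses the same base 131, the same letter mapping ('a' maps to 1), and the same wrap-around modulo 2^64.

```cpp
#include <string>
#include <vector>

typedef unsigned long long ULL;

// Hypothetical demo type (not from the original post).
struct PrefixHash {
    std::vector<ULL> pre;  // pre[i] = hash of s[0..i)
    std::vector<ULL> pw;   // pw[i]  = base^i
    explicit PrefixHash(const std::string& s, ULL base = 131)
        : pre(s.size() + 1, 0), pw(s.size() + 1, 1) {
        for (size_t i = 0; i < s.size(); ++i) {
            pre[i + 1] = pre[i] * base + (ULL)(s[i] - 'a' + 1);
            pw[i + 1] = pw[i] * base;
        }
    }
    // Hash of s[l..r); unsigned arithmetic wraps modulo 2^64.
    ULL range(size_t l, size_t r) const {
        return pre[r] - pre[l] * pw[r - l];
    }
};
```

For example, with PrefixHash h("aaabcbc") and PrefixHash w("abc"), h.range(2, 5) covers the characters "abc" and should equal w.range(0, 3), while h.range(0, 3) covers "aaa" and should differ.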
