[jzoj5462]【NOIP2017提高A组冲刺11.8】好文章

最新推荐文章于 2019-02-05 19:11:59 发布

FarmerJohnLYH

最新推荐文章于 2019-02-05 19:11:59 发布

阅读量562

点赞数 1

分类专栏： hash 纪中的 Fortune OJ 文章标签：哈希

本文链接：https://blog.csdn.net/farmerjohnofzs/article/details/78493573

版权

纪中的 Fortune OJ 同时被 2 个专栏收录

43 篇文章 3 订阅

订阅专栏

hash

1 篇文章 0 订阅

订阅专栏

标签：哈希

传送门

Solution

题目实际要求我们求出重复的子串数

容易想到的可以求出每个子串的 hash 值然后排序最后扫一遍

那么这里也浅谈一下 hash 算法

My Style

我一般会设两个质数称较小的为 p 较大的为 P

比较基本的

H S t r = \sum s t r i * p i (m o d P)

$H_{Str}=\sum str_i*p^i (mod P)$

为了防止被卡时间减少代码复杂度

我们同时用多个 hash

一个或两个作为键值剩下的用于判断是否完全相同

这道题的特点在于所有字符串都是一个串的子串

所以令设 $h_i$ 表示 str[1,i] 的 hash 值

用上述的方式计算即可简便地判断

具体见标程的 hash 函数

Code

#include <cmath>
#include <cstdio>
#include <cstring>
#include <iostream>
#include <algorithm>
#define fo(i,x,y) for (int i=(x);i<=(y);++i)
#define fd(i,x,y) for (int i=(x);i>=(y);--i)
#define oo 2139062143
using namespace std;
const int N=200200,PRI1=39916801/*11!+1*/,PRI2=9191891;
int n,m;
char st[N];
struct node{
    int x,y;
}c[N];
int h1[N],h2[N];
int p1[N],p2[N];
int hash1(int sta)
{
    int l=sta,r=sta+m-1;
    return(1ll*(h1[r]-h1[l-1]+PRI1)%PRI1*p1[n-r]%PRI1);
}
int hash2(int sta)
{
    int l=sta,r=sta+m-1;
    return(1ll*(h2[r]-h2[l-1]+PRI2)%PRI2*p2[n-r]%PRI2);
}
bool cmp(node a,node b)
{
    return(a.x<b.x||(a.x==b.x&&a.y<b.y));
}
int main()
{
    // freopen("article.in","r",stdin);
    // freopen("article.out","w",stdout);
    scanf("%d%d\n%s",&n,&m,st+1);
    p1[0]=p2[0]=1,p1[1]=9209,p2[1]=3881;
    fo(i,2,n+100) p1[i]=(1ll*p1[i-1]*p1[1])%PRI1,p2[i]=(1ll*p2[i-1]*p2[1])%PRI2; 
    fo(i,1,n) 
    {
        int now=st[i]-'a'+1;
        h1[i]=(h1[i-1]+1ll*now*p1[i])%PRI1,h2[i]=(h2[i-1]+1ll*now*p2[i])%PRI2;

    }int tot=n-m+1;
    int tmp1=hash1(1);
    int tmp2=hash1(2); 
    fo(i,1,tot)
        c[i].x=hash1(i),c[i].y=hash2(i);    
    sort(c+1,c+1+tot,cmp);
    int ans=tot;
    fo(i,2,tot)
        if(c[i].x==c[i-1].x&&c[i].y==c[i-1].y) 
            --ans;
    printf("%d\n",ans);
    return 0;
}

FarmerJohnLYH

关注

1
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
[jzoj5462]【NOIP2017提高A组冲刺11.8】好文章

标签：哈希传送门Solution题目实际要求我们求出重复的子串数容易想到的可以求出每个子串的 hash 值然后排序最后扫一遍那么这里也浅谈一下 hash 算法My Style我一般会设两个质数称较小的为 p 较大的为 P比较基本的 HStr=∑stri∗pi(modP)H_{Str}=\sum str_i*p^i (mod P)为了防止被卡时间减少代码复杂度我们同时用多个 hash
复制链接

扫一扫