hdu2328(KMP+枚举)

最新推荐文章于 2018-11-23 17:12:03 发布

晓风残月xj

最新推荐文章于 2018-11-23 17:12:03 发布

阅读量879

点赞数

分类专栏：数据结构-KMP 文章标签：数据结构 KMP 动态规划

本文链接：https://blog.csdn.net/xiaofengcanyuexj/article/details/10013459

版权

数据结构-KMP 专栏收录该内容

13 篇文章 0 订阅

订阅专栏

本博客探讨了如何通过查找多个商标的最长公共子串来帮助企业清晰地定义其‘企业标识’，包括公司Logo和其他标识，如商标。通过分析多个商标，找到共同的部分并加以强调，形成新的标识方案，同时保持对现有标识的尊重。

摘要由CSDN通过智能技术生成

Corporate Identity

Time Limit: 9000/3000 MS (Java/Others) Memory Limit: 65536/32768 K (Java/Others)
Total Submission(s): 186 Accepted Submission(s): 89

Problem Description

Beside other services, ACM helps companies to clearly state their “corporate identity”, which includes company logo but also other signs, like trademarks. One of such companies is Internet Building Masters (IBM), which has recently asked ACM for a help with their new identity. IBM do not want to change their existing logos and trademarks completely, because their customers are used to the old ones. Therefore, ACM will only change existing trademarks instead of creating new ones.

After several other proposals, it was decided to take all existing trademarks and find the longest common sequence of letters that is contained in all of them. This sequence will be graphically emphasized to form a new logo. Then, the old trademarks may still be used while showing the new identity.

Your task is to find such a sequence.

Input

The input contains several tasks. Each task begins with a line containing a positive integer N, the number of trademarks (2 ≤ N ≤ 4000). The number is followed by N lines, each containing one trademark. Trademarks will be composed only from lowercase letters, the length of each trademark will be at least 1 and at most 200 characters.

After the last trademark, the next task begins. The last task is followed by a line containing zero.

Output

For each task, output a single line containing the longest string contained as a substring in all trademarks. If there are several strings of the same length, print the one that is lexicographically smallest. If there is no such non-empty string, output the words “IDENTITY LOST” instead.

Sample Input

  
  
   
   3
aabbaabb
abbababb
bbbbbabb
2
xyz
abc
0

Sample Output

  
  
   
   abb
IDENTITY LOST

Source

CTU Open Contest 2007

Recommend

teddy

本题要求多个字符串的最长公共子串。

求最长公共子串是个常见的算法。但本题所求的是多个字符串的，排除常见的两个字符串的DP做法。

可以用后缀数组，但我刚刚接触后缀数组，不能灵活运用。

尝试用KMP+枚举，不料没有超时。

找到最短的字符串，枚举该字符串的子串，然后分别与其他字符串做KMP，找到长度最长且在长度相同的情况下字典序最小的公共子串。

#include<iostream>
#include<cstdio>
#include<cstring>
using namespace std;

const int MAXN=200+10;
char str[4000+10][MAXN];
int next[MAXN];

//KMP算法中计算next[]数组
void getNext(char *p)
{
    int j,k,len=strlen(p);
    j=0;
    k=-1;
    next[0]=-1;
    while(j<len)
    {
        if(k==-1||p[j]==p[k])
        {
            next[++j]=++k;
        }
        else k=next[k];
    }
}


//返回模式串T在主串S中首次出现的位置
//返回的位置是从0开始的,若没有找到返回-1。
int KMP_Index(char *W,char *T)
{
    int i=0, j=0,wlen=strlen(W),tlen=strlen(T);
    getNext(W);
    while(i<tlen&&j<wlen)
    {
        if(j==-1||T[i]==W[j])
        {
            i++; j++;
        }
        else
            j=next[j];
    }
    if(j==wlen)
        return i-wlen;
    else
        return -1;
}


int main()
{
	int n,i,j,k,Minlen,Minid;
	char tmp[MAXN],ans[MAXN];
	bool flag;
	while(~scanf("%d",&n),n)
	{
		scanf("%s",str[0]);
		Minlen=strlen(str[0]);
		flag=false;
		for(i=1;i<n;i++)
		{
			scanf("%s",str[i]);
			Minid=0;
			if(strlen(str[i])<Minlen)
			{
				Minid=i;Minlen=strlen(str[i]);
			}
		}
	//	printf("str[Minid]=%s\n",str[Minid]);

		ans[0]=0;
		for(i=Minlen;i>=0;i--)
		{
			for(j=0;j<=Minlen-i;j++)
			{
				strncpy(tmp,str[Minid]+j,i);
				tmp[i]=0;
			//	printf("tmp=%s\n",tmp);
				
				for(k=0;k<n;k++)
				{
					if(k!=Minid)
					{
						if(KMP_Index(tmp,str[k])==-1)
							break;
					}
				}
				
				if(k==n)
				{
					if(strlen(ans)<i)
						strcpy(ans,tmp);
					else if(strlen(ans)==i&&strcmp(ans,tmp)>0)
						strcpy(ans,tmp);
					flag=true;
				}
			}
			//if(flag)break;
		}
	if(ans[0]==0)
		printf("IDENTITY LOST\n");
	else printf("%s\n",ans);
	}
	return  0;
}