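This problem can actually be solved without a trie at all, but I used one anyway to get some practice with tries.
The problem gives k keywords (all lowercase),
then e sentences, and asks for the sentence(s) in which keywords occur the most times (words in a sentence may be uppercase, and several sentences may tie for the same keyword count).
Note: do not split the sentences directly with stringstream, because two words may be joined by nothing but punctuation. The splitting has to be written by hand here, but it is not much code.
Note: after printing the answer for each set, print one extra blank line.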
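The hand-written splitting mentioned above could look roughly like the sketch below. The helper name `split_words` is made up for illustration and is not part of the solution that follows; it assumes, as the problem statement says, that every non-alphabetic character acts as a delimiter.

```cpp
#include <cctype>
#include <string>
#include <vector>

// Hypothetical helper (not in the original solution): split a line into
// lowercase words, treating every non-alphabetic character as a delimiter.
std::vector<std::string> split_words(const std::string& line) {
    std::vector<std::string> words;
    std::string cur;
    for (unsigned char ch : line) {
        if (std::isalpha(ch)) {
            cur += static_cast<char>(std::tolower(ch));
        } else if (!cur.empty()) {
            words.push_back(cur);   // a delimiter closes the current word
            cur.clear();
        }
    }
    if (!cur.empty()) words.push_back(cur);  // flush a word that ends the line
    return words;
}
```

For an input such as "dog,ate", whitespace-based splitting with stringstream would yield the single token "dog,ate", while this sketch yields the two words "dog" and "ate".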
Excuses, Excuses!
Description
Judge Ito is having a problem with people subpoenaed for jury duty giving rather lame excuses in order to avoid serving. In order to reduce the amount of time required listening to goofy excuses, Judge Ito has asked that you write a program that will search for a list of keywords in a list of excuses identifying lame excuses. Keywords can be matched in an excuse regardless of case.
Input
Input to your program will consist of multiple sets of data. Line 1 of each set will contain exactly two integers. The first number (1 <= K <= 20) defines the number of keywords to be used in the search. The second number (1 <= E <= 20) defines the number of excuses in the set to be searched. Lines 2 through K+1 each contain exactly one keyword. Lines K+2 through K+1+E each contain exactly one excuse. All keywords in the keyword list will contain only contiguous lower case alphabetic characters of length L (1 <= L <= 20) and will occupy columns 1 through L in the input line. All excuses can contain any upper or lower case alphanumeric character, a space, or any of the following punctuation marks [".,!?] not including the square brackets and will not exceed 70 characters in length. Excuses will contain at least 1 non-space character.
Output
For each input set, you are to print the worst excuse(s) from the list. The worst excuse(s) is/are defined as the excuse(s) which contains the largest number of incidences of keywords. If a keyword occurs more than once in an excuse, each occurrence is considered a separate incidence. A keyword "occurs" in an excuse if and only if it exists in the string in contiguous form and is delimited by the beginning or end of the line or any non-alphabetic character or a space.
For each set of input, you are to print a single line with the number of the set immediately after the string "Excuse Set #". (See the Sample Output). The following line(s) is/are to contain the worst excuse(s) one per line exactly as read in. If there is more than one worst excuse, you may print them in any order. After each set of output, you should print a blank line.
Sample Input
5 3
dog
ate
homework
canary
died
My dog ate my homework.
Can you believe my dog died after eating my canary... AND MY HOMEWORK?
This excuse is so good that it contain 0 keywords.
6 5
superhighway
crazy
thermonuclear
bedroom
war
building
I am having a superhighway built in my bedroom.
I am actually crazy.
1234567890.....,,,,,0987654321?????!!!!!!
There was a thermonuclear war!
I ate my dog, my canary, and my homework ... note outdated keywords?
Sample Output
Excuse Set #1
Can you believe my dog died after eating my canary... AND MY HOMEWORK?

Excuse Set #2
I am having a superhighway built in my bedroom.
There was a thermonuclear war!

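#include<cstdio>
#include<cstring>
#include<cctype>   // isalpha
using namespace std;
#define MAXN 1000  // node limit: at most 20 keywords * 20 letters = 400 nodes, plus the root
#define MAX 26     // alphabet size
struct Trie
{
    int sz,t[MAXN][MAX];  // sz: nodes in use; t[u][c]: child of node u on letter c, -1 if none
    int jud[MAXN];        // jud[u] is non-zero iff a keyword ends at node u
    Trie()
    {
        clear();
    }
    // Reset the trie to a single root node (node 0).
    void clear()
    {
        sz=1;
        memset(t,-1,sizeof(t));
        jud[0]=0;
    }
    // Map a letter to 0..25, ignoring case.
    int idx(char c)
    {
        if(c<='Z' && c>='A')
            c=c-'A'+'a';
        return c-'a';
    }
    void insert(const char* s,int v)
    {
        int u=0,n=strlen(s);
        for(int i=0;i<n;i++)
        {
            int c=idx(s[i]);
            if(t[u][c]==-1)
            {
                memset(t[sz],-1,sizeof(t[sz]));
                jud[sz]=0;
                t[u][c]=sz++;
            }
            u=t[u][c];
        }
        jud[u]=v;
    }
    // True iff s is exactly one of the inserted keywords (case-insensitive).
    bool search(const char* s)
    {
        int u=0,n=strlen(s);
        for(int i=0;i<n;i++)
        {
            int c=idx(s[i]);
            if(t[u][c]==-1) return false;
            u=t[u][c];
        }
        return jud[u]!=0;
    }
};
Trie t;
int k,e,ans[30];
char ks[50],es[30][500],tmp[100];
int cs;
int main()
{
    cs=1;
    while(scanf("%d%d",&k,&e)==2)
    {
        t.clear();
        for(int i=0;i<k;i++)
        {
            scanf("%s",ks);
            t.insert(ks,1);
        }
        // Skip the rest of the last keyword line before reading whole lines.
        int ch;
        while((ch=getchar())!='\n' && ch!=EOF);
        for(int i=0;i<e;i++)
        {
            // fgets replaces gets, which was removed from the language in C++14.
            if(!fgets(es[i],sizeof(es[i]),stdin)) es[i][0]='\0';
            es[i][strcspn(es[i],"\r\n")]='\0';  // strip the trailing line break
        }
        int maxs=-1,pp=0;
        for(int i=0;i<e;i++)
        {
            int p=0,cnt=0,len=strlen(es[i]);
            // j==len acts as a final delimiter, so a word that ends the line is counted too.
            for(int j=0;j<=len;j++)
            {
                if(j<len && isalpha((unsigned char)es[i][j]))
                    tmp[p++]=es[i][j];
                else
                {
                    tmp[p]='\0';
                    if(t.search(tmp)) cnt++;  // the empty word never matches because jud[0]==0
                    p=0;
                }
            }
            if(cnt>maxs)
            {
                // New best count: restart the list of worst excuses.
                pp=0;
                ans[pp++]=i;
                maxs=cnt;
            }
            else if(maxs==cnt)
                ans[pp++]=i;
        }
        printf("Excuse Set #%d\n",cs++);
        for(int i=0;i<pp;i++)
            printf("%s\n",es[ans[i]]);
        printf("\n");
    }
    return 0;
}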
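
As noted at the top, a trie is not strictly needed for this problem. A minimal sketch of the counting step with `std::set` in place of the trie might look like this. The function name `count_keywords` is hypothetical, and the sketch assumes the keywords are already stored in lowercase, as the problem guarantees.

```cpp
#include <cctype>
#include <cstddef>
#include <set>
#include <string>

// Hypothetical alternative to the trie: count how many words of one excuse
// are keywords, comparing case-insensitively. Keywords must be lowercase.
int count_keywords(const std::set<std::string>& keywords, const std::string& excuse) {
    int count = 0;
    std::string word;
    // i == excuse.size() serves as a final delimiter for a word ending the line.
    for (std::size_t i = 0; i <= excuse.size(); ++i) {
        if (i < excuse.size() && std::isalpha(static_cast<unsigned char>(excuse[i]))) {
            word += static_cast<char>(std::tolower(static_cast<unsigned char>(excuse[i])));
        } else {
            if (!word.empty() && keywords.count(word)) ++count;
            word.clear();
        }
    }
    return count;
}
```

With at most 20 keywords and excuses of at most 70 characters, the set lookup is more than fast enough; the trie above is purely for practice.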