UPC-Typo (String Hashing)

Round moon

Categories: ACM, China University of Petroleum OJ (中国石油大学OJ), Strings. Tags: string hashing, algorithms

Article link: https://blog.csdn.net/qq_35339563/article/details/108617906

There are no hopeless situations,
only people who have lost hope in their situation.

UPC-Typo

Problem Description

It is now far into the future and human civilization is ancient history. Archaeologists from a distant planet have recently discovered Earth. Among many other things, they want to decipher the English language.
They have collected many printed documents to form a dictionary, but are aware that sometimes words are not spelled correctly (typos are a universal problem). They want to classify each word in the dictionary as either correct or a typo. Naïvely, they do this using a simple rule: a typo is any word in the dictionary such that deleting a single character from that word produces another word in the dictionary.
Help these alien archaeologists out! Given a dictionary of words, determine which words are typos. That is, which words result in another word in the dictionary after deleting a single character.
For example, if our dictionary is {hoose, hose, nose, noises}, then hoose is a typo because we can obtain hose by deleting a single 'o' from hoose. But noises is not a typo because deleting any single character does not result in another word in the dictionary.
However, if our dictionary is {hoose, hose, nose, noises, noise} then the typos are hoose, noises, and noise.

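Problem Summary

We are given n strings. If deleting exactly one character from a string makes it identical to some other string in the input, that string is a typo and must be printed.
e.g.

　　ILOVEYOU and LOVEYOU: removing the I from the first string makes it equal to the second one, so the first string must be output.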

Input

The first line of input contains a single integer n, indicating the number of words in the dictionary.
The next n lines describe the dictionary. The ith of which contains the ith word in the dictionary. Each word consists only of lowercase English letters. All words are unique.
The total length of all strings is at most 1 000 000.

Output

Display the words that are typos in the dictionary. These should be output in the same order they appear in the input. If there are no typos, simply display the phrase NO TYPOS.

Sample

InputⅠ

5
hoose
hose
nose
noises
noise

OutputⅠ

hoose
noises
noise

InputⅡ

4
hose
hoose
oose
moose

OutputⅡ

hoose
moose

InputⅢ

5
banana
bananana
bannanaa
orange
orangers

OutputⅢ

NO TYPOS

Approach

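When I first saw this problem I naively took it for a giveaway: just brute-force it, store every word in an unordered_map and look things up. But...
I forgot something important: I had not counted the cost of cutting the strings. Building one deleted-character variant takes time linear in the word length, so a single word of length L costs O(L²) overall. If this approach gets TLE, this is where it gets stuck.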
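To make the cost concrete, here is a minimal sketch of that brute-force idea. The function name `find_typos_naive` is made up for illustration, and an `unordered_set` of strings stands in for the `unordered_map` mentioned above. It gives correct answers, but every candidate is built with an O(L) copy, which is exactly the part that blows up on long words.

```cpp
#include <cstddef>
#include <string>
#include <unordered_set>
#include <vector>

// Brute force (hypothetical helper): for every word, physically build each
// "one character deleted" variant and look it up in the dictionary.
// Building one variant costs O(len), so one word costs O(len^2) in total.
std::vector<std::string> find_typos_naive(const std::vector<std::string>& words) {
    std::unordered_set<std::string> dict(words.begin(), words.end());
    std::vector<std::string> typos;
    for (const std::string& w : words) {
        for (std::size_t i = 0; i < w.size(); ++i) {
            // Copy everything except position i: this is the expensive step.
            std::string cut = w.substr(0, i) + w.substr(i + 1);
            if (dict.count(cut)) {
                typos.push_back(w);  // keep input order, report each word once
                break;
            }
        }
    }
    return typos;
}
```

This is fine for the samples, but with a total length of up to 1 000 000 a single very long word already makes the quadratic term far too large.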
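Some of you may see the TLE coming and plan to precompute and store every string that a deletion can produce, but with string lengths like these the memory needed is enough to make your hair fall out.
So is there a way to get the effect of joining the two pieces of a string without allocating that much memory? Yes: hashing.
A hash function maps some object to a single value. It is the idea behind the hash table inside unordered_map (map, by contrast, is a red-black tree that compares keys directly).
String hashing treats each letter as one digit of a number written in some base larger than 26 and stores the result in an ull (unsigned long long), which simply wraps around on overflow. Choosing a prime base makes it very unlikely that two different strings get the same ID, although it cannot rule collisions out completely.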
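As a small illustration of the definition above, the sketch below hashes a whole string. The name `poly_hash` is hypothetical; the base 211 is the one used in the solution further down, and the character codes themselves are used as digits, so the base only has to exceed the largest code ('z' is 122).

```cpp
#include <string>

typedef unsigned long long ull;

// Polynomial hash (illustrative helper): read the string as a number in
// base `base`, one character per digit. ull arithmetic wraps around, so the
// result is implicitly reduced modulo 2^64.
ull poly_hash(const std::string& s, ull base = 211) {
    ull h = 0;
    for (unsigned char c : s)
        h = h * base + c;  // shift the digits so far, then append the next one
    return h;
}
```

For example, `poly_hash("ab")` is `97 * 211 + 98`, because 'a' is 97 and 'b' is 98.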
Let me illustrate the idea for this problem with a decimal analogy.
Suppose we store the number 147852369.
We need to keep
1
14
147
1478
14785
147852
1478523
14785236
147852369
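that is, its 9 prefixes. To obtain 147~~8~~52369, the number with the 8 deleted, we compute
147×10⁵ + (147852369 − 1478×10⁵) = 14752369
The first term is the prefix before the deleted digit, shifted left by the length of the suffix; the part in parentheses is the suffix after the deleted digit.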
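The same arithmetic carries over to strings once the base 10 is replaced by the hash base. The following is a minimal sketch under that assumption; `powers`, `prefix_hashes` and `hash_without` are names invented here, not part of the original solution.

```cpp
#include <cstddef>
#include <string>
#include <vector>

typedef unsigned long long ull;

// pw[i] = base^i for i = 0..n (wrapping modulo 2^64).
std::vector<ull> powers(std::size_t n, ull base) {
    std::vector<ull> pw(n + 1, 1);
    for (std::size_t i = 1; i <= n; ++i) pw[i] = pw[i - 1] * base;
    return pw;
}

// pre[j] = hash of the first j characters of s, with pre[0] = 0.
std::vector<ull> prefix_hashes(const std::string& s, ull base) {
    std::vector<ull> pre(s.size() + 1, 0);
    for (std::size_t j = 1; j <= s.size(); ++j)
        pre[j] = pre[j - 1] * base + (unsigned char)s[j - 1];
    return pre;
}

// Hash of the sequence with its j-th element (1-based) removed, in O(1):
// the prefix of length j-1 shifted past the suffix, plus the suffix itself
// (whole hash minus the shifted prefix of length j).
ull hash_without(const std::vector<ull>& pre, const std::vector<ull>& pw,
                 std::size_t j) {
    std::size_t k = pre.size() - 1;  // total length
    return pre[j - 1] * pw[k - j] + pre[k] - pre[j] * pw[k - j];
}
```

Fed with the nine decimal prefixes listed above (plus a leading 0) and powers of 10, `hash_without(pre, pw, 4)` reproduces 14752369. With base 211, deleting the second character of "hoose" gives the same value as hashing "hose" directly.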

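OK, that takes care of the memory. What is left is the unordered_map lookup, which is simple. One thing to watch: we do not know in advance how many strings there are or how long each one is, so the per-word arrays are allocated dynamically.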
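One possible way to arrange the lookup and the dynamic allocation is sketched below. It is a variant, not the submitted code: the hypothetical `find_typos_hashed` makes two passes, so that only one prefix-hash buffer is alive at a time, and it uses an `unordered_set` queried with `count()`, which never inserts anything.

```cpp
#include <cstddef>
#include <string>
#include <unordered_set>
#include <vector>

typedef unsigned long long ull;

// Two-pass sketch (illustrative): first hash all words, then test deletions.
std::vector<std::string> find_typos_hashed(const std::vector<std::string>& words,
                                           ull base = 211) {
    // Pass 1: hash every complete word and remember the longest length.
    std::unordered_set<ull> dict;
    std::size_t longest = 0;
    for (const std::string& w : words) {
        ull h = 0;
        for (unsigned char c : w) h = h * base + c;
        dict.insert(h);
        if (w.size() > longest) longest = w.size();
    }

    // Powers of the base, sized to the longest word actually seen.
    std::vector<ull> pw(longest + 1, 1);
    for (std::size_t i = 1; i <= longest; ++i) pw[i] = pw[i - 1] * base;

    // Pass 2: one reusable prefix-hash buffer, resized for each word.
    std::vector<std::string> typos;
    std::vector<ull> pre;
    for (const std::string& w : words) {
        std::size_t k = w.size();
        pre.assign(k + 1, 0);
        for (std::size_t j = 1; j <= k; ++j)
            pre[j] = pre[j - 1] * base + (unsigned char)w[j - 1];
        for (std::size_t j = 1; j <= k; ++j) {
            // Hash of w with its j-th character removed.
            ull cut = pre[j - 1] * pw[k - j] + pre[k] - pre[j] * pw[k - j];
            if (dict.count(cut)) {  // pure query, no insertion on a miss
                typos.push_back(w);
                break;
            }
        }
    }
    return typos;
}
```

Like any single-hash approach, this trusts that different strings do not collide; it compares hash values only and never the strings themselves.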

Time for the AC code

#include<algorithm>
#include<iostream>
#include<string.h>
#include <iomanip>
#include<stdio.h>
#include<utility>
#include<vector>
#include<string>
#include<math.h>
#include<cmath>
#include<queue>
#include<stack>
#include<deque>
#include<map>
#include<set>
#pragma warning(disable:4244)
#define PI 3.141592653589793
#pragma GCC optimize(2)
#define accelerate cin.tie(NULL);cout.tie(NULL);ios::sync_with_stdio(false);
#define EPS 1.0e-8
using namespace std;
typedef long long ll;
typedef unsigned long long ull;
const ll ll_inf = 9223372036854775807;
const int int_inf = 2147483647;
const short short_inf = 32767;
const char char_inf = 127;
ll gcd(ll a, ll b) { return b ? gcd(b, a % b) : a; }
inline ll read() {
	ll c = getchar(), Nig = 1, x = 0;
	while (!isdigit(c) && c != '-')c = getchar();
	if (c == '-')Nig = -1, c = getchar();
	while (isdigit(c))x = ((x << 1) + (x << 3)) + (c ^ '0'), c = getchar();
	return Nig * x;
}
inline void out(ll a) {
	if (a < 0)putchar('-'), a = -a;
	if (a >= 10)out(a / 10);
	putchar(a % 10 + '0');
}
ll phi(ll n)
{
	ll ans = n, mark = n;
	for (ll i = 2; i * i <= mark; i++)
		if (n % i == 0) { ans = ans * (i - 1) / i; while (n % i == 0)n /= i; }
	if (n > 1)ans = ans * (n - 1) / n; return ans;
}
ll qpow(ll x, ll n, ll mod) {
	ll res = 1;
	while (n > 0) {
		if (n & 1)res = (res * x) % mod;
		x = (x * x) % mod;
		n >>= 1;
	}
	return res;
}
ll mat_mod;
struct Mat {
	ll m[10][10];
};
Mat Mul(Mat A, Mat B, ll mat_size) {
	Mat res;
	memset(res.m, 0, sizeof(res.m));
	for (int i = 0; i < mat_size; i++)for (int j = 0; j < mat_size; j++)for (int k = 0; k < mat_size; k++)
		res.m[i][j] = (res.m[i][j] + (A.m[i][k] * B.m[k][j]) % mat_mod) % mat_mod;
	return res;
}
Mat mat_qpow(Mat data, ll power, ll mat_size) {
	Mat res;
	memset(res.m, 0, sizeof(res.m));
	for (int i = 0; i < mat_size; i++)res.m[i][i] = 1;
	while (power) {
		if (power & 1)res = Mul(res, data, mat_size);
		data = Mul(data, data, mat_size), power >>= 1;
	}
	return res;
}
#define Floyd for(int k = 1; k <= n; k++)for(int i=1;i<=n;i++)for(int j=1;j<=n;j++)
#define read read()
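#include<unordered_map>
ull base = 211;                  // hash base; ull arithmetic wraps, i.e. works modulo 2^64
ull hash_pow[1000001];           // hash_pow[i] = base^i
vector<string>save;              // the words, in input order
unordered_map<ull, bool>mp;      // hashes of all complete words
vector<vector<ull>>information;  // information[i][j] = hash of the first j characters of word i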
int main()
{
	accelerate;
	int n;
	cin >> n;
	hash_pow[0] = 1;
	for (int i = 1; i < 1000001; i++)
		hash_pow[i] = hash_pow[i - 1] * base;
	for (int i = 0; i < n; i++)
	{
		string temp;
		cin >> temp;
		save.push_back(temp);
		vector<ull>hash_id;	// prefix hashes of temp, hash_id[0] = 0
		hash_id.push_back(0);
		int l = temp.length();
		for (int j = 1; j <= l; j++)
			hash_id.push_back(hash_id[j - 1] * base + temp[j - 1]);
		mp[hash_id[l]] = true;	// register the hash of the whole word
		information.push_back(hash_id);
	}
	bool sw = true;
	for (int i = 0; i < n; i++)
	{
		int k = save[i].size();
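		for (int j = 1; j <= k; j++)	// try deleting the j-th character (1-based)
		{
			int r = k, l = j + 1;	// the suffix after the deleted character is [l, r]
			// prefix [1, j-1] shifted past the suffix, plus the hash of the suffix [l, r]
			ull ans = information[i][j - 1] * hash_pow[r - l + 1] + information[i][r] - information[i][l - 1] * hash_pow[r - l + 1];
			if (mp.count(ans))	// count() only queries; mp[ans] would insert a new entry on every miss
			{
				sw = false;
				cout << save[i] << '\n';
				break;
			}
		}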
	}
	if (sw)
		cout << "NO TYPOS" << endl;
}

By 轮月 (Round moon)
