mysql查询带斜杠,使用mySQL语句查找带有斜杠的几乎重复的数据

最新推荐文章于 2022-07-29 15:21:55 发布

weixin_39635314

最新推荐文章于 2022-07-29 15:21:55 发布

阅读量319

点赞数

文章标签： mysql查询带斜杠

I am have a table named 'LINK_INFO' with URLs in a field called 'URL'. The problem is, many duplicates URLs exist EXCEPT some have used a trailing / to get around the unique field requirement.

Example:

What is the statement I can use to select these cases of near duplicates, so I can delete one of them? Many thanks if you can help.

解决方案

You can just use TRIM to find all unique values;

SELECT DISTINCT TRIM(TRAILING '/' FROM url) url

FROM link_info

To delete the duplicates right away, just do a delete join;

DELETE li1

FROM link_info li1

JOIN link_info li2

WHERE TRIM(TRAILING '/' FROM li1.url) =

TRIM(TRAILING '/' FROM li2.url)

AND li1.id

Always back up your tables before running arbitrary SQL found on the net, even mine :)

EDIT: If your database machine is limited, you may want to do it using indexes and avoid loading more into memory than necessary;

-- remove all trailing slashes

UPDATE link_info

SET url=TRIM(TRAILING '/' FROM url);

-- create an index on the resulting strings (if there isn't already one)

CREATE INDEX url_index ON link_info(url);

-- delete all duplicates

DELETE li1

FROM link_info li1

JOIN link_info li2

WHERE li1.url = li2.url

AND li1.id

-- drop the index if not needed anymore

DROP INDEX url_index ON link_info;

Yet another SQLfiddle.

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

weixin_39635314

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
mysql查询带斜杠,使用mySQL语句查找带有斜杠的几乎重复的数据

I am have a table named 'LINK_INFO' with URLs in a field called 'URL'. The problem is, many duplicates URLs exist EXCEPT some have used a trailing / to get around the unique field requirement.Example:...
复制链接

扫一扫