MySQL 消除重复行的一些方法

最新推荐文章于 2024-07-30 08:00:00 发布

winter13292

最新推荐文章于 2024-07-30 08:00:00 发布

阅读量1.6k

点赞数

分类专栏： Mysql

本文链接：https://blog.csdn.net/winter13292/article/details/8870562

版权

Mysql 专栏收录该内容

30 篇文章 0 订阅

订阅专栏

今天群里有人问了个问题

我有一个表test,里面有三个字段a,b,c，现在b字段有很多重复的数据，这些重复的数据对应的a字段，有的是空的，有的是有数据的，c字段没有问题，请问：我现在要根据b字段去除重复，如果两条数据的b字段相同，而a字段一个是空的，另一个是有数据的，那么就把a字段为空的那条数据删除；如果a字段都有值，那么随便删除一个即可。我该怎么写这个sql语句？

根据其需求我设计了下表

根据其表格我想到要求如下:

1,5 6 7 随便留一个;3,2 随便删一个;留下4

SELECT * from myuser m1 , myuser m2 WHERE ((m1.b =m2.b) OR ((m1.a =m2.a) or ((m1.a IS NULL)AND (m2.a IS NULL)))) GROUP BY m1.b

上面的sql 求出了所有不重复的项

查找信息时看到下面这篇文章也不错点击打开链接

/*

MySQL 消除重复行的一些方法

---Chu Minfei

---2010-08-12 22:49:44.660

--引用转载请注明出处:http://blog.csdn.net/feixianxxx

*/

----------------全部字段重复------------------------

  --1使用表替换来删除重复项

  create table test_1(id int,value int);

  insert test_1 select 1,2 union all select 1,2 union all select 2,3;

  --建立一个和源表结构一样的空的临时表

  create table tmp like test_1;

  --向临时表插入不重复的记录

  insert tmp select distinct * from test_1;

  --删除原表

  drop table test_1;

  --更改临时表名为目标表

  rename table tmp to test_1;

  --显示

  mysql> select * from test_1;

+------+-------+

| id   | value |

+------+-------+

|    1 |     2 |

|    2 |     3 |

+------+-------+

 --2.添加auto_increment属性列(这个方法只能用于MyISAM或者BDB引擎的表)

  create table test_1(id int,value int) engine=MyISAM;

  insert test_1 select 1,2 union all select 1,2 union all select 2,3;

  alter table test_1 add id2 int not null auto_increment,

 add primary key(id,value,id2);

  select * from test_1;

+----+-------+-----+

| id | value | id2 |

+----+-------+-----+

|  1 |     2 |   1 |

|  1 |     2 |   2 |

|  2 |     3 |   1 |

+----+-------+-----+

   delete from test_1 where id2<>1;

   alter table test_1 drop id2;

   select * from test_1;

   +----+-------+

| id | value |

+----+-------+

|  1 |     2 |

|  2 |     3 |

+----+-------+

-------------------部分字段重复---------------------

--1.加索引的方式

  create table test_2(id int,value int);

  insert test_2 select 1,2 union all select 1,3 union all select 2,3;

  Alter IGNORE table test_2 add primary key(id);

  select * from test_2;

  +----+-------+

| id | value |

+----+-------+

|  1 |     2 |

|  2 |     3 |

+----+-------+

  我们可以看到 1 3 这条记录消失了 

  我们这里也可以使用Unique约束 因为有可能列中有NULL值，但是这里NULL就可以多个了..

 --2.联合表删除

  create table test_2(id int,value int);

  insert test_2 select 1,2 union all select 1,3 union all select 2,3;

  delete A from test_2 a join (select MAX(value) as v ,ID from test_2 group by id) b

  on a.id=b.id and a.value<>b.v;

  select * from test_2;

  +------+-------+

| id   | value |

+------+-------+

|    1 |     3 |

|    2 |     3 |

+------+-------+

--3.使用Increment_auto也可以就是上面全部字段去重的第二个方法

--4.容易错误的方法

--有些朋友可能会想到子查询的方法，我们来试验一下

 create table test_2(id int,value int);

 insert test_2 select 1,2 union all select 1,3 union all select 2,3;

 delete a from test_2 a where exists(select * from test_2 where a.id=id and a.value<value);

 /*ERROR 1093 (HY000): You can't specify target table 'a' for update in FROM clause*/

 

 目前，您不能从一个表中删除，同时又在子查询中从同一个表中选择。

 

 

 ------------------删除特定重复行--------------

 --主要通过order by +limit 或者直接limit 

 create table test_3(id int,value int);

 insert test_3 select 1,2 union all select 1,3 union all select 1,4 union all select 2,3;

 --这是要保留ID=1 value最小的那个记录，删除其他id为的记录

 delete from test_3 where id=1 order by value desc limit 2;

 select * from test_3;

+------+-------+

| id   | value |

+------+-------+

|    1 |     2 |

|    2 |     3 |

+------+-------+

 如果你只想删除任意的记录 保留一条 就可以去掉order by