How to remove duplicate rows in MySQL?

Join thousands of students who advanced their careers with MachineLearningPlus. Go from Beginner to Data Science Expert through a structured road map of 70+ courses in 9 core specializations. Build industry grade Data Science projects.

Problem

Given a table in MySQL, how do you remove duplicate rows?

Input

first_name	last_name	email
John	Doe	[email protected]
Jane	Smith	[email protected]
John	Doe	[email protected]
Bob	Johnson	[email protected]
Alice	Williams	[email protected]
Jane	Smith	[email protected]

Try Hands-On: Fiddle

Create Input Table: Gist

Desired Output

first_name	last_name	email
John	Doe	[email protected]
Jane	Smith	[email protected]
Bob	Johnson	[email protected]
Alice	Williams	[email protected]

Solution 1:

Using SELECT DISTINCT

    CREATE TABLE employees_no_duplicates AS
    SELECT DISTINCT *
    FROM employees;

Explanation:

To remove duplicate rows from the employees table, you can use the DISTINCT keyword in a SELECT statement to identify unique rows and then insert the results into a new table.

Alternately you can use the DELETE statement to remove duplicates from the original table. Let’s see how to do that.

Solution 2:

Using a SELF JOIN

This approach works if there is an id field which does not have the values duplicated.

    DELETE e1
    FROM employees e1
    JOIN employees e2
    ON  e1.id < e2.id
    AND e1.first_name = e2.first_name
    AND e1.last_name = e2.last_name
    AND e1.email = e2.email;

Output:

id	first_name	last_name	email
3	John	Doe	[email protected]
4	Bob	Johnson	[email protected]
5	Alice	Williams	[email protected]
6	Jane	Smith	[email protected]

Try Hands-On: Fiddle

Explanation:

This approach uses a DELETE statement with a self-join to remove duplicate rows from the original table.

It directly modifies the existing table, so be cautious when using this approach, as it permanently removes data.

How to remove duplicate rows in MySQL?

Problem

Input

Desired Output

Solution 1:

Explanation:

Solution 2:

Explanation:

Recommended Courses

Recommended Tutorial

More SQL Questions

More Articles

How to sort multiple columns in SQL and in different directions?

How to count the number of work days between two dates?

How to compute maximum of multiple columns, aks row wise max?

How to use the GROUP BY clause on multiple columns in SQL?

How to get a list of dates between two dates in SQL?

What is the difference between CROSS JOIN and INNER JOIN in SQL?

Similar Articles

Complete Introduction to Linear Regression in R

How to implement common statistical significance tests and find the p value?

Logistic Regression – A Complete Tutorial With Examples in R

Machine Learning A-Z™: Hands-On Python & R In Data Science

Free Sample Videos: