- MySQL 基礎
- MySQL - 首頁
- MySQL - 簡介
- MySQL - 特性
- MySQL - 版本
- MySQL - 變數
- MySQL - 安裝
- MySQL - 管理
- MySQL - PHP 語法
- MySQL - Node.js 語法
- MySQL - Java 語法
- MySQL - Python 語法
- MySQL - 連線
- MySQL - Workbench
- MySQL 資料庫
- MySQL - 建立資料庫
- MySQL - 刪除資料庫
- MySQL - 選擇資料庫
- MySQL - 顯示資料庫
- MySQL - 複製資料庫
- MySQL - 資料庫匯出
- MySQL - 資料庫匯入
- MySQL - 資料庫資訊
- MySQL 使用者
- MySQL - 建立使用者
- MySQL - 刪除使用者
- MySQL - 顯示使用者
- MySQL - 修改密碼
- MySQL - 授予許可權
- MySQL - 顯示許可權
- MySQL - 收回許可權
- MySQL - 鎖定使用者賬戶
- MySQL - 解鎖使用者賬戶
- MySQL 表
- MySQL - 建立表
- MySQL - 顯示錶
- MySQL - 修改表
- MySQL - 重命名錶
- MySQL - 克隆表
- MySQL - 清空表
- MySQL - 臨時表
- MySQL - 修復表
- MySQL - 描述表
- MySQL - 新增/刪除列
- MySQL - 顯示列
- MySQL - 重新命名列
- MySQL - 表鎖
- MySQL - 刪除表
- MySQL - 派生表
- MySQL 查詢
- MySQL - 查詢
- MySQL - 約束
- MySQL - INSERT 查詢
- MySQL - SELECT 查詢
- MySQL - UPDATE 查詢
- MySQL - DELETE 查詢
- MySQL - REPLACE 查詢
- MySQL - INSERT IGNORE
- MySQL - INSERT ON DUPLICATE KEY UPDATE
- MySQL - INSERT INTO SELECT
- MySQL 運算子和子句
- MySQL - WHERE 子句
- MySQL - LIMIT 子句
- MySQL - DISTINCT 子句
- MySQL - ORDER BY 子句
- MySQL - GROUP BY 子句
- MySQL - HAVING 子句
- MySQL - AND 運算子
- MySQL - OR 運算子
- MySQL - LIKE 運算子
- MySQL - IN 運算子
- MySQL - ANY 運算子
- MySQL - EXISTS 運算子
- MySQL - NOT 運算子
- MySQL - NOT EQUAL 運算子
- MySQL - IS NULL 運算子
- MySQL - IS NOT NULL 運算子
- MySQL - BETWEEN 運算子
- MySQL - UNION 運算子
- MySQL - UNION vs UNION ALL
- MySQL - MINUS 運算子
- MySQL - INTERSECT 運算子
- MySQL - INTERVAL 運算子
- MySQL 連線
- MySQL - 使用連線
- MySQL - INNER JOIN
- MySQL - LEFT JOIN
- MySQL - RIGHT JOIN
- MySQL - CROSS JOIN
- MySQL - FULL JOIN
- MySQL - 自連線
- MySQL - DELETE JOIN
- MySQL - UPDATE JOIN
- MySQL - UNION vs JOIN
- MySQL 觸發器
- MySQL - 觸發器
- MySQL - 建立觸發器
- MySQL - 顯示觸發器
- MySQL - 刪除觸發器
- MySQL - BEFORE INSERT 觸發器
- MySQL - AFTER INSERT 觸發器
- MySQL - BEFORE UPDATE 觸發器
- MySQL - AFTER UPDATE 觸發器
- MySQL - BEFORE DELETE 觸發器
- MySQL - AFTER DELETE 觸發器
- MySQL 資料型別
- MySQL - 資料型別
- MySQL - VARCHAR
- MySQL - BOOLEAN
- MySQL - ENUM
- MySQL - DECIMAL
- MySQL - INT
- MySQL - FLOAT
- MySQL - BIT
- MySQL - TINYINT
- MySQL - BLOB
- MySQL - SET
- MySQL 正則表示式
- MySQL - 正則表示式
- MySQL - RLIKE 運算子
- MySQL - NOT LIKE 運算子
- MySQL - NOT REGEXP 運算子
- MySQL - regexp_instr() 函式
- MySQL - regexp_like() 函式
- MySQL - regexp_replace() 函式
- MySQL - regexp_substr() 函式
- MySQL 函式 & 運算子
- MySQL - 日期和時間函式
- MySQL - 算術運算子
- MySQL - 數值函式
- MySQL - 字串函式
- MySQL - 聚合函式
- MySQL 其他概念
- MySQL - NULL 值
- MySQL - 事務
- MySQL - 使用序列
- MySQL - 處理重複項
- MySQL - SQL 注入
- MySQL - 子查詢
- MySQL - 註釋
- MySQL - 檢查約束
- MySQL - 儲存引擎
- MySQL - 將表匯出到 CSV 檔案
- MySQL - 將 CSV 檔案匯入資料庫
- MySQL - UUID
- MySQL - 公共表表達式
- MySQL - ON DELETE CASCADE
- MySQL - Upsert
- MySQL - 水平分割槽
- MySQL - 垂直分割槽
- MySQL - 遊標
- MySQL - 儲存函式
- MySQL - SIGNAL
- MySQL - RESIGNAL
- MySQL - 字元集
- MySQL - 校對規則
- MySQL - 萬用字元
- MySQL - 別名
- MySQL - ROLLUP
- MySQL - 今日日期
- MySQL - 字面量
- MySQL - 儲存過程
- MySQL - EXPLAIN
- MySQL - JSON
- MySQL - 標準差
- MySQL - 查詢重複記錄
- MySQL - 刪除重複記錄
- MySQL - 選擇隨機記錄
- MySQL - SHOW PROCESSLIST
- MySQL - 更改列型別
- MySQL - 重置自動遞增
- MySQL - Coalesce() 函式
- MySQL 有用資源
- MySQL - 有用函式
- MySQL - 語句參考
- MySQL - 快速指南
- MySQL - 有用資源
- MySQL - 討論
MySQL - 刪除重複記錄
MySQL 刪除重複記錄
資料庫(包括 MySQL)中的重複記錄非常常見。MySQL 資料庫以包含行和列的表的形式儲存資料。現在,當資料庫表中的兩行或多行具有相同的值時,該記錄被認為是重複的。
這種冗餘可能由於各種原因而發生:
- 該行可能被插入兩次。
- 從外部來源匯入原始資料時。
- 資料庫應用程式中可能存在錯誤。
無論原因是什麼,刪除這種冗餘對於提高資料準確性、減少錯誤或提高資料庫效能效率都非常重要。
查詢重複值
在刪除重複記錄之前,我們必須找出它們是否存在於表中。可以使用以下方法:
GROUP BY 子句
COUNT() 方法
示例
讓我們首先建立一個名為“CUSTOMERS”的表,其中包含重複值:
CREATE TABLE CUSTOMERS( ID int, NAME varchar(100) );
使用以下 INSERT 查詢,將一些記錄插入到“CUSTOMERS”表中。在這裡,我們添加了“John”作為重複記錄 3 次:
INSERT INTO CUSTOMERS VALUES (1,'John'), (2,'Johnson'), (3,'John'), (4,'John');
獲得的 CUSTOMERS 表如下所示:
| id | name |
|---|---|
| 1 | John |
| 2 | Johnson |
| 3 | John |
| 4 | John |
現在,我們使用 COUNT() 方法和 GROUP BY 子句檢索表中重複的記錄,如下面的查詢所示:
SELECT NAME, COUNT(NAME) FROM CUSTOMERS GROUP BY NAME HAVING COUNT(NAME) > 1;
輸出
獲得的輸出如下所示:
| NAME | COUNT(NAME) |
|---|---|
| John | 3 |
刪除重複記錄
要從資料庫表中刪除重複記錄,我們可以使用 DELETE 命令。但是,此 DELETE 命令可以使用兩種方法從表中刪除重複項:
使用 DELETE... JOIN
使用 ROW_NUMBER() 函式
使用 DELETE... JOIN
為了使用 DELETE... JOIN 命令從表中刪除重複記錄,我們對其自身執行內部連線。這適用於並非完全相同的案例。
例如,假設客戶記錄中存在客戶詳細資訊的重複,但序列號不斷遞增。在這裡,即使 ID 不相同,記錄也是重複的。
示例
在下面的查詢中,我們使用前面建立的 CUSTOMERS 表來使用 DELETE... JOIN 命令刪除重複記錄:
DELETE t1 FROM CUSTOMERS t1 INNER JOIN CUSTOMERS t2 WHERE t1.id < t2.id AND t1.name = t2.name;
輸出
獲得的輸出如下所示:
Query OK, 2 rows affected (0.01 sec)
驗證
我們可以使用以下 SELECT 語句驗證是否已刪除重複記錄:
SELECT * FROM CUSTOMERS;
我們可以從獲得的表中看到,該查詢刪除了重複項,並在表中保留了不同的記錄:
| ID | NAME |
|---|---|
| 2 | Johnson |
| 4 | John |
使用 ROW_NUMBER() 函式
MySQL 中的 ROW_NUMBER() 函式用於為從查詢獲得的結果集中的每一行分配一個從 1 開始的順序號。
使用此函式,MySQL 允許您檢測重複行,可以使用 DELETE 語句將其刪除。
示例
在這裡,我們將 ROW_NUMBER() 函式應用於在“NAME”列中具有重複值的 CUSTOMERS 表。我們將使用以下查詢基於“NAME”列在分割槽內分配行號:
SELECT id, ROW_NUMBER() OVER (PARTITION BY name ORDER BY name) AS row_num FROM CUSTOMERS;
獲得的輸出如下所示:
| id | row_num |
|---|---|
| 1 | 1 |
| 3 | 2 |
| 4 | 3 |
| 2 | 1 |
現在,使用以下語句刪除重複行(行號大於 1 的行):
DELETE FROM CUSTOMERS WHERE id IN( SELECT id FROM (SELECT id, ROW_NUMBER() OVER (PARTITION BY name ORDER BY name) AS row_num FROM CUSTOMERS) AS temp_table WHERE row_num>1 );
我們得到如下所示的輸出:
Query OK, 2 rows affected (0.00 sec)
要驗證是否已刪除重複記錄,請使用以下 SELECT 查詢:
SELECT * FROM CUSTOMERS;
產生的結果如下所示:
| ID | NAME |
|---|---|
| 1 | John |
| 2 | Johnson |
使用客戶端程式刪除重複記錄
我們還可以使用客戶端程式刪除重複記錄。
語法
要透過PHP程式刪除重複記錄,需要使用**mysqli**函式**query()**執行包含“DELETE”命令的內連線,如下所示:
$sql = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name"; $mysqli->query($sql);
要透過JavaScript程式刪除重複記錄,需要使用**mysql2**庫的**query()**函式執行包含“DELETE”命令的內連線,如下所示:
sql = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name"; con.query(sql)
要透過Java程式刪除重複記錄,需要使用**JDBC**函式**execute()**執行包含“DELETE”命令的內連線,如下所示:
String sql = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name"; statement.execute(sql);
要透過Python程式刪除重複記錄,需要使用**MySQL Connector/Python**的**execute()**函式執行包含“DELETE”命令的內連線,如下所示:
delete_query = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name" cursorObj.execute(delete_query)
示例
以下是程式示例:
$dbhost = 'localhost';
$dbuser = 'root';
$dbpass = 'password';
$db = 'TUTORIALS';
$mysqli = new mysqli($dbhost, $dbuser, $dbpass, $db);
if ($mysqli->connect_errno) {
printf("Connect failed: %s
", $mysqli->connect_error);
exit();
}
//printf('Connected successfully.
');
//let's create a table
$sql = "CREATE TABLE DuplicateDeleteDemo(ID int,NAME varchar(100))";
if($mysqli->query($sql)){
printf("DuplicateDeleteDemo table created successfully...!\n");
}
//now lets insert some duplicate records;
$sql = "INSERT INTO DuplicateDeleteDemo VALUES(1,'John')";
if($mysqli->query($sql)){
printf("First record inserted successfully...!\n");
}
$sql = "INSERT INTO DuplicateDeleteDemo VALUES(2,'Johnson')";
if($mysqli->query($sql)){
printf("Second record inserted successfully...!\n");
}
$sql = "INSERT INTO DuplicateDeleteDemo VALUES(3,'John')";
if($mysqli->query($sql)){
printf("Third records inserted successfully...!\n");
}
$sql = "INSERT INTO DuplicateDeleteDemo VALUES(4,'John')";
if($mysqli->query($sql)){
printf("Fourth record inserted successfully...!\n");
}
//display the table records
$sql = "SELECT * FROM DuplicateDeleteDemo";
if($result = $mysqli->query($sql)){
printf("Table records(before deleting): \n");
while($row = mysqli_fetch_array($result)){
printf("ID: %d, NAME %s",
$row['ID'],
$row['NAME']);
printf("\n");
}
}
//now lets count duplicate records
$sql = "SELECT NAME, COUNT(NAME) FROM DuplicateDeleteDemo GROUP BY NAME HAVING COUNT(NAME) > 1";
if($result = $mysqli->query($sql)){
printf("Duplicate records: \n");
while($row = mysqli_fetch_array($result)){
print_r($row);
}
}
//lets delete dupliacte records
$sql = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name";
if($mysqli->query($sql)){
printf("Duplicate records deleted successfully...!\n");
}
$sql = "SELECT ID, NAME FROM DuplicateDeleteDemo";
if($result = $mysqli->query($sql)){
printf("Table records after deleting: \n");
while($row = mysqli_fetch_row($result)){
print_r($row);
}
}
if($mysqli->error){
printf("Error message: ", $mysqli->error);
}
$mysqli->close();
輸出
獲得的輸出結果如下所示:
DuplicateDeleteDemo table created successfully...!
First record inserted successfully...!
Second record inserted successfully...!
Third records inserted successfully...!
Fourth record inserted successfully...!
Table records(before deleting):
ID: 1, NAME John
ID: 2, NAME Johnson
ID: 3, NAME John
ID: 4, NAME John
Duplicate records:
Array
(
[0] => John
[NAME] => John
[1] => 3
[COUNT(NAME)] => 3
)
Duplicate records deleted successfully...!
Table records after deleting:
Array
(
[0] => 2
[1] => Johnson
)
Array
(
[0] => 4
[1] => John
)
var mysql = require('mysql2');
var con = mysql.createConnection({
host: "localhost",
user: "root",
password: "Nr5a0204@123"
});
// Connecting to MySQL
con.connect(function (err) {
if (err) throw err;
console.log("Connected!");
console.log("--------------------------");
// Create a new database
sql = "Create Database TUTORIALS";
con.query(sql);
sql = "USE TUTORIALS";
con.query(sql);
sql = "CREATE TABLE DuplicateDeleteDemo(ID int,NAME varchar(100));"
con.query(sql);
sql = "INSERT INTO DuplicateDeleteDemo VALUES(1,'John'),(2,'Johnson'),(3,'John'),(4,'John');"
con.query(sql);
sql = "SELECT * FROM DuplicateDeleteDemo;"
con.query(sql, function(err, result){
if (err) throw err
console.log("**Records of DuplicateDeleteDemo Table:**");
console.log(result);
console.log("--------------------------");
});
//Fetching records that are duplicated in the table
sql = "SELECT NAME, COUNT(NAME) FROM DuplicateDeleteDemo GROUP BY NAME HAVING COUNT(NAME) > 1;"
con.query(sql, function(err, result){
if (err) throw err
console.log("**Records that are duplicated in the table:**");
console.log(result);
console.log("--------------------------");
});
sql = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name";
con.query(sql);
sql = "SELECT * FROM DuplicateDeleteDemo;"
con.query(sql, function(err, result){
if (err) throw err
console.log("**Records after deleting Duplicates:**");
console.log(result);
});
});
輸出
獲得的輸出結果如下所示:
Connected!
--------------------------
**Records of DuplicateDeleteDemo Table:**
[
{ ID: 1, NAME: 'John' },
{ ID: 2, NAME: 'Johnson' },
{ ID: 3, NAME: 'John' },
{ ID: 4, NAME: 'John' }
]
--------------------------
**Records that are duplicated in the table:**
[ { NAME: 'John', 'COUNT(NAME)': 3 } ]
--------------------------
**Records after deleting Duplicates:**
[ { ID: 2, NAME: 'Johnson' }, { ID: 4, NAME: 'John' } ]
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;
public class DeleteDuplicates {
public static void main(String[] args) {
String url = "jdbc:mysql://:3306/TUTORIALS";
String user = "root";
String password = "password";
ResultSet rs;
try {
Class.forName("com.mysql.cj.jdbc.Driver");
Connection con = DriverManager.getConnection(url, user, password);
Statement st = con.createStatement();
//System.out.println("Database connected successfully...!");
String sql = "CREATE TABLE DuplicateDeleteDemo(ID int,NAME varchar(100))";
st.execute(sql);
System.out.println("Table DuplicateDeleteDemo created successfully...!");
//let's insert some records into it...
String sql1 = "INSERT INTO DuplicateDeleteDemo VALUES (1,'John'), (2,'Johnson'), (3,'John'), (4,'John')";
st.execute(sql1);
System.out.println("Records inserted successfully....!");
//print table records
String sql2 = "SELECT * FROM DuplicateDeleteDemo";
rs = st.executeQuery(sql2);
System.out.println("Table records(before deleting the duplicate rcords): ");
while(rs.next()) {
String id = rs.getString("id");
String name = rs.getString("name");
System.out.println("Id: " + id + ", Name: " + name);
}
//let delete duplicate records using delete join
String sql3 = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name";
st.execute(sql3);
System.out.println("Duplicate records deleted successfully....!");
String sql4 = "SELECT * FROM DuplicateDeleteDemo";
rs = st.executeQuery(sql4);
System.out.println("Table records(after deleting the duplicate rcords): ");
while(rs.next()) {
String id = rs.getString("id");
String name = rs.getString("name");
System.out.println("Id: " + id + ", Name: " + name);
}
}catch(Exception e) {
e.printStackTrace();
}
}
}
輸出
獲得的輸出結果如下所示:
Table DuplicateDeleteDemo created successfully...! Records inserted successfully....! Table records(before deleting the duplicate rcords): Id: 1, Name: John Id: 2, Name: Johnson Id: 3, Name: John Id: 4, Name: John Duplicate records deleted successfully....! Table records(after deleting the duplicate rcords): Id: 2, Name: Johnson Id: 4, Name: John
import mysql.connector
# Establishing the connection
connection = mysql.connector.connect(
host='localhost',
user='root',
password='password',
database='tut'
)
# Creating a cursor object
cursorObj = connection.cursor()
# Creating the table 'DuplicateDeleteDemo'
create_table_query = '''CREATE TABLE DuplicateDeleteDemo(ID int, NAME varchar(100))'''
cursorObj.execute(create_table_query)
print("Table 'DuplicateDeleteDemo' is created successfully!")
# Inserting records into 'DuplicateDeleteDemo' table
sql = "INSERT INTO DuplicateDeleteDemo (ID, NAME) VALUES (%s, %s);"
values = [(1, 'John'), (2, 'Johnson'), (3, 'John'), (4, 'John')]
cursorObj.executemany(sql, values)
print("Values inserted successfully")
# Display table
display_table = "SELECT * FROM DuplicateDeleteDemo;"
cursorObj.execute(display_table)
# Printing the table 'DuplicateDeleteDemo'
results = cursorObj.fetchall()
print("\nDuplicateDeleteDemo Table:")
for result in results:
print(result)
# Retrieve the duplicate records
duplicate_records_query = """
SELECT NAME,
COUNT(NAME)
FROM DuplicateDeleteDemo
GROUP BY NAME
HAVING COUNT(NAME) > 1;
"""
cursorObj.execute(duplicate_records_query)
dup_rec = cursorObj.fetchall()
print("\nDuplicate records:")
for record in dup_rec:
print(record)
# Delete duplicate records
delete_query = "DELETE t1 FROM DuplicateDeleteDemo t1 INNER JOIN DuplicateDeleteDemo t2 WHERE t1.id < t2.id AND t1.name = t2.name"
cursorObj.execute(delete_query)
print("Duplicate records deleted successfully")
# Verification
display_table_after_delete = "SELECT * FROM DuplicateDeleteDemo;"
cursorObj.execute(display_table_after_delete)
results_after_delete = cursorObj.fetchall()
print("\nDuplicateDeleteDemo Table (After Delete):")
for result in results_after_delete:
print(result)
# Closing the cursor and connection
cursorObj.close()
connection.close()
輸出
獲得的輸出結果如下所示:
Table 'DuplicateDeleteDemo' is created successfully!
Values inserted successfully
DuplicateDeleteDemo Table:
(1, 'John')
(2, 'Johnson')
(3, 'John')
(4, 'John')
Duplicate records:
('John', 3)
Duplicate records deleted successfully
DuplicateDeleteDemo Table (After Delete):
(2, 'Johnson')
(4, 'John')