Python Pandas - 指出重複索引值,最後一齣現值除外


若要指明重複的索引值(最後出現的值除外),請使用 index.duplicated()。將 keep 引數與值 last 一起使用。

首先,匯入必需的庫 −

import pandas as pd

使用一些重複值建立索引 −

index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])

顯示索引 −

print("Pandas Index with duplicates...\n",index)

指明重複的索引值(最後一齣現的值除外)。將“keep”引數設定為“last” −

print("\nIndicating duplicate values except the last occurrence...\n", index.duplicated(keep='last'))

示例

以下是程式碼 −

import pandas as pd

# Creating the index with some duplicates
index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])

# Display the index
print("Pandas Index with duplicates...\n",index)

# Return the dtype of the data
print("\nThe dtype object...\n",index.dtype)

# get the dimensions of the data
print("\nGet the dimensions...\n",index.ndim)

# Indicate duplicate index values as True, except the last occurrence
# Set the "keep" parameter as "last"
print("\nIndicating duplicate values except the last occurrence...\n", index.duplicated(keep='last'))

輸出

這將生成以下程式碼 −

Pandas Index with duplicates...
Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'], dtype='object')

The dtype object...
object

Get the dimensions...
1

Indicating duplicate values except the last occurrence...
[False False True False False]

更新於: 13-10-2021

88 次瀏覽

開啟您的 職業

完成課程以獲得認證

開始
廣告
© . All rights reserved.