Split with pandas dataframe column duplicate values into two dataframes, one with duplicate, one without duplicate

Question

eg.

INPUT: one dataframe

   Name     id     Price
   Apple     01       13.86
   Cherry    02       13.24
   Banana    02       1.99
   Peach     03       14.76
   Orange    04       2.48

OUTPUT: two dataframes

one with with duplicate dataframe[id]:

   Name     id     Price
   Cherry    02       13.24
   Banana    02       1.99

other without duplicate dataframe[id]:

   Name     id     Price
   Apple     01       13.86
   Peach     03       14.76
   Orange    04       2.48

Many thanks

erwachen · Accepted Answer · 2022-04-05 10:15:33Z

1

INPUT: df; OUTPUT: df_duplicated, df_unique

df_duplicated = df[df['id'].duplicated(keep=False)]
df_unique = pd.concat([df, df_duplicated]).drop_duplicates(keep=False)

print(df_duplicated)
print(df_unique)

answered Apr 5, 2022 at 10:15

erwachen

215 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

SM1312 · Accepted Answer · 2022-04-05 10:16:34Z

1

noDuplicate = data.drop_duplicates('id', keep=False)
print("No Duplicates:", noDuplicate)

duplicate = data[data['id'].duplicated(keep=False)]
print("Duplicates:", duplicate)

answered Apr 5, 2022 at 10:16

SM1312

5984 silver badges18 bronze badges

Comments

user7375116 · Accepted Answer · 2022-04-05 10:08:29Z

0

You can count the occurrence of each unique identifier and then merge the result on your dataframe to get the unique and duplicate values.

As an example:

df = pd.DataFrame(data={'Id': [1, 2, 2, 3, 4]})
agg_df = df.groupby(by='Id').agg(count=('Id', 'count'))
agg_df.reset_index(inplace=True)
filtered_df = agg_df.loc[agg_df['count'] == 1].merge(df, on=['Id'])
unique_df = agg_df.loc[agg_df['count'] > 1].merge(df, on=['Id'])

answered Apr 5, 2022 at 10:08

user7375116

2131 silver badge8 bronze badges

Collectives™ on Stack Overflow

Split with pandas dataframe column duplicate values into two dataframes, one with duplicate, one without duplicate

3 Answers 3

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related