Delete duplicated rows

Question

first100 81

Hi everyone,

I have an issue with a large query: when I execute it, several of the returned rows have the same ItemBoxId. How can I remove the duplicated rows?
My query returns something like:

Id   SecondIdentifier  Year  ItemBoxId  BoxId
008  1029              2020  1C192F5D   NULL
009  1129              2020  1C192F5D   NULL

The problem is that the ItemBoxId is the same in both rows, and I would like to keep only the first row.

Thanks for the help.

EchoLiu-MSFT 14,626 Reputation points

2020-09-18T04:17:12.35+00:00

Do you have any updates?
Please remember to accept the answers if they helped. Your action would be helpful to other users who encounter the same issue and read this thread.

Echo
EchoLiu-MSFT 14,626 Reputation points

2020-09-23T02:49:56.893+00:00

Do you have any updates?
Please remember to accept the answers if they helped. Your action would be helpful to other users who encounter the same issue and read this thread.

Echo

4 answers

Answer 1

Roy wu 1

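For example, this keeps the row with the lowest Id for each ItemBoxId and deletes the others:

DELETE a
FROM tablename AS a
WHERE EXISTS (SELECT 1 FROM tablename AS b WHERE b.[ItemBoxId] = a.[ItemBoxId] AND b.[Id] < a.[Id]);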
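
Before running a delete like this, it can help to preview which rows it would remove by using the same EXISTS condition in a SELECT. The following is only a sketch: `@t` is a hypothetical table variable standing in for the real table, the column types are assumed from the sample data in the question, and the alias `b` is added for readability.

```sql
-- Hypothetical stand-in for the real table, filled with the rows from the question.
DECLARE @t TABLE (Id varchar(3), SecondIdentifier int, [Year] int,
                  ItemBoxId varchar(30), BoxId int);
INSERT INTO @t VALUES ('008', 1029, 2020, '1C192F5D', NULL),
                      ('009', 1129, 2020, '1C192F5D', NULL);

-- Preview only: list every row for which another row with the same
-- ItemBoxId and a smaller Id exists. These are the rows the DELETE targets.
SELECT a.*
FROM @t AS a
WHERE EXISTS (SELECT 1
              FROM @t AS b
              WHERE b.ItemBoxId = a.ItemBoxId
                AND b.Id < a.Id);
```

With the sample data, the row with Id 009 is the one listed, so the row with Id 008 is the one that would be kept.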


Answer 2

Hi @MassimoPallara,

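In addition to row_number(), rank() can also be used, as long as the ORDER BY column has no ties within an ItemBoxId (rows that tie all get rank 1, so none of them would be deleted).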
Please refer to:

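    declare @test table (
        Id varchar(3), SecondIdentifier int, Year int, ItemBoxId varchar(30), BoxId int)
    insert into @test values ('008', 1029, 2020, '1C192F5D', NULL),
        ('009', 1129, 2020, '1C192F5D', NULL)

    -- before: both rows share the same ItemBoxId
    select * from @test

    -- rank the rows within each ItemBoxId and delete all but the first
    ;with cte as (
        select *, rank() over (partition by ItemBoxId order by SecondIdentifier) rn
        from @test)
    delete from cte
    where rn > 1

    -- after: only the row with the lowest SecondIdentifier remains
    select * from @test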
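
One point worth checking before choosing between the two functions is how they treat ties in the ORDER BY column. The sketch below uses hypothetical data (not from the question) in which both rows have the same SecondIdentifier; `@ties` and the column aliases are made up for illustration.

```sql
-- Hypothetical data: the two rows share both ItemBoxId and SecondIdentifier.
DECLARE @ties TABLE (Id varchar(3), SecondIdentifier int, ItemBoxId varchar(30));
INSERT INTO @ties VALUES ('008', 1029, '1C192F5D'),
                         ('009', 1029, '1C192F5D');

SELECT Id,
       -- RANK gives tied rows the same value, so both rows get 1
       RANK()       OVER (PARTITION BY ItemBoxId ORDER BY SecondIdentifier) AS rnk,
       -- ROW_NUMBER always numbers 1, 2, ...; which tied row gets 1 is arbitrary
       ROW_NUMBER() OVER (PARTITION BY ItemBoxId ORDER BY SecondIdentifier) AS row_num
FROM @ties;
```

In this situation a filter such as `rnk > 1` matches nothing, while `row_num > 1` matches one row. Adding a tie-breaking column such as Id to the ORDER BY makes the choice of surviving row deterministic in either case.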

If you have any questions, please feel free to let me know.
If the response is helpful, please click "Accept Answer" and upvote it.

Best Regards
Echo


Answer 3

Jingyang Li 5,901 Volunteer Moderator

create table test (
    Id varchar(3), SecondIdentifier int, Year int, ItemBoxId varchar(30), BoxId int)
insert into test values ('008', 1029, 2020, '1C192F5D', NULL),
    ('009', 1129, 2020, '1C192F5D', NULL)

-- before: two rows share the same ItemBoxId
select * from test

-- number the rows within each ItemBoxId and delete all but the first
;with mycte as (
    select *, row_number() over (partition by ItemBoxId order by SecondIdentifier) rn
    from test)
delete from mycte where rn > 1

-- after: one row per ItemBoxId
select * from test

drop table test


Answer 4

Guoxiong 8,221

You can use ROW_NUMBER() OVER(PARTITION BY ItemBoxId ORDER BY Id, SecondIdentifier) to remove the duplicates of ItemBoxId:

;WITH CTE AS (
SELECT Id, SecondIdentifier, Year, ItemBoxId, BoxId, ROW_NUMBER() OVER(PARTITION BY ItemBoxId ORDER BY Id, SecondIdentifier) AS RN
FROM YourOutputSet
)
-- List rows without the duplicates
--SELECT Id, SecondIdentifier, Year, ItemBoxId, BoxId
--FROM CTE
--WHERE RN = 1;
-- Remove the duplicates:
DELETE FROM CTE WHERE RN > 1;
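
If the duplicates only need to disappear from the query result and no stored rows should be removed, the listing variant can be run on its own. A minimal sketch, assuming the sample rows from the question: here `@YourOutputSet` is a hypothetical table variable standing in for the output of the large query, and the column types are guesses based on the sample data.

```sql
-- Hypothetical stand-in for the result of the large query.
DECLARE @YourOutputSet TABLE (Id varchar(3), SecondIdentifier int, [Year] int,
                              ItemBoxId varchar(30), BoxId int);
INSERT INTO @YourOutputSet VALUES ('008', 1029, 2020, '1C192F5D', NULL),
                                  ('009', 1129, 2020, '1C192F5D', NULL);

WITH CTE AS (
    SELECT Id, SecondIdentifier, [Year], ItemBoxId, BoxId,
           ROW_NUMBER() OVER (PARTITION BY ItemBoxId
                              ORDER BY Id, SecondIdentifier) AS RN
    FROM @YourOutputSet
)
-- Keep only the first row of each ItemBoxId; nothing is deleted.
SELECT Id, SecondIdentifier, [Year], ItemBoxId, BoxId
FROM CTE
WHERE RN = 1;
```

With the sample data this returns the row with Id 008. In a real query, the body of the large query would take the place of `@YourOutputSet` inside the CTE.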
