How to delete duplicate records if we have huge volume of
records in a table ?
(rowid is not the correct approach)
Answers were Sorted based on User's Feedback
Answer / keyrun
Hi, There are many ways to delete the duplicates!
By Using the following Transformations you can delete the Duplicates:
Source Qualifier: Use SQ and check 'Distinct' property (If the Source is Relational)
Sorter: Use sorter and in Properties tab Check 'Distinct'(If the Source is Flat File)
Aggregator: Use Aggregator & Group by on the key port.
Is This Answer Correct ? | 7 Yes | 0 No |
Answer / chandu
select * from from <table_name> where rowid not in(select
min(rowid) from <table_name> group by column(primary or
unique key column))
Is This Answer Correct ? | 5 Yes | 0 No |
First we can count the no of records using aggregator
Transformation with group by (port) for ex, in emp table
take empno. Second we can take the Filter condition like
count=1. Other records are rejected not loaded into the target.
Hope it will help little bit...
Is This Answer Correct ? | 1 Yes | 1 No |
Answer / cmanojkumar
Hi,
Thanks for your answer.
I am sorry this question i need to post in oracle forum not
informatica.
Any way your answer is usefull for me.
Could you please tell me how can we delete with our rowid in
oracle ?
Is This Answer Correct ? | 0 Yes | 0 No |
Answer / ravikumar2614
DELETE FROM EMP E1 WHERE ROWID<
(SELECT MAX(ROWID) FROM EMP E2 WHERE E1.ROWID=E2.ROWID)
IF I AM WRONG PLEASE CORRECT ME ON ravi.info2614@gmail.com
Is This Answer Correct ? | 1 Yes | 1 No |
Answer / dilip ingole
Delete from emp where rowid not in(select min(rowid) from emp group by eid,ename)
in group by column you need mention all column in table
Is This Answer Correct ? | 0 Yes | 0 No |
there is a mapping with expression and mapping we create some condition for insert and update and followed by update startegy ,can we update and insert on a single target based on condition?
What is mapplet and a reusable transformation?
How can i set break points in debugging mode ? can explain steps please?
What is the main purpose of Unconnected lookup other than updating slowly changing dimensions? or In which case u use Unconnected lookup?
performance wise which one is better in joiner and lookup transformation?why?explain clearly?
how can u approach u r client
i have source file data empno,ename,sal 100,ram,1000 200,tyu,2000 300,gh,3000 out put as e empno,ename,sal 100,ram,null 200,tyu,1000 300,gh,2000 how to load it?
what is the diff b/w cached and unchaed look ups
what is meant by data driven.. in which scenario we use that..?
i have source as Column1 101 a 101 b 101 c so i want to load target as column1 ---> 101 abc
in my source table i want to delete first and last records and load in between records into target? how can it possible?
3 Answers HCL, Thomson Reuters,
what is parameter file?