How to delete duplicate records in a flat file source?

Answers were Sorted based on User's Feedback



How to delete duplicate records in a flat file source?..

Answer / bsgsr

conncet the flat file to sorter transformation and select
distinct option.

Is This Answer Correct ?    17 Yes 6 No

How to delete duplicate records in a flat file source?..

Answer / sushanth1234

Connect the flat file to sorter tx and make ascending and connect from sorter to aggregator
tx make group by port and one o/p port count(port) after use router tx 2 group 1)count<0 and 2)count=0 transfer to target for both group Unique and dublicate.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Informatica Interview Questions

How to write a procedure for a date which is in three different formats,and you want to load into datawarehouse in any single date formate

2 Answers   AHM,


There are 2 files, Master and User. We need to compare 2 files and prepare a output log file which lists out missing Rolename for each UserName between Master and User file. Please find the sample data- MASTER.csv ---------- Org|Tmp_UsrID|ShortMark|Rolename ---|---------|----------|------------ AUS|0_ABC_PW |ABC PW |ABC Admin PW AUS|0_ABC_PW |ABC PW |MT Deny all GBR|0_EDT_SEC|CR Edit |Editor GBR|0_EDT_SEC|CR Edit |SEC MT103 GBR|0_EDT_SEC|CR Edit |AB User USER.csv -------- Org|UserName|ShortMark|Rolename ---|--------|---------|------------ AUS|charls |ABC PW |ABC Admin PW AUS|amudha |ABC PW |MT Deny all GBR|sandya |CR Edit |Editor GBR|sandya |CR Edit |SEC MT103 GBR|sandya |CR Edit |AB User GBR|sarkar |CR Edit |Editor GBR|sarkar |CR Edit |SEC MT103 Required Output file: --------------------- Org|Tmp_UsrID|UserName|Rolename |Code ---|---------|--------|------------|-------- AUS|0_ABC_PW |charls |ABC Admin PW|MATCH AUS|0_ABC_PW |charls |MT Deny all |MISSING AUS|0_ABC_PW |amudha |ABC Admin PW|MISSING AUS|0_ABC_PW |amudha |MT Deny all |MATCH GBR|0_EDT_SEC|sandya |Editor |MATCH GBR|0_EDT_SEC|sandya |SEC MT103 |MATCH GBR|0_EDT_SEC|sandya |AB User |MATCH GBR|0_EDT_SEC|sarkar |Editor |MATCH GBR|0_EDT_SEC|sarkar |SEC MT103 |MATCH GBR|0_EDT_SEC|sarkar |AB User |MISSING Both the files are mapped through Organization, Shor_mark. So, based on each Organization, Short_Mark, for each UserName from User.csv, we need to find the Matching and Missing Rolename. I am able to bring Matching records in the output. But really I don't find any concept or logic to achieve "MISSING" records which are present in Master and not in User.csv for each UserName. Please help out guys. Let me know if you need any more information. Note:- In User.csv file, there are n number of Organization, under which n number Shortmark comes which has n number of UserName.

1 Answers  


Can we schedule a workflow(not a session) to run after completition of another workflow? Can we make a workflow depend on completition of another one?

2 Answers   GE,


WHAT IS EDM?WHAT IS THE USE IN INFORMATICA? chandumba2005@gmai.com

0 Answers   TCS,


why do we go for update strategy tr in SCD rather using the session properties?

3 Answers   IBM,






I have done MBA in 2008. i got job as business analyst in 2008 january through consultany. but after 3 months they are giving training Informatica developer. now iam continuing this job. my question is when iam going to interview HR people ask me many times like this " YOU ARE MBA GRADUATE. HOW YOU ARE SELECT THIS POSTION. IAM EXPLAINING WHAT I HAVE MENTION ABOVE". PLEASE TELL HOW IAM TELLING THIS QUESTION ANSWER.

0 Answers  


What are the join types in joiner transformation?

1 Answers   Informatica,


Explain joiner transformation in informatica

0 Answers   Informatica,


Does an informatica transformation support only aggregate expressions?

0 Answers  


What is power center repository?

1 Answers  


How can you define user defied event?

0 Answers  


Mine is Insurance Domain, So interviewer asked about terms like underwriting, disbursement amt, Reinsurance

0 Answers   TCS, Wipro,


Categories