Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


How to remove duplicates in transformer stage? in parallel
mode

Answers were Sorted based on User's Feedback



How to remove duplicates in transformer stage? in parallel mode..

Answer / kiran

partition the data by key and sort the data and click on
unique value. This will automatically delete duplicate
data.

Is This Answer Correct ?    20 Yes 3 No

How to remove duplicates in transformer stage? in parallel mode..

Answer / praveen sarva

STEP 1) TRANSFORMER STAGE PROPERTIES--> ADVANCED -->
EXECUTION MODE ---> PARLLEL

STEP 2) TRANSFORMER STAGE PROPERTIES --> INPUT -->
PARTITIONING--> PARTITION TYPE --> HASH ---> ENABLE SORT ---
> ENABLE UNIQUE

Simple u will get non duplicate records....

Is This Answer Correct ?    11 Yes 0 No

How to remove duplicates in transformer stage? in parallel mode..

Answer / kiran

i am not sure who marked my answer as wrong. Can you please
be responsible enough to state why its wrong?

Is This Answer Correct ?    1 Yes 0 No

How to remove duplicates in transformer stage? in parallel mode..

Answer / satya

run u r job in sequencial mode and sort the source data
then play with stage variable's in Transformer.

because in parallel mode data is partioned .

Is This Answer Correct ?    1 Yes 1 No

How to remove duplicates in transformer stage? in parallel mode..

Answer / prasad

Take 2 Stage variables in transformer stage

sV1 =Column_Name
sV2 =if Column_Name=sV1 Then 0 Else 1

put it constraint sV2=1 (only will get unique records)

if u want duplicates sV2=0

Is This Answer Correct ?    0 Yes 1 No

How to remove duplicates in transformer stage? in parallel mode..

Answer / santhosh

go to transformer stage properties->input->define any kind of partition over there and enable perform sort check box....

n also define the particular column need to be sorted..

it gives the sorted column out view...

Is This Answer Correct ?    1 Yes 6 No

Post New Answer

More Data Stage Interview Questions

Difference between data warehousing and olap?

0 Answers  


How and where you used hash file?

0 Answers  


What are the types of containers and how to create them?

0 Answers  


in sequtial file 2 columns avaliable,iwant only one column load the target how do it.

10 Answers   Wipro,


How to exclude first and last lines while reading data into a sequential file(having some 1000 records).I guess probably by using unix filter option but not sure which to use

6 Answers   IBM,


I am having two tables called MASTER and DETAIL. I want to insert records to both tables. But one condition is that whenever the insert for MASTER table is success then only the records will inserted into the DETAIL table, otherwise abort the job. How can u design this job?

1 Answers   TCS,


i have the source from Uk,north america how can i pass the data two tables based on the locations

2 Answers  


Why fact table is in normal form?

0 Answers  


I have a few records all are same structures data, I want to store data in multiple targets how?

4 Answers   Polaris,


what is the differeces between hash and modulus partition methods

4 Answers   TCS,


Source contains the metadata source ----- hyderabad,chennai,bangalore by using nested loop sequence select source as hyderabad in target just run target as hyderabad only not others please give me answer thanks in advance(give reply how to wrote logic using nested loop sequence)

1 Answers   IBM,


A flat file contains 200 records. I want to load first 50 records at first time running the job, second 50 records at second time running and so on, how u can develop this job?

1 Answers   TCS,


Categories