Difference in the implementation of lookup and join
stages,in joining two tables?
Answers were Sorted based on User's Feedback
Answer / kiran
Hai This is Kiran...
If u want to join more than one table ,u can use join,lookup
and merge also.
Join: it is used join more than one table based one key
column .it can perform 4 join as inner join,left outer
join,right join and full outer join.
Lokk-UP:it is used join more than one table,but not necesary
to join based ont he key column but it need data-type.it
give reference link and single out put link and give reject
link also.it can perform inner join and left outer jojn.
main difference: if the huge amount of the data contain in
reference table refer to join else look-up.
| Is This Answer Correct ? | 14 Yes | 2 No |
Answer / krishna
Generally we are using lookup for comparision purpose,
based on the reference table size we r using join or lookup.
If the reference table size is less than the main table
then use lookup stage other wise use join stage.
In lookup we have two types:
Normal lookup - reference lookup is less no of rows
compared to main table
sparse lookup - reference lookup is more no of rows
compared to main table.
But the peformance wise use join if the reference table has
more data.
Regards,
Krishna
| Is This Answer Correct ? | 8 Yes | 0 No |
Answer / zulfi123786
Hi This is Zulfi
Basically Join is used when you have large amount of data
about in millions and it performs inner join,left
outer,right outer and full outer joins
The join stage requires the incomming data to be hash
partitioned and sorted on the joining keys
The look up is used when the reference records are fewer in
number about less than one lakh and it doesnot require the
incomming source data to be sorted, instead the refrence
link should be in Entire partition mode.
In look up there are two types
Normal and Sparse
Sparse is available only when the reference is a database.
usually Normal has to be used unless when the refrence to
source rows ratio is 100:1
| Is This Answer Correct ? | 8 Yes | 0 No |
Answer / sadanand
HI All,
I would like to add one more point to JOIN.
To achieve full outer join the number of inputs need would
be only two.
The Primay table need to be sorted.
Memory used is very less compared to Lookup.
Regards,
Sadanand.
| Is This Answer Correct ? | 2 Yes | 0 No |
Answer / indian
Hi Zulfi..you are answer is more explained one and clear
we will go for Merge if we want the rejected data for every
update link
| Is This Answer Correct ? | 0 Yes | 0 No |
What is a merge?
HOW WILL YOU IMPLEMENT SURROGATE KEY IN SCD BY USING SURR_KEY GENERATOR,THE VALUE OF S_KEY SHOULD NOT REPEAT EVEN IF THE JOB IS COMPILED REPEATEDELY?
Where do the datastage jobs get stored?
DB2 connector> transformer > sequential file Data will be exported into a csv format in a sequential file. This file will be send in a email using a sequence job. Problem here is, how to avoid sending a blank csv file? When I ran the job there are chances that it might return zero records but in the sequence job csv file is going blank. how can I avoid this? thanks
convert yyyy mm dd to dd mm yyyy?
How a source file is populated?
how can we extract data with out having any common column
I HAVE EMP TABLE, 4 COLS R THERE COL1,COL2,COL3,COL4 ID-- 101,102,103,104 SAL-- 1000,4000,2000,5000 DATE-- COLUMN. I WANT TO DISPLAY THE DATA PREVIOUS MONTH HIGEST SAL ?
my soure table is emp having columns sal,deptno in the deptno 10,20,30deptno row are there expected out put is min(sal) of 10th deptno,max(sal) of 20th deptno,mean(sal) of 30th deptno using aggregation stage
Have you used Unstructured data?
What are some prerequisites for datastage?
how many dimentions and fact tables used in your project and what are names of it?