What is the difference between lookup and sparse lookup?
Answer Posted / subhash
Normal lookup will provide data for an in-memory look up
whereas Sparse lookup will access the database directly.
Normal might provide poor performance when the reference
data is huge as it has to load large data into memory.
In such scenarios a join stage may work better(It will put
the data onto an internal dataset file for each link,
and then do the join based on the data provided). If your
stream/source data is small then a sparse lookup is
suggested
as a row is retrieved from the database for each input row
instead of the full reference data being brought into
memory.
Sparse lookup sends individual SQL statement for every
incoming row (If stream/source data is huge you can imagine
the number
of times it has to hit DB and hence the down side on
performance). It can be used when you want to get the next
sequence
number from your database (Again expensive overhead on your
job as noted before). Also note that sparse lookup is only
available for DB2 and Oracle. Normal Lookup stage can have
multiple reference link but Sparse can only have one
reference
link. When normal is used it is a good practice to choose
less volume data as reference data.
| Is This Answer Correct ? | 15 Yes | 1 No |
Post New Answer View All Answers
project Steps,hits, Project level HArd things,Solved methods?
How one source columns or rows to be loaded in to two different tables?
1)How will u implement SCD2 by using surrogate key. 2)What are the disadvantages with surrogate key. 3)How will you handle nulls in your project for the varchar, integer data types. 4)Can I use two fact tables in star schema. 5)3 jobs are running on the 2 nodes after I added one more node so can I compile those jobs to run on three nodes.
How to implement complex jobs in data stage?
how to use self join using datastage ? can u tell me using stage how can we implemnet the self join
How complex jobs are implemented in datstage to improve performance?
What is difference between join, merge and lookup stage?
What is the method of removing duplicates, without the remove duplicate stage?
Can anyone tell me a difficult situation who have handled while creating Datastage jobs?
Differentiate between data file and descriptor file?
How many types of views are there in a datastage director?
Hi everyone,I have kept a project Sales And Distribution for a pharmaceutical company.can anybody explain one complex business rule that we had in our project and how did you accomplish it using DS?
A signal has a wavelength of 1 micro min in air.how far can the front of the wave travel during 1000periods?
What is the precedence of stage variables,derivations, and constraints?
Demonstrate experience in maintaining quality process standards?