Main Function of the Staging area in DWH ?

Answers were Sorted based on User's Feedback



Main Function of the Staging area in DWH ?..

Answer / guru avula

1.In a project source systems are differnet and also all
source systems are not availble in same time.for eg 1
source is availble at 1AM and 2n one at 2 AM etc
But our schedule jobs run at different time ,so we have to
pick the data from source system based on source availble
time and put into staging area.
2.All source systems table formats and column formats are
diffrent ,so we have to sync all .
For above two reasons we go for staging area

Is This Answer Correct ?    5 Yes 1 No

Main Function of the Staging area in DWH ?..

Answer / srinivas

I will add some points to first answer

Source system send the raw data what ever data they have simply they will send to us.

In staging we do the cleansing, remove-duplicate and null handling process and load the data into staging tables.

Then we applying business logic's and loading into dim and fact tables.

Is This Answer Correct ?    2 Yes 1 No

Post New Answer

More Data Stage Interview Questions

in one scenario source flat file like Fileld1 00122001550056200568 00256002360014500896 00123004560078900258 00147004560025800256 divide each 5 numbers as one column i.e here i need field1 field2 field3 field4 00122 00155 00562 00568 00256 00236 00145 00896 00123 00456 00789 00258 00147 00456 00258 00256 plz help me....

4 Answers  


What is lookup table?

5 Answers  


i have a scenario with i/p as ID,salary with values 1,1000 2,2000 and 3,4000 i need an extra column in the o/p named amount with values 2000,4000 and NULL. how can i get it?

2 Answers   L&T,


why we use hash file for lookup?

5 Answers  


1.What is the flow of Transformer? 2.How can you do INDEX table in DataStage level?

0 Answers   EDS,






What are constraints and derivations?

0 Answers  


How to enter a log in auditing table whenever a job get finished?

2 Answers   L&T,


on how many columns we can perform aggregation in the aggregator stage?

2 Answers   Reliance,


Can you explain engine tier in information server?

0 Answers  


create a job to get the previous row salary for the current row.if there is no previous row exists for the current row,then the previous row salary should be displayed as null? empid   salary   previoussalary 10      1000     null 20      2000     1000 30      3000     2000       40      4000     3000

5 Answers   Genpact,


input like 2 7 8 9 5 1 7 3 6 output:2 5 6 how to find out this plz explain?

2 Answers   HSBC,


in source is like seq file in date column have dd-mm-yy dddd-mmmm-yyyy mm-dd-yy yy-dd-mm yy-mm-dd i want to display only yy-dd-mm date formats only in tgt?

2 Answers   Wipro,


Categories