2.What is staging area ?

Answers were Sorted based on User's Feedback



2.What is staging area ?..

Answer / chandrasekar

Staging area is a nothing but storage area or parking area
to store the multiple source date. It will
transform,combine, cleanse data,do the simple business logic
and prepare the source data for datawarehouse.

Is This Answer Correct ?    2 Yes 0 No

2.What is staging area ?..

Answer / nrkreddy

Staging area is nothing but to apply our logic to extract
the data from source and cleansing the data and put the
data into meaningful and summaries of the data for
datawarehouse

Is This Answer Correct ?    4 Yes 5 No

2.What is staging area ?..

Answer / sharma

A Staging area simplifies building summaries and general
warehouse management.

Is This Answer Correct ?    4 Yes 6 No

2.What is staging area ?..

Answer / prodyot sarkar

The staging area is:-
* One or more database schema(s) or file stores used to “stage” data extracted from the source OLTP systems prior to being published to the “warehouse” where it is visible to end users.
* Data in the staging area is NOT visible to end users for queries, reports or analysis of any kind. It does not hold completed data ready for querying.
* It may hold intermediate results, (if data is pipelined through a process)
* Equally it may hold “state” data – the keys of the data held on the warehouse, and used to detect whether incoming data includes New or Updated rows. (Or deleted for that matter).
* It is likely to be equal in size (or maybe larger) than the “presentation area” itself.
* Although the “state” data – eg. Last sequence loaded may be backed up, much of the staging area data is automatically replaced during the ETL load processes, and can with care avoid adding to the backup effort. The presentation area however, may need backup in many cases.
* It may include some metadata, which may be used by analysts or operators monitoring the state of the previous loads (eg. audit information, summary totals of rows loaded etc).
* It’s likely to hold details of “rejected” entries – data which has failed quality tests, and may need correction and re-submission to the ETL process.
* It’s likely to have few indexes (compared to the “presentation area”), and hold data in a quite normalised form. The presentation area (the bit the end users see), is by comparison likely to be more highly indexed (mainly bitmap indexes), with highly denormalised tables (the Dimension tables anyway).
The staging area exists to be a separate “back room“ or “engine room” of the warehouse where the data can be transformed, corrected and prepared for the warehouse.
It should ONLY be accessible to the ETL processes working on the data, or administrators monitoring or managing the ETL process.
In summary. A typical warehouse generally has three distinct areas:-
1. Several source systems which provide data. This can include databases (Oracle, SQL Server, Sybase etc) or files or spreadsheets
2. A single “staging area” which may use one or more database schemas or file stores (depending upon warehouse load volumes).
3. One or more “visible” data marts or a single “warehouse presentation area” where data is made visible to end user queries. This is what many people think of as the warehouse – although the entire system is the warehouse – it depends upon your perspective.
The “staging area” is the middle bit.

Is This Answer Correct ?    0 Yes 6 No

Post New Answer

More Data Warehouse General Interview Questions

What are universe requirements?

0 Answers  


How do you create surrogate key using ab initio?

0 Answers  


Hi Guys, I have been to L& T last week ..Please find the questions mentioned below 1)Slowly changing dimension types 2)DML statement 3)DDL statement 4)write a syntax to create database 5) differences between count(*) and count(column) 6) differences between union and union all 7) what is decode 8) what is NVL 9) Arrange the values in ascending order 10,5,20,15,null 10)Primary and surrogate key 11)what is group by clause does? 12)what is difference between truncate and delete 13)what is difference between OLAP and OLTP ? 14)Architecture of Datawarehouse 15)how do you test the data validation from source as (flat file and xml) to staging tables. 16)what is shedule and time bound the mentioned above question asked in two rounds.all the best

2 Answers   L&T,


what is data validation strategies for data mart validation after loading process

1 Answers  


Do you view contain data?

0 Answers  






What are the stages that are required in data warehousing?

0 Answers  


Explain the structure of cold fusion?

0 Answers  


How to handle time period attribute for time dependent data?? a.)add time period as a part of pk b.)add time period simply and make it nullablec.)add time period simply and make it not null.

1 Answers  


What are the different problems that ?Data mining? can solve?

0 Answers  


Explain what type of indexing mechanism do we need to use for a typical datawarehouse?

0 Answers  


Explain the data types present in bo n wht happens if we implement view in the designer n report?

0 Answers  


Explain what is the main difference between inmon and kimball philosophies of data warehousing?

0 Answers  


Categories