What is a star schema? Why does one design this way?
Answer Posted / tanmay kumar meher
The star schema (sometimes called a star join schema) is the
simplest data warehouse schema. It consists of a single
"fact table" surrounded by "dimension" tables. The fact
table has a compound primary key with one segment (a foreign
key) for each dimension, plus additional columns of numeric,
usually additive, facts.
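As a rough illustration of the layout described above, here is a minimal sketch using Python's built-in sqlite3 module. The table and column names (fact_sales, dim_date, dim_product, dim_store, units_sold, revenue) are made up for the example and are not taken from any particular warehouse.

```python
import sqlite3

# In-memory database so the sketch leaves nothing behind.
conn = sqlite3.connect(":memory:")

# Hypothetical retail example: three dimension tables and one fact table.
conn.executescript("""
CREATE TABLE dim_date (
    date_key      INTEGER PRIMARY KEY,
    calendar_date TEXT,
    month         TEXT,
    year          INTEGER
);
CREATE TABLE dim_product (
    product_key  INTEGER PRIMARY KEY,
    product_name TEXT,
    category     TEXT
);
CREATE TABLE dim_store (
    store_key  INTEGER PRIMARY KEY,
    store_name TEXT,
    region     TEXT
);
CREATE TABLE fact_sales (
    -- one key segment per dimension
    date_key    INTEGER NOT NULL REFERENCES dim_date(date_key),
    product_key INTEGER NOT NULL REFERENCES dim_product(product_key),
    store_key   INTEGER NOT NULL REFERENCES dim_store(store_key),
    -- additive, numeric facts
    units_sold  INTEGER NOT NULL,
    revenue     REAL NOT NULL,
    PRIMARY KEY (date_key, product_key, store_key)
);
""")

# PRAGMA table_info reports each column's position in the primary key
# (0 means the column is not part of the key).
key_info = [r for r in conn.execute("PRAGMA table_info(fact_sales)") if r[5] > 0]
pk_columns = [r[1] for r in sorted(key_info, key=lambda r: r[5])]
print(pk_columns)
```

The point of the sketch is only the shape: the fact table's key is built entirely from dimension keys, and everything else in the fact table is a measure.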
The star schema makes multi-dimensional database (MDDB)
functionality possible using a traditional relational
database. Because relational databases are the most common
data management system in organizations today, implementing
multi-dimensional views of data using a relational database
is very appealing. Even if you are using a dedicated MDDB
solution, its sources are likely to be relational databases.
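To make the "multi-dimensional view on a relational database" idea a bit more concrete, the sketch below joins a fact table to two dimensions and aggregates by attributes of both, which gives something like a small two-dimensional cube. It assumes Python's sqlite3 module, and all table names, column names and figures are invented for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical two-dimension star: products and stores around a sales fact.
conn.executescript("""
CREATE TABLE dim_product (product_key INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_store   (store_key   INTEGER PRIMARY KEY, region   TEXT);
CREATE TABLE fact_sales (
    product_key INTEGER REFERENCES dim_product(product_key),
    store_key   INTEGER REFERENCES dim_store(store_key),
    revenue     INTEGER,
    PRIMARY KEY (product_key, store_key)
);
INSERT INTO dim_product VALUES (1, 'Stationery'), (2, 'Home');
INSERT INTO dim_store   VALUES (1, 'East'), (2, 'West');
INSERT INTO fact_sales  VALUES (1, 1, 10), (1, 2, 5), (2, 1, 40), (2, 2, 20);
""")

# One join per dimension, then group by the attributes of interest.
query = """
SELECT s.region, p.category, SUM(f.revenue)
FROM fact_sales AS f
JOIN dim_product AS p ON p.product_key = f.product_key
JOIN dim_store   AS s ON s.store_key   = f.store_key
GROUP BY s.region, p.category
ORDER BY s.region, p.category
"""

# Each (region, category) pair is one cell of the "cube".
cube = {(region, category): total for region, category, total in conn.execute(query)}

# "Slicing" the cube on one dimension: total revenue for the East region.
east_total = sum(total for (region, _), total in cube.items() if region == "East")
print(cube)
print(east_total)
```

A real MDDB engine does far more than this (pre-aggregation, hierarchies and so on), so treat the sketch as showing only the basic query pattern.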
Another reason for using a star schema is that it is easy to
understand. Fact tables in a star schema are mostly in
third normal form (3NF), but dimension tables are
denormalized, typically in second normal form (2NF). If you
normalize the dimension tables, the schema looks like a
snowflake (see snowflake schema) and the same problems of
relational databases arise: you need complex queries, and
business users cannot easily understand the meaning of the
data.
Although query performance may be improved by advanced DBMS
technology and hardware, highly normalized tables make
reporting difficult and applications complex.
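The difference in query complexity can be sketched with the same report written against a denormalized (star) product dimension and a normalized (snowflake) one. This is only an illustration with Python's sqlite3 module. The names dim_product, sf_product and sf_category and the sample rows are assumptions for the example, and the fact table is reduced to two key columns to keep it short.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

conn.executescript("""
CREATE TABLE fact_sales (
    date_key    INTEGER,
    product_key INTEGER,
    revenue     INTEGER,
    PRIMARY KEY (date_key, product_key)
);
INSERT INTO fact_sales VALUES (20240101, 1, 10), (20240101, 2, 40), (20240101, 3, 15);

-- Star: category name is stored directly (denormalized) in the dimension.
CREATE TABLE dim_product (
    product_key   INTEGER PRIMARY KEY,
    product_name  TEXT,
    category_name TEXT
);
INSERT INTO dim_product VALUES
    (1, 'Pen', 'Stationery'), (2, 'Lamp', 'Home'), (3, 'Notebook', 'Stationery');

-- Snowflake: category is normalized out into its own table.
CREATE TABLE sf_category (category_key INTEGER PRIMARY KEY, category_name TEXT);
CREATE TABLE sf_product (
    product_key  INTEGER PRIMARY KEY,
    product_name TEXT,
    category_key INTEGER REFERENCES sf_category(category_key)
);
INSERT INTO sf_category VALUES (1, 'Stationery'), (2, 'Home');
INSERT INTO sf_product VALUES (1, 'Pen', 1), (2, 'Lamp', 2), (3, 'Notebook', 1);
""")

# Star version: one join from the fact table to the dimension.
star_query = """
SELECT p.category_name, SUM(f.revenue)
FROM fact_sales AS f
JOIN dim_product AS p ON p.product_key = f.product_key
GROUP BY p.category_name
ORDER BY p.category_name
"""

# Snowflake version: an extra join is needed to reach the category name.
snowflake_query = """
SELECT c.category_name, SUM(f.revenue)
FROM fact_sales AS f
JOIN sf_product  AS p ON p.product_key  = f.product_key
JOIN sf_category AS c ON c.category_key = p.category_key
GROUP BY c.category_name
ORDER BY c.category_name
"""

star_rows = conn.execute(star_query).fetchall()
snowflake_rows = conn.execute(snowflake_query).fetchall()
print(star_rows)
print(snowflake_rows)
```

Both queries return the same totals per category. The snowflake version needs one more join for every level that has been normalized out, which is where the extra complexity for report writers comes from.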