In which situations do you go for a starflake schema?

Question Posted / nagaraju bhatraju


Answer Posted / nagaraju bhatraju

Star Schemas

The star schema is the simplest data warehouse schema. It
is called a star schema because the diagram resembles a
star, with points radiating from a center. The center of
the star consists of one or more fact tables and the points
of the star are the dimension tables.
A star schema contains the dimension tables mapped around one
or more fact tables.
It is a denormalised model.
There is no need to use complicated joins.
Queries return results quickly.
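
To make the star layout concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table and column names (fact_sales, dim_product, dim_date) and the sample rows are made up for illustration and are not from any real warehouse. The point is only that each dimension is reached with a single join from the fact table.

```python
import sqlite3

# Hypothetical star schema: one fact table and two denormalized dimensions.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_product (
    product_id    INTEGER PRIMARY KEY,
    product_name  TEXT,
    category_name TEXT      -- category is stored inline (denormalized)
);
CREATE TABLE dim_date (
    date_id INTEGER PRIMARY KEY,
    year    INTEGER
);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    date_id    INTEGER REFERENCES dim_date(date_id),
    amount     REAL
);
INSERT INTO dim_product VALUES (1, 'Pen', 'Stationery'), (2, 'Lamp', 'Furniture');
INSERT INTO dim_date VALUES (10, 2023), (11, 2024);
INSERT INTO fact_sales VALUES (1, 10, 5.0), (1, 11, 7.0), (2, 11, 40.0);
""")

# One join per dimension is enough to report sales by category and year.
star_rows = conn.execute("""
SELECT p.category_name, d.year, SUM(f.amount)
FROM fact_sales f
JOIN dim_product p ON p.product_id = f.product_id
JOIN dim_date d    ON d.date_id = f.date_id
GROUP BY p.category_name, d.year
ORDER BY p.category_name, d.year
""").fetchall()
print(star_rows)
```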
Snowflake schema
It is the normalised form of the star schema.
It involves deeper joins, because the tables are split into
many pieces. We can easily make modifications directly in the
tables.
We have to use complicated joins, since we have more tables.
There will be some delay in processing the query.
1. A star schema contains denormalized dimensions. A snowflake
schema contains one or more normalized dimensions.
2. A snowflake schema gives lower performance because it needs
more joins to produce a single result, so queries run slower
on a snowflake schema.
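
The two points above can be sketched with Python's sqlite3 module. All names here (dim_category, dim_product, fact_sales) and the data are hypothetical. The category is moved out of the product dimension into its own table, so the same kind of report needs one extra join, while a category rename becomes a single-row update.

```python
import sqlite3

# Hypothetical snowflake layout: the product dimension is normalized,
# with category split out into its own table.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE dim_category (
    category_id   INTEGER PRIMARY KEY,
    category_name TEXT
);
CREATE TABLE dim_product (
    product_id   INTEGER PRIMARY KEY,
    product_name TEXT,
    category_id  INTEGER REFERENCES dim_category(category_id)
);
CREATE TABLE fact_sales (
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     REAL
);
INSERT INTO dim_category VALUES (1, 'Stationery'), (2, 'Furniture');
INSERT INTO dim_product VALUES (1, 'Pen', 1), (2, 'Pencil', 1), (3, 'Lamp', 2);
INSERT INTO fact_sales VALUES (1, 5.0), (2, 3.0), (3, 40.0);
""")

# Reaching the category name now takes two joins instead of one.
snowflake_rows = conn.execute("""
SELECT c.category_name, SUM(f.amount)
FROM fact_sales f
JOIN dim_product p  ON p.product_id = f.product_id
JOIN dim_category c ON c.category_id = p.category_id
GROUP BY c.category_name
ORDER BY c.category_name
""").fetchall()
print(snowflake_rows)

# The upside of normalization: renaming a category updates exactly one row,
# and every product in that category picks up the new name.
conn.execute(
    "UPDATE dim_category SET category_name = 'Office supplies' WHERE category_id = 1"
)
renamed_products = conn.execute("""
SELECT COUNT(*)
FROM dim_product p
JOIN dim_category c ON c.category_id = p.category_id
WHERE c.category_name = 'Office supplies'
""").fetchone()[0]
print(renamed_products)
```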

The snowflake schema is a variation of the star schema used
in a data warehouse.

The snowflake schema (sometimes called snowflake join
schema) is a more complex schema than the star schema
because the tables which describe the dimensions are
normalized.

Drawbacks of "snowflaking"

- Normalization saves storage space, but in a data warehouse
the fact table, in which data values (and their associated
indexes) are stored, is typically responsible for 90% or more
of the storage requirements, so the benefit here is normally
insignificant.

- Normalization of the dimension tables ("snowflaking") can
impair the performance of a data warehouse. Whereas
conventional databases can be tuned to match the regular
pattern of usage, such patterns rarely exist in a data
warehouse. Snowflaking will increase the time taken to
perform a query, and a design goal of many data
warehouse projects is to minimize these response times.
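
The storage point in the first item above can be checked with simple arithmetic. The sketch below assumes the fact table takes 90% of total storage (the figure quoted above) and, purely as an illustrative assumption, that snowflaking halves the size of the dimension tables. The function name is made up for this example.

```python
def max_saving_fraction(fact_share: float, dimension_reduction: float) -> float:
    """Fraction of TOTAL storage saved when only the dimension tables shrink."""
    dimension_share = 1.0 - fact_share
    return dimension_share * dimension_reduction


# Assumed inputs: fact table = 90% of storage, dimensions shrink by 50%.
saving = max_saving_fraction(fact_share=0.90, dimension_reduction=0.50)
print(f"Total storage saved: {saving:.1%}")
```

Even with the dimensions cut in half, the total saving is only about 5% of the warehouse, which is why the storage benefit is usually small compared with the cost of the extra joins.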

Benefits of "snowflaking"

- If a dimension is very sparse (i.e. most of the possible
values for the dimension have no data) and/or a dimension
has a very long list of attributes which may be used in a
query, the dimension table may occupy a significant
proportion of the database and snowflaking may be
appropriate.

- A multidimensional view is sometimes added to an existing
transactional database to aid reporting. In this case, the
tables which describe the dimensions will already exist and
will typically be normalised. A snowflake schema will hence
be easier to implement.

- A snowflake schema can sometimes reflect the way in which
users think about data. Users may prefer to generate
queries using a star schema in some cases, although this
may or may not be reflected in the underlying organisation
of the database.

- Some users may wish to submit queries to the database
which, using conventional multidimensional reporting tools,
cannot be expressed within a simple star schema. This is
particularly common in data mining of customer databases,
where a common requirement is to locate common factors
between customers who bought products meeting complex
criteria. Some snowflaking would typically be required to
permit simple query tools such as Cognos Powerplay to form
such a query, especially if provision for these forms of
query wasn't anticipated when the data warehouse was first
designed.
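
For the case above where normalised tables already exist in a transactional database, one possible approach (an assumption on my part, not something the answer prescribes) is to leave those tables alone and put a view on top that looks like a flat star-style dimension. The names category, product and dim_product_flat are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Normalized tables as they might already exist in a transactional system.
CREATE TABLE category (
    category_id   INTEGER PRIMARY KEY,
    category_name TEXT
);
CREATE TABLE product (
    product_id   INTEGER PRIMARY KEY,
    product_name TEXT,
    category_id  INTEGER REFERENCES category(category_id)
);
INSERT INTO category VALUES (1, 'Stationery'), (2, 'Furniture');
INSERT INTO product VALUES (1, 'Pen', 1), (2, 'Lamp', 2);

-- A view that presents the two tables as one flat dimension.
CREATE VIEW dim_product_flat AS
SELECT p.product_id, p.product_name, c.category_name
FROM product p
JOIN category c ON c.category_id = p.category_id;
""")

# Users query the view as if it were a single denormalized dimension table.
flat_rows = conn.execute(
    "SELECT product_id, product_name, category_name "
    "FROM dim_product_flat ORDER BY product_id"
).fetchall()
print(flat_rows)
```

The underlying organisation stays snowflaked, while report writers see something that behaves like a star dimension.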

In practice, many data warehouses will normalize some
dimensions and not others, and hence use a combination of
snowflake and classic star schema.
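
A minimal sketch of such a combined ("starflake") layout, again with Python's sqlite3 module and made-up table names and data: the store dimension is kept denormalized with the region stored inline, while the product dimension is normalized into product and category tables.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Denormalized (star-style) dimension: region is stored inline.
CREATE TABLE dim_store (
    store_id   INTEGER PRIMARY KEY,
    store_name TEXT,
    region     TEXT
);
-- Normalized (snowflake-style) dimension: category is a separate table.
CREATE TABLE dim_category (
    category_id   INTEGER PRIMARY KEY,
    category_name TEXT
);
CREATE TABLE dim_product (
    product_id   INTEGER PRIMARY KEY,
    product_name TEXT,
    category_id  INTEGER REFERENCES dim_category(category_id)
);
CREATE TABLE fact_sales (
    store_id   INTEGER REFERENCES dim_store(store_id),
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     REAL
);
INSERT INTO dim_store VALUES (1, 'Store A', 'North'), (2, 'Store B', 'South');
INSERT INTO dim_category VALUES (1, 'Stationery');
INSERT INTO dim_product VALUES (1, 'Pen', 1), (2, 'Pencil', 1);
INSERT INTO fact_sales VALUES (1, 1, 5.0), (1, 2, 3.0), (2, 1, 4.0);
""")

# The store side needs one join; the product side needs two.
mixed_rows = conn.execute("""
SELECT s.region, c.category_name, SUM(f.amount)
FROM fact_sales f
JOIN dim_store s    ON s.store_id = f.store_id
JOIN dim_product p  ON p.product_id = f.product_id
JOIN dim_category c ON c.category_id = p.category_id
GROUP BY s.region, c.category_name
ORDER BY s.region, c.category_name
""").fetchall()
print(mixed_rows)
```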
