In which situations do you go for a starflake schema?
Answer / nagaraju bhatraju
Star Schemas
The star schema is the simplest data warehouse schema. It
is called a star schema because the diagram resembles a
star, with points radiating from a center. The center of
the star consists of one or more fact tables and the points
of the star are the dimension tables.
A star schema contains dimension tables arranged around one
or more fact tables.
It is a denormalised model.
There is no need for complicated joins.
Queries return results quickly.
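As a rough illustration of the points above, here is a minimal sketch of a star schema using Python's built-in sqlite3 module. The table and column names (fact_sales, dim_product, category_name) and the sample rows are made up for this example and are not from any real warehouse. The category name is stored directly on each product row, so one join from the fact table is enough to report by category.

```python
import sqlite3

# In-memory database so the sketch leaves nothing behind.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical denormalised dimension: the category name is
    -- repeated on every product row.
    CREATE TABLE dim_product (
        product_id    INTEGER PRIMARY KEY,
        product_name  TEXT,
        category_name TEXT
    );
    -- Hypothetical fact table holding the measures.
    CREATE TABLE fact_sales (
        sale_id    INTEGER PRIMARY KEY,
        product_id INTEGER REFERENCES dim_product(product_id),
        amount     REAL
    );
    INSERT INTO dim_product VALUES
        (1, 'Pen', 'Stationery'),
        (2, 'Pencil', 'Stationery'),
        (3, 'Mouse', 'Hardware');
    INSERT INTO fact_sales VALUES
        (1, 1, 10.0), (2, 2, 5.0), (3, 3, 25.0), (4, 1, 10.0);
""")

# One join (fact -> dimension) answers "sales per category".
star_totals = conn.execute("""
    SELECT d.category_name, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product AS d ON d.product_id = f.product_id
    GROUP BY d.category_name
    ORDER BY d.category_name
""").fetchall()
print(star_totals)
```

The price of the single join is redundancy: 'Stationery' is stored once per product rather than once overall.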
Snowflake schema
It is the normalised form of the star schema.
It involves deeper joins, because the tables are split into
many pieces. Modifications can easily be made directly in the
tables.
We have to use more complicated joins, since there are more tables.
There will be some delay in processing queries.
1. A star schema contains denormalized dimensions. A snowflake
schema contains one or more normalized dimensions.
2. A snowflake schema gives lower performance because it needs
more joins to produce a single result.
The snowflake schema is a variation of the star schema used
in a data warehouse.
The snowflake schema (sometimes called snowflake join
schema) is a more complex schema than the star schema
because the tables which describe the dimensions are
normalized.
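To make the normalization concrete, the sketch below shows a snowflaked product dimension in Python with sqlite3. All table names, column names and rows (dim_category, dim_product, fact_sales) are hypothetical and chosen only for illustration. The category is moved into its own table, so a report by category needs two joins instead of one.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical outer table: each category name is stored once.
    CREATE TABLE dim_category (
        category_id   INTEGER PRIMARY KEY,
        category_name TEXT
    );
    -- The product dimension now holds only a key to the category.
    CREATE TABLE dim_product (
        product_id   INTEGER PRIMARY KEY,
        product_name TEXT,
        category_id  INTEGER REFERENCES dim_category(category_id)
    );
    CREATE TABLE fact_sales (
        sale_id    INTEGER PRIMARY KEY,
        product_id INTEGER REFERENCES dim_product(product_id),
        amount     REAL
    );
    INSERT INTO dim_category VALUES (1, 'Stationery'), (2, 'Hardware');
    INSERT INTO dim_product VALUES
        (1, 'Pen', 1), (2, 'Pencil', 1), (3, 'Mouse', 2);
    INSERT INTO fact_sales VALUES
        (1, 1, 10.0), (2, 2, 5.0), (3, 3, 25.0), (4, 1, 10.0);
""")

# Two joins (fact -> product -> category) for the same report.
snowflake_totals = conn.execute("""
    SELECT c.category_name, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_product  AS p ON p.product_id  = f.product_id
    JOIN dim_category AS c ON c.category_id = p.category_id
    GROUP BY c.category_name
    ORDER BY c.category_name
""").fetchall()
print(snowflake_totals)
```

Renaming a category here touches a single row in dim_category, which is the "easy modification" side of the trade-off; the extra join is the cost.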
Drawbacks of "snowflaking"
- Normalizing the dimension tables saves some storage, but in
a data warehouse the fact table, in which the data values
(and their associated indexes) are stored, is typically
responsible for 90% or more of the storage requirements, so
the benefit here is normally insignificant.
- Normalization of the dimension tables ("snowflaking") can
impair the performance of a data warehouse. Whereas
conventional databases can be tuned to match the regular
pattern of usage, such patterns rarely exist in a data
warehouse. Snowflaking will increase the time taken to
perform a query, and a design goal of many data
warehouse projects is to minimize these response times.
Benefits of "snowflaking"
- If a dimension is very sparse (i.e. most of the possible
values for the dimension have no data) and/or a dimension
has a very long list of attributes which may be used in a
query, the dimension table may occupy a significant
proportion of the database and snowflaking may be
appropriate.
- A multidimensional view is sometimes added to an existing
transactional database to aid reporting. In this case, the
tables which describe the dimensions will already exist and
will typically be normalised. A snowflake schema will hence
be easier to implement.
- A snowflake schema can sometimes reflect the way in which
users think about data. Users may prefer to generate
queries using a star schema in some cases, although this
may or may not be reflected in the underlying organisation
of the database.
- Some users may wish to submit queries to the database
which, using conventional multidimensional reporting tools,
cannot be expressed within a simple star schema. This is
particularly common in data mining of customer databases,
where a common requirement is to locate common factors
between customers who bought products meeting complex
criteria. Some snowflaking would typically be required to
permit simple query tools such as Cognos Powerplay to form
such a query, especially if provision for these forms of
query wasn't anticipated when the data warehouse was first
designed.
In practice, many data warehouses will normalize some
dimensions and not others, and hence use a combination of
snowflake and classic star schema, sometimes called a
starflake schema.
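A mixed design of this kind might look like the following sketch, again in Python with sqlite3. Every name and row here (dim_store, dim_product, dim_category, fact_sales) is an assumption made for illustration only. The store dimension is left denormalised, star style, while the product dimension is snowflaked into a separate category table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Star-style dimension: city and region kept on the store row.
    CREATE TABLE dim_store (
        store_id   INTEGER PRIMARY KEY,
        store_name TEXT,
        city       TEXT,
        region     TEXT
    );
    -- Snowflake-style dimension: category split into its own table.
    CREATE TABLE dim_category (
        category_id   INTEGER PRIMARY KEY,
        category_name TEXT
    );
    CREATE TABLE dim_product (
        product_id   INTEGER PRIMARY KEY,
        product_name TEXT,
        category_id  INTEGER REFERENCES dim_category(category_id)
    );
    CREATE TABLE fact_sales (
        sale_id    INTEGER PRIMARY KEY,
        product_id INTEGER REFERENCES dim_product(product_id),
        store_id   INTEGER REFERENCES dim_store(store_id),
        amount     REAL
    );
    INSERT INTO dim_store VALUES
        (1, 'Store A', 'Chennai', 'South'),
        (2, 'Store B', 'Delhi', 'North');
    INSERT INTO dim_category VALUES (1, 'Stationery'), (2, 'Hardware');
    INSERT INTO dim_product VALUES (1, 'Pen', 1), (2, 'Mouse', 2);
    INSERT INTO fact_sales VALUES
        (1, 1, 1, 10.0), (2, 2, 1, 25.0), (3, 1, 2, 10.0), (4, 1, 1, 5.0);
""")

# Region comes from one join; category needs a second hop.
mixed_totals = conn.execute("""
    SELECT s.region, c.category_name, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_store    AS s ON s.store_id    = f.store_id
    JOIN dim_product  AS p ON p.product_id  = f.product_id
    JOIN dim_category AS c ON c.category_id = p.category_id
    GROUP BY s.region, c.category_name
    ORDER BY s.region, c.category_name
""").fetchall()
print(mixed_totals)
```

Which dimensions to normalize is a judgment call per dimension; the sketch only shows that both styles can sit around the same fact table.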
Answer / venugopal
Mainly in real time: when we want to use an existing data
warehouse as the source, we go for a snowflake schema.
Answer / prabakaran
When we have single or multiple facts and are not concerned
about query performance.
When we don't have disk storage limitations.
Answer / vidyanand
The star schema is usually preferable, because its query performance
is higher than that of the snowflake schema.
The snowflake schema is used only where performance is
not a concern.
So it is better to use a star schema than a snowflake schema.
Answer / r.lakshmi
To design a DW, a starflake schema is an effective method to
process the business instead of queries. It provides speedier
retrieval of information than a snowflake schema.
Answer / padmavathi
When we are concerned about query performance, we go for
star schema modelling. Because this model allows data
redundancy, queries that retrieve data do not need
as many join conditions.