wht is cdc?how to use it in creation of mappings?
Answers were Sorted based on User's Feedback
Answer / bidhar
CDC is used when you want to pull the records which have
changed or newly added in the OLTP system.
Normally the OLTP tables have 2 columns
last_updated_timestamp and Added_timstamp.
Whenever a new record is added for the first time in these
tables then both the columns have the same timestamp ie
System timestamp.
Then when that particular record is changed only the column
last_updated_timestamp will change and the other column
Added_timstamp will remain same forever.
Now you need to pull this record when it was added as well
as when it was modified to keep your warehouse in sync with
OLTP system.
So based on last_updated_timestamp column (not
Added_timstamp)you need to pull the records.
This can be achieved by overriding the SQ query in the
where clause.
Example :- if product table in OLTP has 2000 records on
11th may and on 12th may 10 new records have come up and 5
records have been changed.Then in the next load 15 records
should be pulled to your warehouse.
Select Prd_nam,Typ,grp,category from product where
last_updated_timestamp>&&date_parameter
This is your SQ override query.
&&date_parameter is a mapping parameter which can be picked
up from a file.(you need to have the previous load max date
in that file which will be used as mapping parameter).
Hope this clears your doubt.
| Is This Answer Correct ? | 17 Yes | 0 No |
Answer / infa developer
Not always necessary to be a Date field. When there are 5
key colummns and you want to check if any of them changed
and would like to extract only those records with these 5
key column values updated then you will have use this
concept of change data capture.
| Is This Answer Correct ? | 7 Yes | 1 No |
Answer / vaibhav bhandeo
CDC stands for change data capture. This is used to
implement incremental load approach in data warehouse.
In this approach we traditionally keep a date field and
pull data on incremental date values.
This assures that we are picking latest data (Or may be new
batch data which ever is applicable).
Implementing this in mapping would require you to use
parameter files which will keep HiWaterMark and LoWatermark
which is repeatedly used to capture fresh data.
| Is This Answer Correct ? | 6 Yes | 5 No |
Answer / satya
CDC defines Whenever data is changed in OLTP Systems Only
that data Will be captured and loaded into our Target
SCD's works internally based on cdc logic
basically cdc's implented by using effective date
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / sreekanth
CDC stands for change data capture
Daily several transaction may happen among those some may
adds new records to the OLTP and some transaction updates
the existing so we need to capture the changes then while
loading the data we need to update on the existing date ware
house with changes made to OLTP for this we will maintains
when created and when updated audit column in OLTP according
those fields we can load the data to targt.
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / mike
capturing only changes is ok.. but in realtime as soon as the data changes in the source ... changes should be implmented in the target... so how's z dat done ? means hw we will know that source rows have been changed(suppose like evy 15 mins)? do we need to run the run mapping for evy 15mins? how does CDC works actually? i would really appreciate if some one can explain me?
| Is This Answer Correct ? | 0 Yes | 0 No |
Hi, Can anyone just give me the list of all versions of Informatica ? Advance Thanks
what is parameter file?
-Which expression we can not use in Maplets?, -Can we join(relate) two dimensions in a schema? -Why and where we use 'sorted input' option?
What are the performance considerations when working with aggregator transformation?
What are the Advantages of de normalized data?
How the facts will be loaded? explain
Which kind of index is preferred in DWH?
Define maping and sessions?
Can we schedule a workflow(not a session) to run after completition of another workflow? Can we make a workflow depend on completition of another one?
If we use sorted ports in aggregator transformation and somehow records are not in order by format so what will happen,session fails or it succeeded with incorrect data?
3.how will u get information about bugs how will u rectify the bugs in realtime whch tool we are using to rectify the bugs
If a session fails after loading of 10,000 records in to the target.How can you load the records from 10001?