How to improve the performance of graphs in Ab Initio?
Answer / ananya
There are many ways to improve the performance of a
graph. What works best depends on the particular graph and
the components used in it.
In general, the following tips can be used to improve
performance:
1> Try to use partitioning in the graph.
2> Try to minimise the number of components.
3> Maintain lookups for better efficiency.
4> For components such as Join and Rollup, select the
"Input must be sorted" option when they are placed after a
Sort component.
5> If a component has the "In memory: Input need not be
sorted" option selected, set the MAX_CORE parameter value
efficiently.
6> Use phasing of a graph efficiently.
7> In all graphs where RDBMS tables are used as input,
ensure that the join condition is on indexed columns.
8> Try to perform sort or aggregation operations on data
in the source tables at the database server itself, instead
of doing them in Ab Initio.
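To illustrate tips 7 and 8, here is a minimal sketch in Python, not Ab Initio. It uses the standard sqlite3 module as a stand-in for the RDBMS, and the table name orders, its columns, and the function names are all hypothetical. The first function lets the database group and order the rows; the second pulls every row out and aggregates on the client side, which is the pattern the tip advises against.

```python
import sqlite3


def build_demo_db():
    # Hypothetical demo table; sqlite3 stands in for the real source database.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE orders (customer_id INTEGER, amount REAL)")
    conn.executemany(
        "INSERT INTO orders VALUES (?, ?)",
        [(2, 10.0), (1, 5.0), (2, 7.5), (1, 2.5), (3, 1.0)],
    )
    # Index on the column used for grouping/joining (tip 7).
    conn.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")
    return conn


def totals_in_database(conn):
    # The database does the aggregation and the sort (tip 8);
    # only one row per customer leaves the server.
    query = (
        "SELECT customer_id, SUM(amount) FROM orders "
        "GROUP BY customer_id ORDER BY customer_id"
    )
    return conn.execute(query).fetchall()


def totals_in_client(conn):
    # Every row is transferred, then aggregated and sorted outside the database.
    totals = {}
    for customer_id, amount in conn.execute(
        "SELECT customer_id, amount FROM orders"
    ):
        totals[customer_id] = totals.get(customer_id, 0.0) + amount
    return sorted(totals.items())


if __name__ == "__main__":
    conn = build_demo_db()
    print(totals_in_database(conn))
    print(totals_in_client(conn))
```

Both functions return the same totals. The difference is how many rows cross the database boundary, which is what matters when the source table is large.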
Answer / mahendra garewal
Hi,
The performance of graphs can be improved by employing the
following methods.
1: Use data parallelism (but efficiently).
2: Try to use fewer phases in graphs.
3: Use component parallelism.
4: Use component folding.
5: Always use a tuned Oracle query inside the Input Table
component; this gives a large performance improvement.
6: Use as few components as possible that do not allow
pipeline parallelism.
7: Do not use huge lookups.
8: If the data is not huge, always use the in-memory sort option.
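As a rough picture of what data parallelism by key (point 1) means, here is a small Python sketch. It is not Ab Initio code, and the function names and record layout are hypothetical. Records are hash-partitioned on a key so that all records for one key land in the same partition, each partition is rolled up independently, and the partial results are merged. Python threads are used only to show the shape of the idea; they do not give real CPU parallelism.

```python
import zlib
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def partition_by_key(records, key, n_partitions):
    # Deterministic hash so the same key always goes to the same partition.
    partitions = [[] for _ in range(n_partitions)]
    for rec in records:
        h = zlib.crc32(str(rec[key]).encode("utf-8"))
        partitions[h % n_partitions].append(rec)
    return partitions


def rollup(records, key, value):
    # Sum the value field per key, similar in spirit to a rollup.
    totals = defaultdict(int)
    for rec in records:
        totals[rec[key]] += rec[value]
    return dict(totals)


def parallel_rollup(records, key, value, n_partitions=4):
    partitions = partition_by_key(records, key, n_partitions)
    with ThreadPoolExecutor(max_workers=n_partitions) as pool:
        partials = list(pool.map(lambda p: rollup(p, key, value), partitions))
    merged = {}
    for partial in partials:
        # Keys are disjoint across partitions, so a plain update is enough.
        merged.update(partial)
    return merged


if __name__ == "__main__":
    data = [
        {"cust": "a", "amt": 3},
        {"cust": "b", "amt": 4},
        {"cust": "a", "amt": 5},
        {"cust": "c", "amt": 1},
    ]
    print(parallel_rollup(data, "cust", "amt"))
```

The "but efficiently" caveat shows up here too: if one key dominates the data, its partition does most of the work and the extra partitions add overhead without much gain.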
What is the relation between the EME, the GDE and the Co>Operating System?
What are the reasons for which business can trust the approach of data processing?
If I am loading a file of 1 million records and the graph fails after loading 10,000 records, what will happen if we use the rollback command?
How to execute a graph from the start to the end stages? And how to run a graph on a non-Ab Initio system?
How to create a project (public, private, common, client), and what are the differences between them?
What is the significance of ‘mp run’ command ?
What is the best way of creating huge test feeds?
When should we use 'jobid' for the commit table (Output Table component)?
What is the role of the Co>Operating System in Ab Initio?
Describe the process steps you would perform when defragmenting a data table. This table contains mission critical data?
How do you add default rules in transformer?
Explain Ab Initio's dependency analysis.