Explain the methods to improve the performance of a graph.
Answer Posted / radhakrishna shanbhag
1. If the input is a serial file, convert it to a multifile using Partition by Key so that downstream components can process the partitions in parallel.
2. Filter out the records the process does not need as early as possible. Fewer records means less work for every downstream component.
3. If there are joins with tables/files, use lookup files for the smaller tables/files.
For joins between bigger tables/files, use the larger file as the driver port.
4. Use an in-memory sort for joins on smaller files.
5. If you are unloading from a table, use ORDER BY in the SQL, which eliminates the need for a Sort component in the graph.
I hope this helps. Please correct me if I am wrong.
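The ideas in points 1 to 3 can be sketched outside the tool. The snippet below is plain Python, not Ab Initio DML or any Ab Initio API, and is only a rough illustration under that assumption. The function names (partition_by_key, lookup_join) and the sample records are hypothetical.

```python
def partition_by_key(records, key, n_partitions):
    """Route each record to a partition chosen by hashing its key value,
    so all records with the same key land in the same partition."""
    partitions = [[] for _ in range(n_partitions)]
    for rec in records:
        partitions[hash(rec[key]) % n_partitions].append(rec)
    return partitions


def lookup_join(driver, small, key):
    """Stream the large (driver) input past an in-memory index
    built from the small input; unmatched driver records are dropped."""
    index = {rec[key]: rec for rec in small}  # small side held in memory
    for rec in driver:
        match = index.get(rec[key])
        if match is not None:
            yield {**match, **rec}


# Hypothetical sample data: a larger "orders" input and a small "customers" input.
orders = [
    {"cust_id": 1, "amount": 10},
    {"cust_id": 2, "amount": 0},
    {"cust_id": 1, "amount": 5},
    {"cust_id": 4, "amount": 7},
]
customers = [
    {"cust_id": 1, "name": "Asha"},
    {"cust_id": 4, "name": "Dev"},
]

# Point 2: drop unwanted records before any expensive step.
needed = [rec for rec in orders if rec["amount"] > 0]

# Point 1: split the serial input by key so each partition can be processed independently.
parts = partition_by_key(needed, "cust_id", 2)

# Point 3: join each partition against the small input held in memory.
joined = [list(lookup_join(part, customers, "cust_id")) for part in parts]
```

Because partitioning is done on the join key, each partition can be joined without looking at the others, which is what allows the work to run in parallel.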