Which is more expensive, hash or modulus partitioning? When do
you use modulus partitioning?
Answer Posted / ankit gosain
Hi All,
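Hash partitioning is more expensive than modulus because it
has to compute a hash function over the key, while modulus
only takes the key value modulo the number of partitions. In
return, hash tends to spread skewed key values more evenly.
The basic difference is that you can apply modulus
partitioning only to numeric (integer) key fields, while you
can apply hash partitioning to Varchar and any other type of
field. So use modulus when the key is a single integer
column, such as an ID, and hash otherwise.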
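To make the difference concrete, here is a minimal Python sketch of the two ideas. It is not DataStage's actual implementation: the function names are made up for this example, and CRC32 is only a stand-in for whatever hash function the engine really uses.

```python
import zlib


def modulus_partition(key: int, num_partitions: int) -> int:
    """Partition number for an integer key: key modulo the partition count."""
    if not isinstance(key, int):
        raise TypeError("modulus partitioning needs an integer key")
    return key % num_partitions


def hash_partition(key, num_partitions: int) -> int:
    """Partition number for a key of any type: hash it first, then take the modulus.

    CRC32 is an assumption chosen for illustration only.
    """
    data = str(key).encode("utf-8")
    return zlib.crc32(data) % num_partitions


if __name__ == "__main__":
    # Integer keys: one arithmetic operation per row.
    for key in (10, 11, 12, 13):
        print(key, "->", modulus_partition(key, 4))

    # Keys of any type: extra work to hash the value before the modulus.
    for key in ("Designer", "Editor", 42):
        print(key, "->", hash_partition(key, 4))
```

The modulus version does a single arithmetic operation per row, which is why it is cheaper, but it rejects non-integer keys. The hash version accepts any key at the cost of hashing each value first.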
If you have more questions, you can mail me at
ankitgosain@gmail.com
Cheers,
Ankit :)
What is data partitioning?
Create a job that splits the data in the Jobs.txt file into
four output files. You will direct the data to the
different output files using constraints.
• Job name: JobLevels
• Source file: Jobs.txt
• Target file 1: LowLevelJobs.txt
− min_lvl between 0 and 25 inclusive.
− Same column types and headings as Jobs.txt.
− Include column names in the first line of the output file.
− Job description column should be preceded by the string
“Job Title:” and embedded within square brackets. For
example, if the job description is “Designer”, the derived
value is: “Job Title: [Designer]”.
• Target file 2: MidLevelJobs.txt
− min_lvl between 26 and 100 inclusive.
− Same format and derivations as Target file 1.
• Target file 3: HighLevelJobs.txt
− min_lvl between 101 and 500 inclusive.
− Same format and derivations as Target file 1.
• Rejects file: JobRejects.txt
− min_lvl is out of range, i.e., below 0 or above 500.
− This file has only two columns: job_id and reject_desc.
− reject_desc is a variable-length text field, maximum
length 100. It should contain a string of the form: “Level
out of range:
What are the functionalities of link partitioner and link collector?
Define Merge?
What is the method of removing duplicates, without the remove duplicate stage?
What is the difference between data warehousing and OLAP?
If you want to use the same piece of code in different jobs, how will you achieve this?
How many rows are sorted in the Sort stage by default in server jobs?
What are the stages in DataStage?
How do you run a DataStage job from the command line?
Differentiate between DataStage and DataStage TX?
How many types of views are there in DataStage Director?
What is the difference between a hash file and a sequential file?
What is the use of DataStage Director?
What are the functionalities of link collector?