You need to perform an analysis on a massive dataset by
groups, but are unable to sort the data due to memory
constraint. How would you accomplish the task?
Answers were Sorted based on User's Feedback
Answer / satheesh
you can use index statement,
proc sql;
create index indexname on Tablename(Variable or Columnname);
quit;
the above code is equivalent to sort.
| Is This Answer Correct ? | 5 Yes | 0 No |
Answer / rock
Before grouping the dataset first compress the dataset
using compress=yes option whicle creation of dataset like
data x(compress=yes reuse=yes);
set y;
run;
proc sql;
select --- do the grouping now
Sure that sorting will take less time. Try this once
Main trick here is we are compressing the unused space in
dataset so the size of the dataset will get reduce.
| Is This Answer Correct ? | 2 Yes | 0 No |
how can you create zero observation dataset? : Sas programming
What is Linear Regression?
In the flow of DATA step processing, what is the first action in a typical DATA Step?
what are informats in sas? : Sas-administrator
what is study design in while working with SAS? what are screening variables in SAS?
List out some key concept of SAS
what is the difference between calculating the 'mean' using the mean function and proc means? : Sas programming
How would you generate 1000 observations from a normal distribution with a mean of 50 and standard deviation of 20. How would you use PROC CHART to look at the distribution? Describe the shape of the distribution.
What other SAS features do you use for error trapping and data validation?
What is the command used to find missing values?
What is the difference between a PROC step and a DATA step?
data data1; input dt account; format dt date9.; cards; 1745 1230 1756 1120 1788 1130 1767 1240 ; data data2; input startdt enddt total; format startdt date9. enddt date9.; cards; 1657 1834 12300 1557 1758 16800 1789 1789 12300 1788 1345 12383 1899 1899 13250 ; proc sql; create table data3 as select * from data1 as x left join data2 as y on x.dt>=y.startdt and x.dt<=y.enddt; quit; Here, we are getting cartision product. But,I want left join report consisting of this program. It should not get duplicate values. you can modify the program also.