You need to perform an analysis on a massive dataset by
groups, but are unable to sort the data due to memory
constraints. How would you accomplish the task?
Answer / satheesh
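You can create an index on the grouping variable instead of sorting:
proc sql;
/* for a simple (single-column) index, the index name must match the column name */
create index columnname on tablename(columnname);
quit;
With the index in place, BY-group processing can read the data in key order, so a separate sort is not needed.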
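As a minimal sketch of how an index can stand in for a sort, the example below copies the sample table sashelp.class into WORK, indexes it on sex, and then runs a BY-group summary with no PROC SORT step. The table and variable choices (work.class, sex, height, weight) are assumptions for illustration only, not part of the original answer.

```sas
/* Illustrative copy of a sample table; it is not ordered by sex */
data work.class;
  set sashelp.class;
run;

/* Simple index on the grouping variable (index name = column name) */
proc sql;
  create index sex on work.class(sex);
quit;

/* BY-group analysis without PROC SORT: SAS can use the index
   to retrieve observations in order of the BY variable */
proc means data=work.class n mean;
  by sex;
  var height weight;
run;
```

Reading through an index is usually slower per observation than reading sorted data, so this trades run time for not having to hold a sort in memory or utility space.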
Answer / rock
Before grouping, compress the dataset using the compress=yes
option when creating it, like this:
data x(compress=yes reuse=yes);
set y;
run;
proc sql;
select --- do the grouping now
The sorting should then take less time. Try this once.
The main trick here is that compression removes the unused space in
the dataset, so the dataset gets smaller on disk.
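A complete version of this idea might look like the sketch below. The source table (sashelp.class) and the grouping and analysis columns (sex, height) are placeholders chosen for illustration; the original answer does not say which query was intended.

```sas
/* Write a compressed copy; reuse=yes lets SAS reuse freed space
   in the compressed file when observations are added later */
data work.x(compress=yes reuse=yes);
  set sashelp.class;
run;

/* Group with PROC SQL; no explicit PROC SORT step is needed */
proc sql;
  select sex,
         count(*)     as n,
         mean(height) as avg_height
  from work.x
  group by sex;
quit;
```

Compression mainly reduces disk space and I/O. Whether it helps in a given case depends on the data, and the SAS log reports how much the file size changed.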