You need to perform an analysis on a massive dataset by
groups, but are unable to sort the data due to memory
constraint. How would you accomplish the task?

Answers were Sorted based on User's Feedback



You need to perform an analysis on a massive dataset by groups, but are unable to sort the data du..

Answer / satheesh

you can use index statement,

proc sql;
create index indexname on Tablename(Variable or Columnname);
quit;

the above code is equivalent to sort.

Is This Answer Correct ?    5 Yes 0 No

You need to perform an analysis on a massive dataset by groups, but are unable to sort the data du..

Answer / rock

Before grouping the dataset first compress the dataset
using compress=yes option whicle creation of dataset like

data x(compress=yes reuse=yes);
set y;
run;
proc sql;
select --- do the grouping now

Sure that sorting will take less time. Try this once
Main trick here is we are compressing the unused space in
dataset so the size of the dataset will get reduce.

Is This Answer Correct ?    2 Yes 0 No

Post New Answer

More SAS Interview Questions

describe the interaction table in sas di? : Sas-di

0 Answers  


Explain the use of proc gplot? : sas-grid-administration

0 Answers  


What are the different operating system platforms in which we can use sas? : sas-grid-administration

0 Answers  


describe how to adjust the performance of data integrator? : Sas-di

0 Answers  


data data1; input dt account; format dt date9.; cards; 1745 1230 1756 1120 1788 1130 1767 1240 ; data data2; input startdt enddt total; format startdt date9. enddt date9.; cards; 1657 1834 12300 1557 1758 16800 1789 1789 12300 1788 1345 12383 1899 1899 13250 ; proc sql; create table data3 as select * from data1 as x left join data2 as y on x.dt>=y.startdt and x.dt<=y.enddt; quit; Here, we are getting cartision product. But,I want left join report consisting of this program. It should not get duplicate values. you can modify the program also.

1 Answers  






DATA ABC; INPUT TDATE DATE9. AVG; CARDS; 18APR2008 150.00 19APR2008 167.00 20APR2008 123.00 21APR2008 145.00 ; RUN HOW CAN I FIND THE DIFFERENCE BETWEEN AVG OF 18APR2008 ANF 21APR2008?? IF ANY ONE GETS IT PLS TRY TO POST IT.

8 Answers   Verinon Technology Solutions,


what is null hypothesis? why do you consider that?

0 Answers   Accenture, Quintiles,


how to rearrange the data as our wish by using dataset block?

4 Answers   Cognizant,


What is the maximum length of the macro variable? : sas-macro

0 Answers  


What do the sas log messages “numeric values have been converted to character” mean? What are the implications?

0 Answers  


How do you generate random samples?

5 Answers  


what is the limit of the number of the rows and columns available in the worksheet? : Sas-bi

0 Answers  


Categories