What are logistic and linear regression? How do you treat multicollinearity and heteroscedasticity in regression? Name some packages in R and Python for building regression models.
What is NoSQL? Name some examples of NoSQL databases. What is a key-value store? What is column storage? What is a document database?
What is the central limit theorem? How is a normal distribution different from a chi-square distribution?
Create a program in a language of your choice to read a text file containing various tweets. The output should be two text files: one that lists every unique word across all tweets along with the count of how often each word occurs, and a second that contains the median number of unique words per tweet.
Why does data cleaning play a vital role in analysis?
How will you cut a circular cake into 8 equal pieces?
What is the difference between cluster sampling and systematic sampling?
How can you compute correlations and covariances?
How would you create a taxonomy to identify key customer trends in unstructured data?
What is meant by selection bias?
What is data science? How would you say it is similar to or different from business analytics and business intelligence?
A stranger uses a search engine to find something, and you know nothing about the person. How would you design an algorithm to determine what the stranger is looking for just after he or she types a few characters in the search box?
What is a random forest, and how is it different from a decision tree?
What is a generating function?
What do you understand by statistical power (sensitivity), and how do you calculate it?
What is a SQL query?
How do you import data in SAS?
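For the tweet-processing question in the list above, one possible approach can be sketched in Python. This is a minimal sketch under stated assumptions, not a reference answer: it assumes one tweet per line, treats words case-insensitively, and takes "median number of unique words" to mean the median of the per-tweet unique word counts. The function names, the tokenization rule, and the file paths are all hypothetical.

```python
import re
from collections import Counter
from statistics import median

# Assumed tokenization rule: runs of letters, digits, and apostrophes.
WORD_RE = re.compile(r"[a-z0-9']+")


def tokenize(tweet):
    """Lowercase a tweet and split it into words."""
    return WORD_RE.findall(tweet.lower())


def summarize_tweets(tweets):
    """Return (word counts across all tweets, median unique words per tweet)."""
    counts = Counter()
    unique_per_tweet = []
    for tweet in tweets:
        words = tokenize(tweet)
        counts.update(words)                      # total occurrences of each word
        unique_per_tweet.append(len(set(words)))  # distinct words in this tweet
    med = median(unique_per_tweet) if unique_per_tweet else 0
    return counts, med


def process_file(in_path, counts_path, median_path):
    """Read one tweet per line and write the two output files."""
    with open(in_path, encoding="utf-8") as f:
        tweets = [line.strip() for line in f if line.strip()]
    counts, med = summarize_tweets(tweets)
    with open(counts_path, "w", encoding="utf-8") as f:
        for word in sorted(counts):
            f.write(f"{word}\t{counts[word]}\n")
    with open(median_path, "w", encoding="utf-8") as f:
        f.write(f"{med}\n")
```

A call such as `process_file("tweets.txt", "word_counts.txt", "median_unique.txt")` would produce both files in one pass; the three file names here are placeholders. For very large inputs, a streaming median would avoid holding every per-tweet count in memory.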