What is root Linear Regression analysis?
Explain while working on a data set, how do you select important variables?
Why is it mandatory to clean a data set?
What is the difference between Stack and Queue
What are the drawbacks of the linear model?
What is term Pearson’s Correlation Coefficient?
How will you test that there is increased probability of a user to stay active after 6 months given that a user has more friends now?
Why are generators used in Python?
What is meant by binomial distribution?
What is nosql? Name some examples of nosql databases. What is a key value store? What is column storage? What is a document database?
What is the central limit theorem? How is a normal distribution different from chi square distribution?
How do you check for data quality?
Explain why data cleansing is essential and which method you use to maintain clean data?
Explain k-mean?
What is the science of data in simple words?