Define naive bayes?
How would you do market basket analysis in r and python?
How to determine the number of clusters in k-means clustering algorithm?
What is f test?
Discuss normal distribution
How is data science different from data analytics?
What prior knowledge is required to become data scientist?
Define standard deviation, mean, mode and median.
How often should an algorithm be updated?
How do you treat missing values during analysis?
Which is the best suitable language among python and r for text analytics?
Explain how can you assess a good logistic model?
What is churn?
What is logistic regression? Or state an example when you have used logistic regression recently?
Given 2 vectors, how will you generate a sorted vector?