Can you define power analysis?
How do you treat missing values during analysis?
Can you explain collaborative filtering?
Which one would you prefer for text analytics python or r?
What are the major skills data scientist need?
In k-means or knn, we use euclidean distance to calculate the distance between nearest neighbors. Why not manhattan distance?
Can you explain cross-validation?
Can you explain data cleansing?
What is the difference between an analyst and a data scientist?
What do you understand by type I vs type ii error?
Can you explain supervised learning?
How do you understand by bias variance trade off?
Can you explain data munging or data wrangling?
What is difference between sas, r and python programming?
Can you define feature vector?