State some use cases where Hadoop MapReduce works well and where it does not.
Can you define data discretization?
What approach will you follow to develop the love,like, sad feature on Facebook?
How many people are using Facebook in California at 1.30 PM on Monday?
Define linear regression?
What is a univariate analysis?
How to work towards a random forest?
Explain the difference between overfitting and underfitting?
Discuss decision tree algorithm?
Explain how can you assess a good logistic model?
What are distance measures in R statistics?
What is the goal of A/B Testing?
Explain about the various time series forecasting technqiues.
How and by what methods data visualizations can be effectively used?
Can you cite some examples where a false positive is important than a false negative?