How is a mean value different from an expected value?
Can you compare the validation set with the test set?
What are interpolation and extrapolation?
What is the importance of selection bias?
How will you determine the number of clusters in a clustering algorithm?
A jar has 1000 coins, of which 999 are fair and 1 is double-headed. Pick a coin at random and toss it 10 times. Given that you see 10 heads, what is the probability that the next toss of that coin is also a head?
Explain what an autoencoder is.
Give some situations where you would use an SVM over a Random Forest machine learning algorithm, and vice versa.
List the variants of backpropagation.
Explain the difference between an analyst and a data scientist.
Explain survivorship bias.
What is linear regression in data science?
How can you assess whether a logistic model is good?
Why is data munging useful?
Explain the p-value.
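
The coin-jar question above (1000 coins, one double-headed, 10 heads observed) comes down to a short Bayes' rule calculation. Here is a minimal sketch of that arithmetic, assuming the coin is drawn uniformly at random and the tosses are independent. The function name `prob_next_head` and its parameters are made up for illustration and are not part of the question.

```python
from fractions import Fraction


def prob_next_head(n_coins=1000, n_double=1, n_heads_seen=10):
    """Probability that the next toss is a head, given n_heads_seen heads in a row.

    Assumes a uniformly random draw from the jar and independent tosses.
    """
    # Prior probability of having picked each kind of coin.
    prior_double = Fraction(n_double, n_coins)
    prior_fair = 1 - prior_double

    # Likelihood of seeing only heads with each kind of coin.
    like_double = Fraction(1)
    like_fair = Fraction(1, 2) ** n_heads_seen

    # Bayes' rule: posterior probability that the coin is double-headed.
    evidence = prior_double * like_double + prior_fair * like_fair
    post_double = prior_double * like_double / evidence
    post_fair = 1 - post_double

    # Law of total probability for the next toss.
    return post_double * 1 + post_fair * Fraction(1, 2)


answer = prob_next_head()
print(answer, float(answer))
```

With the numbers from the question, the posterior probability of holding the double-headed coin is 1024/2023, so the next toss is a head with probability 3047/4046, roughly 0.753. Exact fractions are used instead of floats so the result can be checked by hand.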