Explain the structure of artificial neural networks?
What is column storage?
What is the central limit theorem and why is it important?
What is logistic regression in data science?
What are the roles and responsibilities of a data scientist?
Why do you want to work at this company as a data scientist?
Which methods are defined for a class of iterators?
What is market basket analysis?
Pick up a coin C1 given C1+C2 with probability of trials p (h1) =.7, p (h2) =.6 and doing 10 trials. And what is the probability that the given coin you picked is C1 given you have 7 heads and 3 tails?
What assumptions does linear regression machine learning algorithm make?
What do you understand by ensemble learning?
State some use cases where Hadoop MapReduce works well and where it does not.
What is the svm algorithm?
How do you check for data quality?
How can you deal with different types of seasonality in time series modelling?