Why is Python used in data science?
A stranger uses a search engine to find something, and you know nothing about the person. How would you design an algorithm to determine what the stranger is looking for just after they type a few characters in the search box?
Can you provide an example of feature extraction?
What is an F-test?
What is a SQL query? What is the difference between a SELECT query and an UPDATE query?
Can you define A/B testing?
What would you do to summarize a Twitter feed?
What is the difference between an iterator and a generator in Python?
Can you explain data preparation?
How is data science similar to or different from business analytics and business intelligence?
What are outlier values and how do you treat them?
What is a random forest?
Can you cite some examples where a false negative is more important than a false positive?
How do you take millions of users, each with hundreds of transactions across tens of thousands of products, and group the users into meaningful segments?
While working on a data set, how do you select important variables?